RDF DATABASES – CASE STUDY AND PERFORMANCE EVALUATION

Authors:	Tony Nacional, Marko Niinimäki, Matti Heikkurinen
Year of publication:	2019
Subject:	Information retrieval; Computer science; Process (engineering); business.industry; InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL; Big data; InformationSystems_DATABASEMANAGEMENT; computer.file_format; NoSQL; computer.software_genre; Scalability; ComputingMethodologies_DOCUMENTANDTEXTPROCESSING; SPARQL; Performance improvement; RDF; business; Semantic Web; computer
Source:	MATTER: International Journal of Science and Technology. 5:01-14
ISSN:	2454-5880
DOI:	10.20319/mijst.2019.53.0114
Description:	The Resource Description Framework (RDF) data presentation model and the SPARQL query language have been at the core of semantic web technologies since the early 2000s. In this article, we evaluate three RDF storage technologies. Our motivation is to find a storage solution that can be used to process “big data” RDF sets. Our method is based on measuring query response times with large samples (hundreds of thousands of RDF documents, millions of RDF statements). We find that all of the evaluated technologies provide much better performance than querying RDF data stored in files. However, with 300 000 documents, even with the fastest technology, an aggregation query still takes more than 100 seconds in our environment. As a further performance improvement, we test the same data and queries with MongoDB and demonstrate its performance (10 seconds instead of 100) and scalability (up to 1 000 000 documents). However, despite these benefits, we must note that because of its data presentation and query limitations, MongoDB probably cannot serve as a generic storage for all kinds of RDF documents.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::eaf94d60d53e8e0ea2f712d52d018071 https://doi.org/10.20319/mijst.2019.53.0114