Towards a Scalable Semantic-Based Distributed Approach for SPARQL Query Evaluation

Autor: Gezim Sejdiu, Imran Khan, Hajira Jabeen, Jens Lehmann, Ioanna Lytra, Damien Graux
Rok vydání: 2019
Předmět:
Zdroj: Lecture Notes in Computer Science
Semantic Systems. The Power of AI and Knowledge Graphs
Semantic Systems. The Power of AI and Knowledge Graphs-15th International Conference, SEMANTiCS 2019, Karlsruhe, Germany, September 9–12, 2019, Proceedings
Lecture Notes in Computer Science ISBN: 9783030332198
SEMANTiCS
Lecture Notes in Computer Science-Semantic Systems. The Power of AI and Knowledge Graphs
ISSN: 1611-3349
0302-9743
DOI: 10.1007/978-3-030-33220-4_22
Popis: Over the last two decades, the amount of data which has been created, published and managed using Semantic Web standards and especially via Resource Description Framework (RDF) has been increasing. As a result, efficient processing of such big RDF datasets has become challenging. Indeed, these processes require, both efficient storage strategies and query-processing engines, to be able to scale in terms of data size. In this study, we propose a scalable approach to evaluate SPARQL queries over distributed RDF datasets using a semantic-based partition and is implemented inside the state-of-the-art RDF processing framework: SANSA. An evaluation of the performance of our approach in processing large-scale RDF datasets is also presented. The preliminary results of the conducted experiments show that our approach can scale horizontally and perform well as compared with the previous Hadoop-based system. It is also comparable with the in-memory SPARQL query evaluators when there is less shuffling involved.
Databáze: OpenAIRE