Optimizing Execution Plans in a Multistore

Autor: Enrico Gallinucci, Matteo Golfarelli, Chiara Forresi, Matteo Francia
Přispěvatelé: Forresi, Chiara, Francia, Matteo, Gallinucci, Enrico, Golfarelli, Matteo
Rok vydání: 2021
Předmět:
Zdroj: Advances in Databases and Information Systems ISBN: 9783030824716
ADBIS
Popis: Multistores are data management systems that enable query processing across different database management systems (DBMSs); besides the distribution of data, complexity factors like schema heterogeneity and data replication must be resolved through integration and data fusion activities. In a recent work [2], we have proposed a multistore solution that relies on a dataspace to provide the user with an integrated view of the available data and enables the formulation and execution of GPSJ (generalized projection, selection and join) queries. In this paper, we propose a technique to optimize the execution of GPSJ queries by finding the most efficient execution plan on the multistore. In particular, we devise three different strategies to carry out joins and data fusion, and we build a cost model to enable the evaluation of different execution plans. Through the experimental evaluation, we are able to profile the suitability of each strategy to different multistore configurations, thus validating our multi-strategy approach and motivating further research on this topic.
Databáze: OpenAIRE