Source Selection based on Predicate Assignment Optimization: A Novel Approach for Large Scale Mediation Systems
Autor: | Pomares, Alexandra, Roncancio, Claudia, Cung, Van-Dat, Abasolo, Jose, Villamil, María Del Pilar |
---|---|
Přispěvatelé: | Laboratoire d'Informatique de Grenoble (LIG), Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Institut National Polytechnique de Grenoble (INPG)-Centre National de la Recherche Scientifique (CNRS)-Université Pierre Mendès France - Grenoble 2 (UPMF)-Université Joseph Fourier - Grenoble 1 (UJF), Universidad de los Andes [Bogota] (UNIANDES), Pontificia Universidad Javeriana (PUJ), Recherche Opérationnelle pour les Systèmes de Production (G-SCOP_ROSP), Laboratoire des sciences pour la conception, l'optimisation et la production (G-SCOP), Université Joseph Fourier - Grenoble 1 (UJF)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Institut National Polytechnique de Grenoble (INPG)-Centre National de la Recherche Scientifique (CNRS)-Université Joseph Fourier - Grenoble 1 (UJF)-Institut polytechnique de Grenoble - Grenoble Institute of Technology (Grenoble INP )-Institut National Polytechnique de Grenoble (INPG)-Centre National de la Recherche Scientifique (CNRS), Comunicación y Tecnología de Información (COMIT) |
Jazyk: | angličtina |
Rok vydání: | 2010 |
Předmět: |
Source Selection
[INFO.INFO-DB]Computer Science [cs]/Databases [cs.DB] [INFO.INFO-IR]Computer Science [cs]/Information Retrieval [cs.IR] Combinatorial Optimization [INFO.INFO-WB]Computer Science [cs]/Web Large Scale Data Mediation [INFO.INFO-RO]Computer Science [cs]/Operations Research [cs.RO] [INFO.INFO-DC]Computer Science [cs]/Distributed Parallel and Cluster Computing [cs.DC] |
Zdroj: | Base de Données Avancées 2010 (BDA2010) Base de Données Avancées 2010 (BDA2010), Oct 2010, Toulouse, France. Will be completed |
Popis: | International audience; This paper presents OptiSource, a novel approach of source selection that reduces the number of data sources accessed during query evaluation in complex large scale distributed data contexts in virtual organizations (VO). In these contexts autonomous organizations share data about a group of domain concepts (e.g. patient, client, gene). The instances of such concepts are constructed from non-disjointed fragments provided by several local data sources. This fact, in addition to the absence of reliable statistics on source contents and the large number of sources, make current proposals unsuitable in terms of response quality and/or response time. OptiSource optimizes data source selection in query evaluation using a combinatorial optimization model to distinguish the sets of sources that maximize benefits and minimize the number of sources to contact to while satisfying resource constraints. The precision and recall of source selection during query planning is highly improved as demonstrated by the tests performed with the OptiSource prototype. Furthermore, tests with the optimization model confirmed that the approach can handle different levels of precision on the benefit prediction. |
Databáze: | OpenAIRE |
Externí odkaz: |