A Sample Advisor for Approximate Query Processing
Author: | Wolfgang Lehner, Philipp Rösch |
---|---|
Year of publication: | 2010 |
Subject: |
Sample selection; sample advisor; automatic sample selection; approximate query processing; database sampling; data warehouse systems; modern data management systems; data management; data mining; workload; computation; merge (version control); computer science; ddc:004 |
Source: | Advances in Databases and Information Systems (ADBIS), ISBN: 9783642155758 |
DOI: | 10.1007/978-3-642-15576-5_37 |
Description: | The rapid growth of current data warehouse systems makes random sampling a crucial component of modern data management systems. Although there is a large body of work on database sampling, the problem of automatic sample selection has remained (almost) unaddressed. In this paper, we tackle the problem with a sample advisor. We propose a cost model to evaluate a sample for a given query. Based on this model, our sample advisor determines the optimal set of samples for a given set of queries specified by an expert. We further propose an extension that utilizes recorded workload information. In this case, the sample advisor takes the set of queries and a given memory bound into account when computing a sample advice. Additionally, we consider merging samples in the case of overlapping sample advice and present both an exact and a heuristic solution. In our evaluation, we analyze the properties of the cost model and compare the proposed algorithms. We further demonstrate the effectiveness and efficiency of the heuristic solutions with a variety of experiments. |
Database: | OpenAIRE |
External link: |