Efficient evaluation of partially-dimensional range queries in large OLAP datasets
Autor: | Yaokai Feng, Kunihiko Kaneko, Akifumi Makinouchi |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2011 |
Předmět: |
Structure (mathematical logic)
Information retrieval OLAP R^*-tree Computer science Online analytical processing InformationSystems_INFORMATIONSTORAGEANDRETRIEVAL Multidimensional data multidimensional range queries InformationSystems_DATABASEMANAGEMENT Space (commercial competition) computer.software_genre Computer Science Applications Management Information Systems B^+-tree Index (publishing) Modeling and Simulation multidimensional index relational data Data mining computer |
Zdroj: | International Journal of Data Mining, Modelling and Management. 3(2):150-171 |
ISSN: | 1759-1163 |
Popis: | In light of the increasing requirement for processing multidimensional queries on OLAP (relational) data, the database community has focused on the queries (especially range queries) on the large OLAP datasets from the view of multidimensional data. It is well-known that multidimensional indices are helpful to improve the performance of such queries. However, we found that much information irrelevant to queries also has to be read from disk if the existing multidimensional indices are used with OLAP data, which greatly degrade the search performance. This problem comes from particularity on the actual queries exerted on OLAP data. That is, in many OLAP applications, the query conditions probably are only with partial dimensions (not all) of the whole index space. Such range queries are called partially-dimensional (PD) range queries in this study. Based on R*-tree, we propose a new index structure, called AR*-tree, to counter the actual queries on OLAP data. The results of both mathematical analysis and many experiments with different datasets indicate that the AR*-tree can clearly improve the performance of PD range queries, esp. for large OLAP datasets. |
Databáze: | OpenAIRE |
Externí odkaz: |