An OLAM Operator for Multi-Dimensional Shrink
Autor: | Matteo Golfarelli, Simone Graziani, Stefano Rizzi |
---|---|
Přispěvatelé: | Stefano Rizzi, Matteo Golfarelli, Simone Graziani |
Rok vydání: | 2015 |
Předmět: |
OLAP
Theoretical computer science HIERARCHICAL CLUSTERING Generalization Computer science Online analytical processing QUERY PERSONALIZATION Hierarchical clustering Operator (computer programming) visual OLAP Hardware and Architecture Cube Dimension (data warehouse) Greedy algorithm Algorithm pervasive BI Software Pivot table |
Zdroj: | International Journal of Data Warehousing and Mining. 11:68-97 |
ISSN: | 1548-3932 1548-3924 |
Popis: | Shrink is an OLAM (On-Line Analytical Mining) operator based on hierarchical clustering, and it has been previously proposed in mono-dimensional form to balance precision with size in the visualization of cubes via pivot tables during OLAP analyses. It can be applied to the cube resulting from a query to decrease its size while controlling the approximation introduced; the idea is to fuse similar facts together and replace them with a single representative fact, respecting the bounds posed by dimension hierarchies. In this paper the authors propose a multi-dimensional generalization of the shrink operator, where facts are fused along multiple dimensions. Multi-dimensional shrink comes in two flavors: lazy and eager, where the bounds posed by hierarchies are respectively weaker and stricter. Greedy algorithms based on agglomerative clustering are presented for both lazy and eager shrink, and experimentally evaluated in terms of efficiency and effectiveness. |
Databáze: | OpenAIRE |
Externí odkaz: |