Random access with a distributed Bitmap Join Index for Star Joins

Autor: Jaqueline J. Brito, Thiago Mosqueiro, Ricardo R. Ciferri, Cristina D.A. Ciferri
Jazyk: angličtina
Rok vydání: 2020
Předmět:
Zdroj: Heliyon, Vol 6, Iss 2, Pp e03342- (2020)
Druh dokumentu: article
ISSN: 2405-8440
DOI: 10.1016/j.heliyon.2020.e03342
Popis: Indices improve the performance of relational databases, especially on queries that return a small portion of the data (i.e., low-selectivity queries). Star joins are particularly expensive operations that commonly rely on indices for improved performance at scale. The development and support of index-based solutions for Star Joins are still at very early stages. To address this gap, we propose a distributed Bitmap Join Index (dBJI) and a framework-agnostic strategy to solve join predicates in linear time. For empirical analysis, we used common Hadoop technologies (e.g., HBase and Spark) to show that dBJI significantly outperforms full scan approaches by a factor between 59% and 88% in queries with low selectivity from the Star Schema Benchmark (SSB). Thus, distributed indices may significantly enhance low-selectivity query performance even in very large databases.
Databáze: Directory of Open Access Journals