The Time Machine in Columnar NoSQL Databases: The Case of Apache HBase

Authors: Chia-Ping Tsai, Che-Wei Chang, Hung-Chang Hsiao, Haiying Shen
Language: English
Year of publication: 2022
Subject:
Source: Future Internet, Vol 14, Iss 3, p 92 (2022)
Document type: article
ISSN: 1999-5903
DOI: 10.3390/fi14030092
Description: Not Only SQL (NoSQL) is a critical technology that is scalable and provides flexible schemas, thereby complementing existing relational database technologies. Although NoSQL is flourishing, present solutions lack the features that enterprises require for mission-critical use. In this paper, we explore solutions to the data recovery issue in NoSQL. Data recovery for any database table entails restoring the table to a prior state or replaying (insert/update) operations over the table for a given time period in the past. Recovery of NoSQL database tables enables applications such as failure recovery, analysis of historical data, debugging, and auditing. In particular, our study focuses on columnar NoSQL databases. We propose and evaluate two solutions to the data recovery problem in columnar NoSQL and implement them on Apache HBase, a popular NoSQL database in the Hadoop ecosystem that is widely adopted across industries. Our implementations are extensively benchmarked with an industrial NoSQL benchmark in real environments.
Database: Directory of Open Access Journals