State of the art in MapReduce

Autor: Labdaoui Imane, Tabii Youness
Rok vydání: 2017
Předmět:
Zdroj: BDCA
DOI: 10.1145/3090354.3090397
Popis: In the last years, new data sources appeared: social networks, mobile, internet of things, open Data, etc., and therefore data are rapidly increasing. These data is voluminous, various, and difficult to measure and analyze, which appears the concept of Big Data. The vast amount of data makes the ETL (Extract-Transform-Load) process heavy in data warehousing, renders the data mining process more complex, and makes the slow loading of data in database management systems. The solution to make these process more efficient is the use of parallelization technologies, many researchers opt for the use of MapReduce paradigm for its flexibility and powerful. In this paper, we provide an overview of state of the art in MapReduce research and we present its various axis.
Databáze: OpenAIRE