An Overview of Hadoop MapReduce, Spark, and Scalable Graph Processing Architecture

Autor: Karishma P. Talan, Kartik U. Sharma, Pooja P. Talan, Pratiksha P. Nawade
Rok vydání: 2018
Předmět:
Zdroj: Advances in Intelligent Systems and Computing ISBN: 9789811312793
DOI: 10.1007/978-981-13-1280-9_3
Popis: In today’s technology era, Big Data has become a buzzword. Various frameworks are available in order to process this Big Data. Both Hadoop and Spark are open source framework to process Big Data. Hadoop provides batch processing while Spark supports both batch as well as stream processing, i.e., it is a hybrid processing framework. Both frameworks have their own advantages and drawback. The contribution of this paper is to provide a comparative analysis of Hadoop MapReduce and Apache Spark. In this paper, we also propose a scalable graph processing architecture that could be used to overcome traditional limitations of Hadoop framework.
Databáze: OpenAIRE