An Overview of Hadoop MapReduce, Spark, and Scalable Graph Processing Architecture

Autor:	Karishma P. Talan, Kartik U. Sharma, Pooja P. Talan, Pratiksha P. Nawade
Rok vydání:	2018
Předmět:	Computer science business.industry Distributed computing Big data 020206 networking & telecommunications 02 engineering and technology Software_PROGRAMMINGTECHNIQUES Stream processing Open source Scalability Data_FILES 0202 electrical engineering electronic engineering information engineering Batch processing Graph (abstract data type) 020201 artificial intelligence & image processing InformationSystems_MISCELLANEOUS Architecture business Drawback
Zdroj:	Advances in Intelligent Systems and Computing ISBN: 9789811312793
DOI:	10.1007/978-981-13-1280-9_3
Popis:	In today’s technology era, Big Data has become a buzzword. Various frameworks are available in order to process this Big Data. Both Hadoop and Spark are open source framework to process Big Data. Hadoop provides batch processing while Spark supports both batch as well as stream processing, i.e., it is a hybrid processing framework. Both frameworks have their own advantages and drawback. The contribution of this paper is to provide a comparative analysis of Hadoop MapReduce and Apache Spark. In this paper, we also propose a scalable graph processing architecture that could be used to overcome traditional limitations of Hadoop framework.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::dfdf018effaa05282c39f0c733ef7970 https://doi.org/10.1007/978-981-13-1280-9_3 Zobrazit plný text záznamu