Massively parallel data analysis with PACTs on Nephele

Autor: Stephan Ewen, Max Heimel, Volker Markl, Dominic Battré, Odej Kao, Daniel Warneke, Fabian Hueske, Erik Nijkamp, Alexander Alexandrov
Rok vydání: 2010
Předmět:
Zdroj: Proceedings of the VLDB Endowment. 3:1625-1628
ISSN: 2150-8097
DOI: 10.14778/1920841.1921056
Popis: Large-scale data analysis applications require processing and analyzing of Terabytes or even Petabytes of data, particularly in the areas of web analysis or scientific data management. This trend has been discussed as "web-scale data management" in a panel at VLDB 2009. Formerly, parallel data processing was the domain of parallel database systems. Today, novel requirements like scaling out to thousands of machines, improved fault-tolerance, and schema free processing have made a case for new approaches.
Databáze: OpenAIRE