Making data platforms smarter with MOSES

Autor: Matteo Golfarelli, Enrico Gallinucci, Stefano Rizzi, Nicola Santolini, Anna Giulia Leoni, Matteo Francia
Přispěvatelé: Matteo Francia, Enrico Gallinucci, Matteo Golfarelli, Anna Giulia Leoni, Stefano Rizzi, Nicola Santolini
Rok vydání: 2021
Předmět:
Zdroj: Future Generation Computer Systems. 125:299-313
ISSN: 0167-739X
Popis: The rise of data platforms has enabled the collection and processing of huge volumes of data, but has opened to the risk of losing their control. Collecting proper metadata about raw data and transformations can significantly reduce this risk. In this paper we propose MOSES, a technology-agnostic, extensible, and customizable framework for metadata handling in big data platforms. The framework hinges on a metadata repository that stores information about the objects in the big data platform and the processes that transform them. MOSES provides a wide range of functionalities to different types of users of the platform. Differently from previous high-level proposals, MOSES is fully implemented and it was not conceived for a specific technology. Besides discussing the rationale and the features of MOSES, in this paper we describe its implementation and we test it on a real case study. The ultimate goal is to take a significant step forward towards proving that metadata handling in big data platforms is feasible and beneficial.
Databáze: OpenAIRE