Destination Earth Data Lake

Authors: Jordi Duatis Juarez, Michael Schick, Danaele Puechmaille, Miruna Stoicescu, Borys Saulyak
Year of publication: 2023
Description: Destination Earth is an operational service led by the European Commission and implemented jointly by ESA, ECMWF and EUMETSAT. The presentation will provide insights into the EUMETSAT Data Lake Service component of the Destination Earth undertaking.

The objective of the European Commission’s Destination Earth (DestinE) initiative is to deploy several highly accurate digital replicas of the Earth (Digital Twins) in order to monitor and simulate natural and human activities and their interactions, and to develop and test “what-if” scenarios that would enable more sustainable development and support European environmental policies. DestinE addresses the challenge of managing and making accessible the sheer volume of data generated by the Digital Twins, together with observation data located at external sites such as the ones depicted in the figure below. This data will be made available fast enough, and in a format ready, to support the analysis scenarios proposed by DestinE service users.

Figure: DestinE Data Sources (green) and Stakeholders (orange)

The “DestinE Data Lake” (DEDL) is one of the three Destination Earth components. It interacts with the other two: the Digital Twin Engine (DTE), which runs the simulation models and is under ECMWF responsibility, and the DestinE Core Service Platform (DESP), which is the user entry point to the DestinE services and data and is under ESA responsibility.

The DEDL fulfils the storage and access requirements for any data that is offered to DestinE users. It provides users with seamless access to the datasets, regardless of data type and location. Furthermore, the DEDL supports big data processing services, such as near-data processing, to maximize throughput and service scalability. Among other sources, the data lake builds on existing data lakes such as those of Copernicus DIAS, ESA, EUMETSAT and ECMWF, as well as complementary data from diverse sources such as federated data spaces and in-situ or socio-economic data.

The DT Data Warehouse is a sub-component of the DEDL that stores relevant subsets of the output from each digital twin (DT) execution, powered by ECMWF's Hyper-Cube service.
Database: OpenAIRE