A Documentation Scheme for Improved Traceability of Derivative Databases and Reproducible Data Preprocessing

Authors: Jonas Hinker, Johanna M.A. Myzik, Chris Kittl
Year of publication: 2018
Subject:
Source: ISGT Europe
DOI: 10.1109/isgteurope.2018.8571680
Description: As the complexity of models for simulation and optimization of energy systems steadily increases, so do the amount and quality of data needed to parametrize these models. Unfortunately, the corresponding input data often lack documentation and traceability. Identified reasons include the high diversity in the shape and quality of data as well as the absence of an established habit of assigning unique identifiers. As the credibility of future energy system studies relies on data integrity and clear documentation, this paper systematically analyzes typical tasks in the preprocessing of databases. It then develops suggestions for documenting this stage of necessary data manipulation, covering referenceability, versioning, and visualization of the evolution of databases. General usage of unique identifiers is proposed for software and different types of data, and a best practice for documentation is suggested by introducing a documentation scheme.
Database: OpenAIRE