Standardised Versioning of Datasets: a FAIR-compliant Proposal.
Autor: | González-Cebrián A; Cloud Competency Centre, National College of Ireland, Dublin, Ireland. alba.gonzalez-cebrian@ncirl.ie., Bradford M; Cloud Competency Centre, National College of Ireland, Dublin, Ireland., Chis AE; Cloud Competency Centre, National College of Ireland, Dublin, Ireland., González-Vélez H; Cloud Competency Centre, National College of Ireland, Dublin, Ireland. |
---|---|
Jazyk: | angličtina |
Zdroj: | Scientific data [Sci Data] 2024 Apr 09; Vol. 11 (1), pp. 358. Date of Electronic Publication: 2024 Apr 09. |
DOI: | 10.1038/s41597-024-03153-y |
Abstrakt: | This paper presents a standardised dataset versioning framework for improved reusability, recognition and data version tracking, facilitating comparisons and informed decision-making for data usability and workflow integration. The framework adopts a software engineering-like data versioning nomenclature ("major.minor.patch") and incorporates data schema principles to promote reproducibility and collaboration. To quantify changes in statistical properties over time, the concept of data drift metrics (d) is introduced. Three metrics (d (© 2024. The Author(s).) |
Databáze: | MEDLINE |
Externí odkaz: |