A study on formalizing the knowledge of data curation activities across different fields.

Autor: Minamiyama Y; Research Center for Open Science and Data Platform, National Institute of Informatics, Chiyoda-City, Tokyo, Japan., Takeda H; Principles of Informatics Research Division, National Institute of Informatics, Chiyoda-City, Tokyo, Japan., Hayashi M; Research Center for Open Science and Data Platform, National Institute of Informatics, Chiyoda-City, Tokyo, Japan., Asaoka M; Research Center for Open Science and Data Platform, National Institute of Informatics, Chiyoda-City, Tokyo, Japan., Yamaji K; Research Center for Open Science and Data Platform, National Institute of Informatics, Chiyoda-City, Tokyo, Japan.
Jazyk: angličtina
Zdroj: PloS one [PLoS One] 2024 Apr 25; Vol. 19 (4), pp. e0301772. Date of Electronic Publication: 2024 Apr 25 (Print Publication: 2024).
DOI: 10.1371/journal.pone.0301772
Abstrakt: In recent years, with the trend of open science, there have been many efforts to share research data on the internet. To promote research data sharing, data curation is essential to make the data interpretable and reusable. In research fields such as life sciences, earth sciences, and social sciences, tasks and procedures have been already developed to implement efficient data curation to meet the needs and customs of individual research fields. However, not only data sharing within research fields but also interdisciplinary data sharing is required to promote open science. For this purpose, knowledge of data curation across the research fields is surveyed, analyzed, and organized as an ontology in this paper. As the survey, existing vocabularies and procedures are collected and compared as well as interviews with the data curators in research institutes in different fields are conducted to clarify commonalities and differences in data curation across the research fields. It turned out that the granularity of tasks and procedures that constitute the building blocks of data curation is not formalized. Without a method to overcome this gap, it will be challenging to promote interdisciplinary reuse of research data. Based on the analysis above, the ontology for the data curation process is proposed to describe data curation processes in different fields universally. It is described by OWL and shown as valid and consistent from the logical viewpoint. The ontology successfully represents data curation activities as the processes in the different fields acquired by the interviews. It is also helpful to identify the functions of the systems to support the data curation process. This study contributes to building a knowledge framework for an interdisciplinary understanding of data curation activities in different fields.
Competing Interests: The authors have declared that no competing interests exist.
(Copyright: © 2024 Minamiyama et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.)
Databáze: MEDLINE
Nepřihlášeným uživatelům se plný text nezobrazuje