Seek and you may (not) find: A multi-institutional analysis of where research data are shared.
Autor: | Johnston LR; Data, Academic Planning & Institutional Research, University of Wisconsin-Madison, Madison, Wisconsin, United States of America., Hofelich Mohr A; Liberal Arts Technologies and Innovation Services, University of Minnesota, Minneapolis, Minnesota, United States of America., Herndon J; Center for Data and Visualization Sciences, Duke University Libraries, Duke University, Durham, North Carolina, United States of America., Taylor S; Association of Research Libraries, Washington, D.C., United States of America., Carlson JR; University at Buffalo Libraries, University at Buffalo, Buffalo, New York, United States of America., Ge L; Milken Institute School of Public Health, George Washington University, Washington, D.C., United States of America., Moore J; University Libraries, Washington University in St. Louis, St. Louis, Missouri, United States of America., Petters J; Data Services, University Libraries, Virginia Tech, Blacksburg, Virginia, United States of America., Kozlowski W; Research Data and Open Scholarship, Cornell University Library, Cornell University, Ithaca, New York, United States of America., Hudson Vitale C; Association of Research Libraries, Washington, DC., United States of America. |
---|---|
Jazyk: | angličtina |
Zdroj: | PloS one [PLoS One] 2024 Apr 25; Vol. 19 (4), pp. e0302426. Date of Electronic Publication: 2024 Apr 25 (Print Publication: 2024). |
DOI: | 10.1371/journal.pone.0302426 |
Abstrakt: | Research data sharing has become an expected component of scientific research and scholarly publishing practice over the last few decades, due in part to requirements for federally funded research. As part of a larger effort to better understand the workflows and costs of public access to research data, this project conducted a high-level analysis of where academic research data is most frequently shared. To do this, we leveraged the DataCite and Crossref application programming interfaces (APIs) in search of Publisher field elements demonstrating which data repositories were utilized by researchers from six academic research institutions between 2012-2022. In addition, we also ran a preliminary analysis of the quality of the metadata associated with these published datasets, comparing the extent to which information was missing from metadata fields deemed important for public access to research data. Results show that the top 10 publishers accounted for 89.0% to 99.8% of the datasets connected with the institutions in our study. Known data repositories, including institutional data repositories hosted by those institutions, were initially lacking from our sample due to varying metadata standards and practices. We conclude that the metadata quality landscape for published research datasets is uneven; key information, such as author affiliation, is often incomplete or missing from source data repositories and aggregators. To enhance the findability, interoperability, accessibility, and reusability (FAIRness) of research data, we provide a set of concrete recommendations that repositories and data authors can take to improve scholarly metadata associated with shared datasets. Competing Interests: Institutions involved in this study maintain paid memberships in either the CrossRef or DataCite data sharing services. This does not alter our adherence to PLOS ONE policies on sharing data and materials. (Copyright: © 2024 Johnston et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.) |
Databáze: | MEDLINE |
Externí odkaz: | |
Nepřihlášeným uživatelům se plný text nezobrazuje | K zobrazení výsledku je třeba se přihlásit. |