Angling for data: making biodiversity metadata more FAIR

Autor:	Philipson, Joakim
Přispěvatelé:	525 Stockholms universitetsbibliotek
Rok vydání:	2017
Předmět:	validation Data element FAIR principles Computer science Metadata standard metadata Findability General Medicine Information repository Metadata repository World Wide Web Metadata Data quality Darwin Core biodiversity
Zdroj:	Biodiversity Information Science and Standards 1: e20267
Popis:	Slightly revised (longer) PDF-version of presentation given at TDWG2017, Oct. 3, Ottawa. Abstract, published in: Biodiversity Information Science and Standards 1: e20267 (14 Aug 2017) https://doi.org/10.3897/tdwgproceedings.1.20267 The FAIR guiding principles, first launched in 2014, for making research data more Findable, Accesible, Interoperable and Re-usable, have not yet been widely implemented for biodiversity data. Partly this may be due to the FAIR principles by themselves not yet being fully operational and easy to interpret. There is work in progress to remedy this by different task groups, and different attempts have already been made. In this paper I will give some concrete tips aimed at implementing the FAIR principles for biodiversity research data, focusing on the metadata, in order to enhance the quality of data by making them more findable, accessible, interoperable and reusable. Among the steps that could be taken to make biodiversity database records more findable and accessible is for example to add schema.org markup to the html sourcecode of corresponding web pages, as has been successfully employed in the Uniprot database. Recently biocaddie.org has mapped the metadata format DATS, Data Tag Suite, to schema.org and there is also the ongoing adaptation effort of bioschemas.org. In addition, there is the highly commendable work done by former biosharing.org, which now has become the more general fairsharing.organd which aims to enhance findability, promote the adoption of metadata standards by policy makers and interlink metadata standards among themselves and with repositories (Sansone 2017). Further, to make biodiversity records more interoperable and reusable, it is essential to provide metadata export to a selection of general standards and formats. In doing this, promises should be kept, meaning that exported metadata records should also validate against the schemas for the chosen format standard. By validating against schemas of both preferred metadata standard and export formats, biodiversity data records also stand a better chance of achievingwhat has been defined by GBIF and Vertnet as Fitness-for-use, encompassing e.g. accessibility, content, completeness, dataset-level or record level, error correction etc. (Russell 2011). That is, of course, provided the relevant metadata standards have validation schemas or online tools such as the Darwin Core Archive/EML validator that are sufficiently precise to check for these properties. If not, there is always the possibility of creating tailormade validation schemas serving the data quality needs of a specialized biodiversity data repository, e.g. using Schematron or JSON schema.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::005f00d4d905a8581c16a77d5ef2c9cd https://zenodo.org/record/1140516 Zobrazit plný text záznamu