Ontology-Based Querying with Bio2RDF's Linked Open Data
Autor: | Alison Callahan, Michel Dumontier, Jose Cruz-Toledo |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2013 |
Předmět: |
0303 health sciences
Biological data 020205 medical informatics Computer Networks and Communications Computer science Health Informatics 02 engineering and technology computer.file_format Linked data Ontology (information science) Data science Basic Formal Ontology Computer Science Applications Variety (cybernetics) 03 medical and health sciences Bio2RDF Proceedings 0202 electrical engineering electronic engineering information engineering Journal Article RDF Semantic Web computer 030304 developmental biology Information Systems |
Zdroj: | Journal of biomedical semantics, 4 Suppl 1. BioMed Central Ltd Journal of Biomedical Semantics |
ISSN: | 2041-1480 |
DOI: | 10.1186/2041-1480-4-s1-s1 |
Popis: | Background A key activity for life scientists in this post “-omics” age involves searching for and integrating biological data from a multitude of independent databases. However, our ability to find relevant data is hampered by non-standard web and database interfaces backed by an enormous variety of data formats. This heterogeneity presents an overwhelming barrier to the discovery and reuse of resources which have been developed at great public expense.To address this issue, the open-source Bio2RDF project promotes a simple convention to integrate diverse biological data using Semantic Web technologies. However, querying Bio2RDF remains difficult due to the lack of uniformity in the representation of Bio2RDF datasets. Results We describe an update to Bio2RDF that includes tighter integration across 19 new and updated RDF datasets. All available open-source scripts were first consolidated to a single GitHub repository and then redeveloped using a common API that generates normalized IRIs using a centralized dataset registry. We then mapped dataset specific types and relations to the Semanticscience Integrated Ontology (SIO) and demonstrate simplified federated queries across multiple Bio2RDF endpoints. Conclusions This coordinated release marks an important milestone for the Bio2RDF open source linked data framework. Principally, it improves the quality of linked data in the Bio2RDF network and makes it easier to access or recreate the linked data locally. We hope to continue improving the Bio2RDF network of linked data by identifying priority databases and increasing the vocabulary coverage to additional dataset vocabularies beyond SIO. |
Databáze: | OpenAIRE |
Externí odkaz: |