Mapping data elements to terminological resources for integrating biomedical data sources

Autor: Olivier Bodenreider, Anita Burgun, Fleur Mougin
Přispěvatelé: Modélisation Conceptuelle des Connaissances Biomédicales, Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES), National Library of Medicine (NLM), National Institutes of Health [Bethesda] (NIH)-National Center for Biotechnology Information (NCBI), Université de Rennes (UR)
Jazyk: angličtina
Rok vydání: 2006
Předmět:
MESH: Terminology as Topic
020205 medical informatics
Databases
Factual

Computer science
Abstracting and Indexing
Information Storage and Retrieval
MESH: Algorithms
02 engineering and technology
Data_CODINGANDINFORMATIONTHEORY
computer.software_genre
Biochemistry
MESH: Semantics
[INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI]
MESH: Natural Language Processing
MESH: Software
Software
Biomedical data
Structural Biology
Artificial Intelligence
020204 information systems
Schema (psychology)
Terminology as Topic
Controlled vocabulary
0202 electrical engineering
electronic engineering
information engineering

MESH: Artificial Intelligence
Molecular Biology
Natural Language Processing
MESH: Abstracting and Indexing
Information retrieval
business.industry
Applied Mathematics
Ontology-based data integration
MESH: Information Storage and Retrieval
MESH: Vocabulary
Controlled

MESH: Databases
Factual

Data mapping
Computer Science Applications
Semantics
Proceedings
Vocabulary
Controlled

Artificial intelligence
Periodicals as Topic
business
computer
Natural language processing
Algorithms
Data integration
MESH: Periodicals as Topic
Zdroj: BMC Bioinformatics
BMC Bioinformatics, BioMed Central, 2006, 7 (S3), pp.S6. ⟨10.1186/1471-2105-7-S3-S6⟩
BMC Bioinformatics, 2006, 7 (S3), pp.S6. ⟨10.1186/1471-2105-7-S3-S6⟩
ISSN: 1471-2105
DOI: 10.1186/1471-2105-7-S3-S6⟩
Popis: Background Data integration is a crucial task in the biomedical domain and integrating data sources is one approach to integrating data. Data elements (DEs) in particular play an important role in data integration. We combine schema- and instance-based approaches to mapping DEs to terminological resources in order to facilitate data sources integration. Methods We extracted DEs from eleven disparate biomedical sources. We compared these DEs to concepts and/or terms in biomedical controlled vocabularies and to reference DEs. We also exploited DE values to disambiguate underspecified DEs and to identify additional mappings. Results 82.5% of the 474 DEs studied are mapped to entries of a terminological resource and 74.7% of the whole set can be associated with reference DEs. Only 6.6% of the DEs had values that could be semantically typed. Conclusion Our study suggests that the integration of biomedical sources can be achieved automatically with limited precision and largely facilitated by mapping DEs to terminological resources.
Databáze: OpenAIRE