Mapping data elements to terminological resources for integrating biomedical data sources
Autor: | Olivier Bodenreider, Anita Burgun, Fleur Mougin |
---|---|
Přispěvatelé: | Modélisation Conceptuelle des Connaissances Biomédicales, Université de Rennes 1 (UR1), Université de Rennes (UNIV-RENNES)-Université de Rennes (UNIV-RENNES), National Library of Medicine (NLM), National Institutes of Health [Bethesda] (NIH)-National Center for Biotechnology Information (NCBI), Université de Rennes (UR) |
Jazyk: | angličtina |
Rok vydání: | 2006 |
Předmět: |
MESH: Terminology as Topic
020205 medical informatics Databases Factual Computer science Abstracting and Indexing Information Storage and Retrieval MESH: Algorithms 02 engineering and technology Data_CODINGANDINFORMATIONTHEORY computer.software_genre Biochemistry MESH: Semantics [INFO.INFO-AI]Computer Science [cs]/Artificial Intelligence [cs.AI] MESH: Natural Language Processing MESH: Software Software Biomedical data Structural Biology Artificial Intelligence 020204 information systems Schema (psychology) Terminology as Topic Controlled vocabulary 0202 electrical engineering electronic engineering information engineering MESH: Artificial Intelligence Molecular Biology Natural Language Processing MESH: Abstracting and Indexing Information retrieval business.industry Applied Mathematics Ontology-based data integration MESH: Information Storage and Retrieval MESH: Vocabulary Controlled MESH: Databases Factual Data mapping Computer Science Applications Semantics Proceedings Vocabulary Controlled Artificial intelligence Periodicals as Topic business computer Natural language processing Algorithms Data integration MESH: Periodicals as Topic |
Zdroj: | BMC Bioinformatics BMC Bioinformatics, BioMed Central, 2006, 7 (S3), pp.S6. ⟨10.1186/1471-2105-7-S3-S6⟩ BMC Bioinformatics, 2006, 7 (S3), pp.S6. ⟨10.1186/1471-2105-7-S3-S6⟩ |
ISSN: | 1471-2105 |
DOI: | 10.1186/1471-2105-7-S3-S6⟩ |
Popis: | Background Data integration is a crucial task in the biomedical domain and integrating data sources is one approach to integrating data. Data elements (DEs) in particular play an important role in data integration. We combine schema- and instance-based approaches to mapping DEs to terminological resources in order to facilitate data sources integration. Methods We extracted DEs from eleven disparate biomedical sources. We compared these DEs to concepts and/or terms in biomedical controlled vocabularies and to reference DEs. We also exploited DE values to disambiguate underspecified DEs and to identify additional mappings. Results 82.5% of the 474 DEs studied are mapped to entries of a terminological resource and 74.7% of the whole set can be associated with reference DEs. Only 6.6% of the DEs had values that could be semantically typed. Conclusion Our study suggests that the integration of biomedical sources can be achieved automatically with limited precision and largely facilitated by mapping DEs to terminological resources. |
Databáze: | OpenAIRE |
Externí odkaz: |