Logic-based assessment of the compatibility of UMLS ontology sources
Authors: | Rafael Berlanga, Bernardo Cuenca Grau, Ernesto Jiménez-Ruiz, Ian Horrocks |
---|---|
Publication year: | 2011 |
Subject: |
QA75
PubMed; Information retrieval; UMLS metathesaurus; Computer Networks and Communications; Computer science; ClinicalTrials.gov; Research; Unified Medical Language System; Ontology sources; Health Informatics; Audit; Ontology (information science); Logical consequence; Computer Science Applications; Medical thesauri; Encapçalaments de material--Medicina--Informàtica; Subject headings--Medicine--Data processing; Compatibility (mechanics); Empirical evidence; Information Systems |
Source: | Journal of Biomedical Semantics; Repositori Universitat Jaume I; Universitat Jaume I |
ISSN: | 2041-1480 |
Description: | This article is part of the supplement: Semantic Web Applications and Tools for Life Sciences (SWAT4LS), 2009. Background: The UMLS Metathesaurus (UMLS-Meta) is currently the most comprehensive effort for integrating independently developed medical thesauri and ontologies. UMLS-Meta is used in many applications, including PubMed and ClinicalTrials.gov. The integration of new sources combines automatic techniques, expert assessment, and auditing protocols. The automatic techniques currently in use, however, are mostly based on lexical algorithms and often disregard the semantics of the sources being integrated. Results: In this paper, we argue that UMLS-Meta's current design and auditing methodologies could be significantly enhanced by taking into account the logic-based semantics of the ontology sources. We provide empirical evidence suggesting that UMLS-Meta in its 2009AA version contains a significant number of errors; these errors become immediately apparent if the rich semantics of the ontology sources is taken into account, manifesting themselves as unintended logical consequences that follow from the ontology sources together with the information in UMLS-Meta. We then propose general principles and specific logic-based techniques to effectively detect and repair such errors. Conclusions: Our results suggest that the methodologies employed in the design of UMLS-Meta are not only very costly in terms of human effort, but also error-prone. The techniques presented here can be useful both for reducing human effort in the design and maintenance of UMLS-Meta and for improving the quality of its contents. |
Database: | OpenAIRE |