Detecting concept relations in clinical text: Insights from a state-of-the-art model

Autor: Berry de Bruijn, Xiaodan Zhu, Svetlana Kiritchenko, Colin Cherry, Joel Martin
Rok vydání: 2013
Předmět:
Text mining
Databases
Factual

Computer science
Electronic health record
Health Informatics
02 engineering and technology
data extraction
computer.software_genre
State of the art
03 medical and health sciences
0302 clinical medicine
Artificial Intelligence
Factor (programming language)
0202 electrical engineering
electronic engineering
information engineering

Knowledge sources
Real-world scenario
Data Mining
Electronic Health Records
Humans
030212 general & internal medicine
Medical problems
computer.programming_language
Natural Language Processing
accuracy
business.industry
Medical concepts
Natural language processing systems
Variety (cybernetics)
Feature design
Semantics
Computer Science Applications
kernel method
020201 artificial intelligence & image processing
Artificial intelligence
State (computer science)
Semantic relations
business
Construct (philosophy)
computer
Natural language processing
Algorithms
Zdroj: Journal of Biomedical Informatics. 46(2):275-285
ISSN: 1532-0464
DOI: 10.1016/j.jbi.2012.11.006
Popis: This paper addresses an information-extraction problem that aims to identify semantic relations among medical concepts (problems, tests, and treatments) in clinical text. The objectives of the paper are twofold. First, we extend an earlier one-page description (appearing as a part of [5]) of a top-ranked model in the 2010 I2B2 NLP Challenge to a necessary level of details, with the belief that feature design is the most crucial factor to the success of our system and hence deserves a more detailed discussion. We present a precise quantification of the contributions of a wide variety of knowledge sources. In addition, we show the end-to-end results obtained on the noisy output of a top-ranked concept detector, which could help construct a more complete view of the state of the art in the real-world scenario. As the second major objective, we reformulate our models into a composite-kernel framework and present the best result, according to our knowledge, on the same dataset.
Databáze: OpenAIRE