À la Recherche du Temps Perdu: extracting temporal relations from medical text in the 2012 i2b2 NLP challenge

Autor:	Colin Cherry, Berry de Bruijn, Joel Martin, Xiaodan Zhu
Jazyk:	angličtina
Rok vydání:	2013
Předmět:	Patient Discharge Summaries Post hoc Computer science temporal analysis relation extraction Information Storage and Retrieval Health Informatics 02 engineering and technology data extraction computer.software_genre medical record Research and Applications Task (project management) Time Translational Research Biomedical 03 medical and health sciences 0302 clinical medicine Artificial Intelligence 020204 information systems 0202 electrical engineering electronic engineering information engineering Hospital discharge Electronic Health Records Humans 030212 general & internal medicine information extraction natural language processing semantics Recall accuracy temporal reasoning business.industry medical specialist prediction Relationship extraction clinical text hospital discharge Information extraction machine learning Artificial intelligence F1 score business computer Natural language processing
Zdroj:	Journal of the American Medical Informatics Association : JAMIA
ISSN:	1527-974X 1067-5027
Popis:	Objective: An analysis of the timing of events is critical for a deeper understanding of the course of events within a patient record. The 2012 i2b2 NLP challenge focused on the extraction of temporal relationships between concepts within textual hospital discharge summaries. Materials and methods: The team from the National Research Council Canada (NRC) submitted three system runs to the second track of the challenge: typifying the time-relationship between pre-annotated entities. The NRC system was designed around four specialist modules containing statistical machine learning classifiers. Each specialist targeted distinct sets of relationships: local relationships, 'sectime'-type relationships, non-local overlap-type relationships, and non-local causal relationships. Results: The best NRC submission achieved a precision of 0.7499, a recall of 0.6431, and an F1 score of 0.6924, resulting in a statistical tie for first place. Post hoc improvements led to a precision of 0.7537, a recall of 0.6455, and an F1 score of 0.6954, giving the highest scores reported on this task to date. Discussion and conclusions: Methods for general relation extraction extended well to temporal relations, and gave top-ranked state-of-the-art results. Careful ordering of predictions within result sets proved critical to this success.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::91df7a7e479f8efeb31e9590236f4fea http://europepmc.org/articles/PMC3756270 Zobrazit plný text záznamu