Applicability of Machine Learning Methods to Multi-label Medical Text Classification
Autor: | Georgy Kopanitsa, Iuliia Lenivtceva, Evgenia Slasten, Mariya Kashina |
---|---|
Rok vydání: | 2020 |
Předmět: |
020205 medical informatics
Computer science business.industry 02 engineering and technology computer.software_genre Machine learning Task (project management) Information extraction 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Artificial intelligence Classifier chains business computer |
Zdroj: | Lecture Notes in Computer Science ISBN: 9783030504229 ICCS (4) |
Popis: | Structuring medical text using international standards allows to improve interoperability and quality of predictive modelling. Medical text classification task facilitates information extraction. In this work we investigate the applicability of several machine learning models and classifier chains (CC) to medical unstructured text classification. The experimental study was performed on a corpus of 11671 manually labeled Russian medical notes. The results showed that using CC strategy allows to improve classification performance. Ensemble of classifier chains based on linear SVC showed the best result: 0.924 micro F-measure, 0.872 micro precision and 0.927 micro recall. |
Databáze: | OpenAIRE |
Externí odkaz: |