Multi-label Classification for Clinical Text with Feature-level Attention
Authors: | Meng Ma, Xizi Zheng, Ying Zhou, Disheng Pan, Ping Wang, Li Yang, Mengya Li, Weijie Liu |
---|---|
Year of publication: | 2020 |
Subject: | Multi-label classification; Feature engineering; Health sciences; Medical informatics; Plain text; Computer science; Deep learning; Feature extraction; Context (language use); Engineering and technology; Medical and health sciences; Electrical engineering, electronic engineering, information engineering; Feature (machine learning); Artificial intelligence; Natural language processing; Natural language; Developmental biology |
Source: | BigDataSecurity/HPSC/IDS |
Description: | Multi-label text classification, which tags a given plain text with the most relevant labels from a label space, is an important task in natural language processing. To support disease diagnosis, clinical researchers apply machine learning algorithms to multi-label clinical text classification. However, conventional machine learning methods capture neither deep semantic information nor the precise context of words. Diagnostic information in EHRs (Electronic Health Records) consists mainly of unstructured clinical free text, which is an obstacle to clinical feature extraction. Moreover, feature engineering is time-consuming and labor-intensive. Building on the rapid development of deep learning, we apply neural network models to address these problems. For multi-label classification on EHRs, we propose FAMLC-BERT (Feature-level Attention for Multi-label Classification on BERT), which captures semantic features from different layers. The model uses feature-level attention with BERT to recognize the labels of EHRs. We empirically compared our model with other state-of-the-art models on real-world documents collected from the hospital. Experiments show that our model achieved significant improvements over the other selected benchmark models. |
Database: | OpenAIRE |
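
The description mentions two ideas that a small example may make more concrete: combining features from different layers through attention, and assigning several labels to one text. The record gives no architectural details, so the following is only a minimal sketch under stated assumptions, not the FAMLC-BERT implementation. It assumes that feature-level attention can be approximated as a softmax-weighted sum of per-layer feature vectors, and that each label is scored independently with a sigmoid and a threshold. All names, vectors, weights, and the 0.5 threshold are hypothetical toy values; no real BERT model is involved.

```python
import math


def softmax(scores):
    """Turn raw scores into weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def layer_attention(layer_features, query):
    """Fuse one feature vector per layer into a single vector.

    Assumption: each layer is scored by its dot product with a
    (hypothetical) query vector, and the scores are softmax-normalized.
    """
    weights = softmax([dot(f, query) for f in layer_features])
    dim = len(layer_features[0])
    fused = [
        sum(w * f[i] for w, f in zip(weights, layer_features))
        for i in range(dim)
    ]
    return fused, weights


def predict_labels(fused, label_weights, label_biases, threshold=0.5):
    """Score every label independently; keep those at or above threshold."""
    probs = [
        sigmoid(dot(w, fused) + b)
        for w, b in zip(label_weights, label_biases)
    ]
    labels = [i for i, p in enumerate(probs) if p >= threshold]
    return labels, probs


# Toy stand-ins for features taken from three encoder layers.
layer_features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
query = [0.5, 0.5]
fused, weights = layer_attention(layer_features, query)

# Toy classifier with three labels.
label_weights = [[2.0, 2.0], [-2.0, -2.0], [1.0, 0.0]]
label_biases = [0.0, 0.0, 0.0]
labels, probs = predict_labels(fused, label_weights, label_biases)
print(labels)
```

Because each label has its own sigmoid rather than sharing one softmax, a text can receive zero, one, or several labels, which is what distinguishes multi-label from multi-class classification.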