Code-Switching Automatic Speech Recognition for Nursing Record Documentation: System Development and Evaluation.
Autor: | Hou SY; Artificial Intelligence Center for Medical Diagnosis, China Medical University Hospital, Taichung City, Taiwan., Wu YL; Artificial Intelligence Center for Medical Diagnosis, China Medical University Hospital, Taichung City, Taiwan., Chen KC; Artificial Intelligence Center for Medical Diagnosis, China Medical University Hospital, Taichung City, Taiwan., Chang TA; Artificial Intelligence Center for Medical Diagnosis, China Medical University Hospital, Taichung City, Taiwan., Hsu YM; Artificial Intelligence Center for Medical Diagnosis, China Medical University Hospital, Taichung City, Taiwan., Chuang SJ; Artificial Intelligence Center for Medical Diagnosis, China Medical University Hospital, Taichung City, Taiwan., Chang Y; Artificial Intelligence Center for Medical Diagnosis, China Medical University Hospital, Taichung City, Taiwan., Hsu KC; Artificial Intelligence Center for Medical Diagnosis, China Medical University Hospital, Taichung City, Taiwan. |
---|---|
Jazyk: | angličtina |
Zdroj: | JMIR nursing [JMIR Nurs] 2022 Dec 07; Vol. 5 (1), pp. e37562. Date of Electronic Publication: 2022 Dec 07. |
DOI: | 10.2196/37562 |
Abstrakt: | Background: Taiwan has insufficient nursing resources due to the high turnover rate of health care providers. Therefore, reducing the heavy workload of these employees is essential. Herein, speech transcription, which has various potential clinical applications, was employed for the documentation of nursing records. The requirement of including only one speaker per transcription facilitated data collection and system development. Moreover, authorization from patients was unnecessary. Objective: The aim of this study was to construct a speech recognition system for nursing records such that health care providers can complete nursing records without typing or with only a few edits. Methods: Nursing records in Taiwan are mainly written in Mandarin, with technical terms and abbreviations presented in both Mandarin and English. Therefore, the training set consisted of English code-switching information. Next, transfer learning (TL) and meta-TL (MTL) methods, which perform favorably in code-switching scenarios, were applied. Results: As of September 2021, the China Medical University Hospital Artificial Intelligence Speech (CMaiSpeech) data set was established by manually annotating approximately 100 hours of recordings from 525 speakers. The word error rate (WER) of the benchmark model of syllable-based TL was 29.54% in code-switching. The WER of the proposed model of syllable-based MTL was 22.20% in code-switching. The test set comprised 17,247 words. Moreover, in a clinical case, the proposed model of syllable-based MTL yielded a WER of 31.06% in code-switching. The clinical test set contained 1159 words. Conclusions: This paper has two main contributions. First, the CMaiSpeech data set-a Mandarin-English corpus-has been established. Health care providers in Taiwan are often compelled to use a mixture of Mandarin and English in nursing records. Second, an automatic speech recognition system for nursing record document conversion was proposed. The proposed system can shorten the work handover time and further reduce the workload of health care providers. (©Shih-Yen Hou, Ya-Lun Wu, Kai-Ching Chen, Ting-An Chang, Yi-Min Hsu, Su-Jung Chuang, Ying Chang, Kai-Cheng Hsu. Originally published in JMIR Nursing (https://nursing.jmir.org), 07.12.2022.) |
Databáze: | MEDLINE |
Externí odkaz: |