Systematic Evaluation and Enhancement of Speech Recognition in Operational Medical Environments

Authors:	Sufeng Niu, Ju Lin, Melissa C. Smith, Kuang-Ching Wang, Caleb Linduff, Nicholas Deas, MinJae Woo, Prabodh Mishra, Ronald W. Gimbel, Yuzhe Yang, D. Hudson Smith, Jerome McClendon, Snigdhaswin Kar
Year of publication:	2021
Subject:	Speech enhancement; Documentation; Data collection; Artificial neural network; Computer science; Speech recognition; Language model; Noise (video); Adaptation (computer science); Test data
Source:	IJCNN
Description:	Operational medical environments require reliable hands-free solutions that extract data from audio captured in noisy conditions during rescue missions and deliver timely information. However, approaches based on automatic speech recognition (ASR) and natural language processing (NLP) are complex because these conversations contain a wide range of noise, involve medical terminology from multiple speakers, and occur in high-stress environments, among other factors. These challenges are compounded by the lack of large training datasets for operational medical scenarios. To address these issues, we developed a platform that enables resilient hands-free data collection, preserves complete documentation through the stages of care, and presents the information in near real-time, which is critical for the medical operation. Our work focuses on the systematic evaluation and improvement of a deep neural network-based ASR system using realistic test data obtained from medical simulations of battlefield scenarios, which to our knowledge has not been addressed in any prior work. System performance is shown to improve significantly with multi-style training, language model adaptation for the medical domain, speech enhancement, and NLP techniques.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::850b08f837f71bfff8c7865ff9d853b9 https://doi.org/10.1109/ijcnn52387.2021.9533607