Objective Prediction of Hearing Aid Benefit Across Listener Groups Using Machine Learning: Speech Recognition Performance With Binaural Noise-Reduction Algorithms
Autor: | Anna Warzybok, Birger Kollmeier, Marc René Schädler |
---|---|
Rok vydání: | 2018 |
Předmět: |
Adult
Male Hearing aid Computer science Noise reduction medicine.medical_treatment Speech recognition hearing aid 01 natural sciences Machine Learning Young Adult 03 medical and health sciences Speech and Hearing Speech recognition performance Hearing Aids 0302 clinical medicine 0103 physical sciences medicine Humans Speech speech perception modeling 030223 otorhinolaryngology 010301 acoustics Aged speech recognition Auditory Threshold hearing impairment Middle Aged lcsh:Otorhinolaryngology lcsh:RF1-547 Otorhinolaryngology Speech Perception Female Original Article Fade Binaural recording Forecasting |
Zdroj: | Trends in Hearing, Vol 22 (2018) Trends in Hearing |
ISSN: | 2331-2165 |
DOI: | 10.1177/2331216518768954 |
Popis: | The simulation framework for auditory discrimination experiments (FADE) was adopted and validated to predict the individual speech-in-noise recognition performance of listeners with normal and impaired hearing with and without a given hearing-aid algorithm. FADE uses a simple automatic speech recognizer (ASR) to estimate the lowest achievable speech reception thresholds (SRTs) from simulated speech recognition experiments in an objective way, independent from any empirical reference data. Empirical data from the literature were used to evaluate the model in terms of predicted SRTs and benefits in SRT with the German matrix sentence recognition test when using eight single- and multichannel binaural noise-reduction algorithms. To allow individual predictions of SRTs in binaural conditions, the model was extended with a simple better ear approach and individualized by taking audiograms into account. In a realistic binaural cafeteria condition, FADE explained about 90% of the variance of the empirical SRTs for a group of normal-hearing listeners and predicted the corresponding benefits with a root-mean-square prediction error of 0.6 dB. This highlights the potential of the approach for the objective assessment of benefits in SRT without prior knowledge about the empirical data. The predictions for the group of listeners with impaired hearing explained 75% of the empirical variance, while the individual predictions explained less than 25%. Possibly, additional individual factors should be considered for more accurate predictions with impaired hearing. A competing talker condition clearly showed one limitation of current ASR technology, as the empirical performance with SRTs lower than −20 dB could not be predicted. |
Databáze: | OpenAIRE |
Externí odkaz: |