The role of cue enhancement and frequency fine-tuning in hearing impaired phone recognition
Autor: | Jont B. Allen, Ali Abavisani |
---|---|
Rok vydání: | 2019 |
Předmět: |
Consonant
Hearing aid Signal Processing (eess.SP) FOS: Computer and information sciences Computer Science - Machine Learning Fine-tuning medicine.medical_specialty Sound (cs.SD) Acoustics and Ultrasonics Computer science medicine.medical_treatment Audiology Quantitative Biology - Quantitative Methods Computer Science - Sound Machine Learning (cs.LG) Arts and Humanities (miscellaneous) Phone Audio and Speech Processing (eess.AS) Vowel medicine FOS: Electrical engineering electronic engineering information engineering Electrical Engineering and Systems Science - Signal Processing Quantitative Methods (q-bio.QM) medicine.diagnostic_test FOS: Biological sciences Hearing test Hearing impaired Noise (video) Electrical Engineering and Systems Science - Audio and Speech Processing |
DOI: | 10.48550/arxiv.1908.04751 |
Popis: | A speech-based hearing test is designed to identify the susceptible error-prone phones for individual hearing impaired (HI) ear. Only robust tokens in the experiment noise levels had been chosen for the test. The noise-robustness of tokens is measured as SNR90 of the token, which is the signal to the speech-weighted noise ratio where a normal hearing (NH) listener would recognize the token with an accuracy of 90% on average. Two sets of tokens T1 and T2 having the same consonant-vowels but different talkers with distinct SNR90 had been presented with flat gain at listeners' most comfortable level. We studied the effects of frequency fine-tuning of the primary cue by presenting tokens of the same consonant but different vowels with similar SNR90. Additionally, we investigated the role of changing the intensity of primary cue in HI phone recognition, by presenting tokens from both sets T1 and T2. On average, 92% of tokens are improved when we replaced the CV with the same CV but with a more robust talker. Additionally, using CVs with similar SNR90, on average, tokens are improved by 75%, 71%, 63%, and 72%, when we replaced vowels /A, ae, I, E/, respectively. The confusion pattern in each case provides insight into how these changes affect the phone recognition in each HI ear. We propose to prescribe hearing aid amplification tailored to individual HI ears, based on the confusion pattern, the response from cue enhancement, and the response from frequency fine-tuning of the cue. Comment: 16 pages, 10 figures, proceedings of the Acoustical Society of America meeting, May 2019, Louisville, KY |
Databáze: | OpenAIRE |
Externí odkaz: |