A method for realistic, conversational signal-to-noise ratio estimation
Autor: | Adam Westermann, Marton Marschall, Naim Mansour, Tobias May, Torsten Dau |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2021 |
Předmět: |
Voice activity detection
Acoustics and Ultrasonics Microphone Computer science Speech recognition media_common.quotation_subject Signal-To-Noise Ratio Signal Power (physics) 03 medical and health sciences 0302 clinical medicine Signal-to-noise ratio Hearing Aids Arts and Humanities (miscellaneous) Hearing Speech Perception Humans Conversation 030223 otorhinolaryngology Noise 030217 neurology & neurosurgery Impulse response media_common |
Zdroj: | Mansour, N, Marschall, M, May, T, Westermann, A & Dau, T 2021, ' A method for realistic, conversational signal-to-noise ratio estimation ', Journal of the Acoustical Society of America, vol. 149, no. 3, pp. 1559-1566 . https://doi.org/10.1121/10.0003626 |
Popis: | The analysis of real-world conversational signal-to-noise ratios (SNRs) can provide insight into people's communicative strategies and difficulties and guide the development of hearing devices.However, measuring SNRs accurately is challenging in everyday recording conditions in which only a mixture of sound sources can be captured. This study introduces a method for accurate in situ SNR estimation where the speech signal of a target talker in natural conversation is captured by a cheek-mounted microphone, adjusted for free-field conditions and convolved with a measured impulse response to estimate its power at the receiving talker. A microphone near the receiver provides the noise-only component through voice activity detection. The method is applied to in situ recordings of conversations in two real-world sound scenarios. It is shown that the broadband speech level and SNRdistributions are estimated more accurately by the proposed method compared to a typical single-channel method, especially in challenging, low-SNR environments. The application of the proposed two-channel method may render more realistic estimates of conversational SNRs and provide valuable inputto hearing instrument processing strategies whose operating points are determined by accurate SNR estimates. |
Databáze: | OpenAIRE |
Externí odkaz: |