Exploiting Nonacoustic Sensors for Speech Encoding

Authors:	D. Messing, P.D. Gatewood, Clifford J. Weinstein, Kevin Brady, Joseph P. Campbell, Michael S. Brandstein, William M. Campbell, J.D. Tardelli, Thomas F. Quatieri
Year of publication:	2006
Subject:	Acoustics and Ultrasonics; Frequency band; Computer science; Microphone; Acoustics; Speech recognition; Speech coding; Acoustic wave; Electrical and Electronic Engineering; Intelligibility (communication); Speech processing; Vocal tract; Nasality
Source:	IEEE Transactions on Audio, Speech and Language Processing. 14:533-544
ISSN:	1558-7916
DOI:	10.1109/tsa.2005.855838
Description:	The intelligibility of speech transmitted through low-rate coders is severely degraded when high levels of acoustic noise are present in the acoustic environment. Recent advances in nonacoustic sensors, including microwave radar, skin vibration, and bone conduction sensors, provide the exciting possibility of both glottal excitation and, more generally, vocal tract measurements that are relatively immune to acoustic disturbances and can supplement the acoustic speech waveform. We are currently investigating methods of combining the output of these sensors for use in low-rate encoding according to their capability in representing specific speech characteristics in different frequency bands. Nonacoustic sensors have the ability to reveal certain speech attributes lost in the noisy acoustic signal; for example, low-energy consonant voice bars, nasality, and glottalized excitation. By fusing nonacoustic low-frequency and pitch content with acoustic-microphone content, we have achieved significant intelligibility performance gains using the DRT across a variety of environments over the government standard 2400-bps MELPe coder. By fusing quantized high-band 4-to-8-kHz speech, requiring only an additional 116 bps, we obtain further DRT performance gains by exploiting the ear's insensitivity to fine spectral detail in this frequency region.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::93c9b63e46cb17603f2582ccc2952143 https://doi.org/10.1109/tsa.2005.855838