Speech enhancement using MMSE estimation under phase uncertainty
Autor: | Ravikumar Kandagatla, P.V. Subbaiah |
---|---|
Rok vydání: | 2017 |
Předmět: |
Linguistics and Language
Minimum mean square error Computer science Noise reduction Speech recognition Generalized gamma distribution Short-time Fourier transform Estimator 020206 networking & telecommunications 02 engineering and technology Intelligibility (communication) Language and Linguistics Human-Computer Interaction Speech enhancement 030507 speech-language pathology & audiology 03 medical and health sciences Computer Science::Sound Prior probability 0202 electrical engineering electronic engineering information engineering Computer Vision and Pattern Recognition 0305 other medical science Software |
Zdroj: | International Journal of Speech Technology. 20:373-385 |
ISSN: | 1572-8110 1381-2416 |
Popis: | Most of the speech enhancement algorithms process the amplitudes of speech, but the phase of noisy speech is left unprocessed as it may cause undesired artifacts. Recently, short time Fourier transform based single channel speech enhancement algorithms are developed by considering uncertain prior knowledge of phase. The uncertain knowledge of the phase is obtained from the phase reconstruction algorithms. The goal of this paper is to develop joint minimum mean square error estimate of complex speech coefficients given uncertainty phase (CUP) information by considering Nagakami probability density function (PDF) and gamma PDF as speech spectral amplitude priors and generalized gamma PDF for noise prior. Estimators like amplitudes given uncertainty phase, which uses uncertain phase only for amplitude estimation and not for phase improvement are developed. Experimental results shows that incorporating uncertain phase information improves quality and intelligibility of speech. Also novel phase-blind estimators are developed using Nagakami PDF/gamma as speech priors and generalized gamma as noise prior. Finally comparison of all estimators using uncertain prior phase information is discussed and how initial phase information affects the enhancement process is analyzed with novel estimators. For comparison of all the derived estimators, the speech signals uttered by male and female speakers are taken from TIMIT database. The proposed CUP estimators outperforms the existing algorithms in terms of objective performance measure segmental signal to noise ratio, phase signal to noise ratio, perceptual evaluation of speech quality, short time objective intelligibility. |
Databáze: | OpenAIRE |
Externí odkaz: |