A novel NMF-HMM speech enhancement algorithm based on Poisson mixture model

Autor: Yang Xiang, Liming Shi, Jesper Lisby Højvang, Morten Højfeldt Rasmussen, Mads Grasboll Christensen
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: Xiang, Y, Shi, L, Lisby Højvang, J, Højfeldt Rasmussen, M & Christensen, M G 2021, A novel NMF-HMM speech enhancement algorithm based on Poisson mixture model . in ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) ., 9414620, IEEE, I E E E International Conference on Acoustics, Speech and Signal Processing. Proceedings, pp. 721-725, ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), Toronto, Ontario, Canada, 06/06/2021 . https://doi.org/10.1109/ICASSP39728.2021.9414620
ICASSP
DOI: 10.1109/ICASSP39728.2021.9414620
Popis: In this paper, we propose a novel non-negative matrix factorization (NMF) and hidden Markov model (NMF-HMM) based speech enhancement algorithm, which employs a Poisson mixture model (PMM). {Compared to} the previously proposed NMF-HMM method, the new algorithm, termed PMM-NMF-HMM, {uses} the Poisson mixture distribution for the state conditional likelihood function for a HMM rather than the single Poisson distribution. {This means that there are the more basis matrices that can be used to model the speech and noise signals, so more signal information can be captured by the resulting model. The proposed method is supervised and thus includes a training and an enhancement stage. It is shown that, in the training stage, the proposed method can be implemented efficiently using multiplicative update (MU) for the model parameters, much like the NMF-HMM algorithm. In the speech enhancement stage, which can be performed online, a novel PMM-NMF-HMM minimum mean-square error (MMSE) estimator is developed. The experimental results indicate that the PMM-NMF-HMM method can obtain higher short-time objective intelligibility (STOI) and perceptual evaluation of speech quality (PESQ) score than NMF-HMM. Additionally, the {method also outperforms other state-of-the-art NMF-based supervised speech enhancement algorithms.
Databáze: OpenAIRE