Maximum-likelihood stochastic-transformation adaptation of hidden Markov models
Autor: | Vassilios Diakoloukas, Vassilios Digalakis |
---|---|
Rok vydání: | 1999 |
Předmět: |
Analysis of covariance
Vocabulary Acoustics and Ultrasonics business.industry Computer science Speech recognition media_common.quotation_subject Pattern recognition Speaker recognition Speech processing Linear map symbols.namesake ComputingMethodologies_PATTERNRECOGNITION Robustness (computer science) symbols Computer Vision and Pattern Recognition Artificial intelligence Electrical and Electronic Engineering business Hidden Markov model Gaussian process Software media_common |
Zdroj: | IEEE Transactions on Speech and Audio Processing. 7:177-187 |
ISSN: | 1063-6676 |
DOI: | 10.1109/89.748122 |
Popis: | The recognition accuracy in previous large vocabulary automatic speech recognition (ASR) systems is highly related to the existing mismatch between the training and testing sets. For example, dialect differences across the training and testing speakers result in a significant degradation in recognition performance. Some popular adaptation approaches improve the recognition performance of speech recognizers based on hidden Markov models with continuous mixture densities by using linear transformations to adapt the means, and possibly the covariances of the mixture Gaussians. The linear assumption, however, is too restrictive, and in this paper we propose a novel adaptation technique that adapts the means and, optionally, the covariances of the mixture Gaussians by using multiple stochastic transformations. We perform both speaker and dialect adaptation experiments, and we show that our method significantly improves the recognition accuracy and the robustness of our system. The experiments are carried out with SRI's DECIPHER speech recognition system. |
Databáze: | OpenAIRE |
Externí odkaz: |