Adaptive Duration Modeling for Speaker Adaptation of Speaking Rate
Autor: | Ping-Cheng Lin, 林秉正 |
---|---|
Rok vydání: | 2002 |
Druh dokumentu: | 學位論文 ; thesis |
Popis: | 90 Speaking rate is one of the mismatches between training and testing environments. Even though the same user speaks the same utterance, the speech signal especially speaking rate changes because of the emotion or other factors. Most speech recognition performance is degraded when speaking rate is faster or slower than normal condition. Speaker adaptation is an important technique which improves the speech recognition performance. MAP adaptation combines prior probability and few adaptation data to adapt model parameters. Duration model is feasible to describe the property of speaking rate. The recognition estimates both the HMM and duration model parameters during training. In adaptation phase, we apply MAP theory to adapt HMM and duration model parameters together. This paper presents a new method to adapt model duration parameters. The MAP adaptation technique here is aimed at dealing with the problem of changing speaking rate. From the experiments, the recognition performance is significantly improved by adapting the duration model parameters. The adapted models are more robust when recognizing the utterance in fast speaking rate. |
Databáze: | Networked Digital Library of Theses & Dissertations |
Externí odkaz: |