A Speech Parameter Generation Algorithm Considering Global Variance for HMM-Based Speech Synthesis
Autor: | Keiichi Tokuda, Tomoki Toda |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2007 |
Předmět: |
speech parameter generation
Audio signal Speech recognition maximum likelihood criterion HMM-based speech synthesis Speech synthesis Variance (accounting) computer.software_genre Reduction (complexity) Constraint (information theory) over-smoothing effect Naturalness Artificial Intelligence Hardware and Architecture Computer Science::Sound global variance Trajectory Computer Vision and Pattern Recognition Electrical and Electronic Engineering Hidden Markov model computer Algorithm Software Mathematics |
Zdroj: | IEICE Transactions on Information and Systems. (5):816-824 |
ISSN: | 0916-8532 |
Popis: | This paper describes a novel parameter generation algorithm for an HMM-based speech synthesis technique. The conventional algorithm generates a parameter trajectory of static features that maximizes the likelihood of a given HMM for the parameter sequence consisting of the static and dynamic features under an explicit constraint between those two features. The generated trajectory is often excessively smoothed due to the statistical processing. Using the over-smoothed speech parameters usually causes muffled sounds. In order to alleviate the over-smoothing effect, we propose a generation algorithm considering not only the HMM likelihood maximized in the conventional algorithm but also a likelihood for a global variance (GV) of the generated trajectory. The latter likelihood works as a penalty for the over-smoothing, i.e., a reduction of the GV of the generated trajectory. The result of a perceptual evaluation demonstrates that the proposed algorithm causes considerably large improvements in the naturalness of synthetic speech. |
Databáze: | OpenAIRE |
Externí odkaz: |