Modeling and interpolation of Austrian German and Viennese dialect in HMM-based speech synthesis
Autor: | Michael Pucher, Dietmar Schabus, Junichi Yamagishi, Friedrich Neubarth, Volker Strom |
---|---|
Rok vydání: | 2010 |
Předmět: |
Linguistics and Language
Computer science Speech recognition Speech synthesis 02 engineering and technology computer.software_genre Language and Linguistics German Modelling and Simulation 0202 electrical engineering electronic engineering information engineering Hidden Markov model Cluster analysis Categorical variable business.industry Communication 020206 networking & telecommunications Phonology language.human_language Computer Science Applications Acoustic space Phonological rule Modeling and Simulation language 020201 artificial intelligence & image processing Computer Vision and Pattern Recognition Artificial intelligence business computer Software Natural language processing |
Zdroj: | Pucher, M, Schabus, D, Yamagishi, J, Neubarth, F & Strom, V 2010, ' Modeling and interpolation of Austrian German and Viennese dialect in HMM-based speech synthesis ', Speech Communication, vol. 52, no. 2, pp. 164-179 . https://doi.org/10.1016/j.specom.2009.09.004 |
ISSN: | 0167-6393 |
DOI: | 10.1016/j.specom.2009.09.004 |
Popis: | An HMM-based speech synthesis framework is applied to both standard Austrian German and a Viennese dialectal variety and several training strategies for multi-dialect modeling such as dialect clustering and dialect-adaptive training are investigated. For bridging the gap between processing on the level of HMMs and on the linguistic level, we add phonological transformations to the HMM interpolation and apply them to dialect interpolation. The crucial steps are to employ several formalized phonological rules between Austrian German and Viennese dialect as constraints for the HMM interpolation. We verify the effectiveness of this strategy in a number of perceptual evaluations. Since the HMM space used is not articulatory but acoustic space, there are some variations in evaluation results between the phonological rules. However, in general we obtained good evaluation results which show that listeners call perceive both continuous and categorical changes of dialect varieties by using phonological transformations employed as switching rules in the HMM interpolation. (C) 2009 Elsevier B.V. All rights reserved. |
Databáze: | OpenAIRE |
Externí odkaz: |