Autor: |
Badino, Leonardo, Clark, Robert A.J., Wester, Mirjam |
Rok vydání: |
2012 |
Zdroj: |
Badino, L, Clark, R A J & Wester, M 2012, Towards Hierarchical Prosodic Prominence Generation in TTS Synthesis . in INTERSPEECH 2012 13th Annual Conference of the International Speech Communication Association . pp. 2398-2401 . < http://www.isca-speech.org/archive/archive_papers/interspeech_2012/i12_2398.pdf > |
DOI: |
10.21437/interspeech.2012-628 |
Popis: |
We address the problem of identification (from text) and generation of pitch accents in HMM-based English TTS synthesis. We show, through a large scale perceptual test, that a large improvement of the binary discrimination between pitch accented and non-accented words has no effect on the quality of the speech generated by the system. On the other side adding a third accent type that emphatically marks words that convey ”contrastive” focus (automatically identified from text) produces beneficial effects on the synthesized speech. These results support the accounts on prosodic prominence that consider the prosodic patterns of utterances as hierarchical structured and point out the limits of a flattening of such structure resulting from a simple accent/non-accent distinction. |
Databáze: |
OpenAIRE |
Externí odkaz: |
|