Integrated Expression Prediction and Speech Synthesis From Text

Autor:	Norbert Braunschweiler, Masami Akamine, Kate Knill, Mark J. F. Gales, Langzhou Chen
Rok vydání:	2014
Předmět:	Scheme (programming language) Artificial neural network Computer science business.industry Discrete space Speech recognition Speech synthesis Space (commercial competition) computer.software_genre Expression (mathematics) Signal Processing Artificial intelligence Electrical and Electronic Engineering Hidden Markov model business Representation (mathematics) computer Natural language processing computer.programming_language
Zdroj:	IEEE Journal of Selected Topics in Signal Processing. 8:323-335
ISSN:	1941-0484 1932-4553
DOI:	10.1109/jstsp.2013.2294938
Popis:	Generating expressive, naturally sounding, speech from text using a speech synthesis (TTS) system is a highly challenging problem. However for tasks such as audiobooks it is essential if their use is to become widespread. Generating expressive speech from text can be divided into two parts: predicting expressive information from text; and synthesizing the speech with a particular expression. Traditionally these components have been studied separately. This paper proposes an integrated approach, where the training data and representation of expressive synthesis is shared across the two components. There are several advantages to this scheme including: robust handling of automatically generated expressive labels; support for a continuous representation of expressions; and joint training of the expression predictor and speech synthesizer. Synthesis experiments indicated that the proposed approach produced far more expressive speech than both a neutral TTS and one where the expression was randomly selected. The experimental results also show the advantage of a continuous expressive synthesis space over a discrete space.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::1f531dc7ffc5044495337d94ea39fb3f https://doi.org/10.1109/jstsp.2013.2294938 Zobrazit plný text záznamu