Zobrazeno 1 - 10
of 31
pro vyhledávání: '"Mattheyses Wesley"'
Publikováno v:
EURASIP Journal on Audio, Speech, and Music Processing, Vol 2009, Iss 1, p 169819 (2009)
Audiovisual text-to-speech systems convert a written text into an audiovisual speech signal. Typically, the visual mode of the synthetic speech is synthesized separately from the audio, the latter being either natural or synthesized speech. However,
Externí odkaz:
https://doaj.org/article/b01d2ca3fc8e43c68c6899da72701e28
Autor:
Mattheyses, Wesley, Verhelst, Werner
Publikováno v:
In Speech Communication February 2015 66:182-217
Publikováno v:
In Speech Communication September 2013 55(7-8):857-876
Even though speech synthesis is the most frequently needed language technology for people with communicative disabilities (e.g. see [1]), the number of commercially-available synthetic voices is still rather small, especially for medium-sized languag
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______3848::69399b0f061b99c95aedb739ed8ab261
https://biblio.vub.ac.be/vubir/highquality-flemish-texttospeech-synthesis(288dbe02-e8c5-4f8a-8d7c-36a5f35a6034).html
https://biblio.vub.ac.be/vubir/highquality-flemish-texttospeech-synthesis(288dbe02-e8c5-4f8a-8d7c-36a5f35a6034).html
Both auditory and audiovisual speech synthesis have been the subject of many research projects throughout the years. Unfortunately, in recent years only very few research focuses on synthesis for the Dutch language. Especially for audiovisual synthes
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______3848::93fd9d15de6b05a67670ddf16c0e0d3e
https://biblio.vub.ac.be/vubir/auditory-and-photorealistic-audiovisual-speech-synthesis-for-dutch(a7ba6270-5c45-4980-bb35-3714c634a1ae).html
https://biblio.vub.ac.be/vubir/auditory-and-photorealistic-audiovisual-speech-synthesis-for-dutch(a7ba6270-5c45-4980-bb35-3714c634a1ae).html
Publikováno v:
Interspeech 2011.
One of the key challenges of optimizing a unit selection voice is obtaining suitable target and join cost weights. In this paper we investigate several strategies to train these weights automatically. Two training algorithms are tested, which are bas
Active appearance models can represent image information in terms of shape and texture parameters. This paper explains why this makes them highly suitable for data-based 2D audiovisual text-to-speech synthesis. We elaborate on how the differentiation
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______3848::545762575498904824c507765dfa762e
https://hdl.handle.net/20.500.14017/0193c1ed-dfe3-400b-b900-71c6bbe434fc
https://hdl.handle.net/20.500.14017/0193c1ed-dfe3-400b-b900-71c6bbe434fc
In this paper we describe the voices we submitted to the 2010 Blizzard Challenge, a yearly challenge to evaluate auditory speech synthesis on common data. One of the goals of a data-driven synthesizer, such as ours, is to generalize the speech databa
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______3848::89589ba9e5313bd97de548a9e129a15c
https://biblio.vub.ac.be/vubir/the-vub-blizzard-challenge-2010-entry-towards-automatic-voice-building(607ff2bc-921d-4317-bbfb-634460ba0770).html
https://biblio.vub.ac.be/vubir/the-vub-blizzard-challenge-2010-entry-towards-automatic-voice-building(607ff2bc-921d-4317-bbfb-634460ba0770).html
This paper proposes a 2D audiovisual text-to-speech synthesis system that constructs the output signal by selecting and concatenating multimodal segments containing natural combinations of audio and video. We describe the experiments that were conduc
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______3848::25228f981d0297fb573e2e06d891bd5f
https://biblio.vub.ac.be/vubir/multimodal-coherency-issues-in-designing-and-optimizing-audiovisual-speech-synthesis-techniques(625a30ea-4151-473f-8b6d-cf3694380cdc).html
https://biblio.vub.ac.be/vubir/multimodal-coherency-issues-in-designing-and-optimizing-audiovisual-speech-synthesis-techniques(625a30ea-4151-473f-8b6d-cf3694380cdc).html
In this paper we describe the voices we submitted to the 2009 Blizzard Challenge, a yearly challenge to evaluate auditory speech synthesis on common data. Since it is the second time we participate in this challenge, in this paper we focus on the cha
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od______3848::16191554e68a25adc5a21e00081c38a8
https://biblio.vub.ac.be/vubir/the-vub-blizzard-challenge-2009-entry(91bf4fe9-0619-4284-ac47-fb7d9b7ef99b).html
https://biblio.vub.ac.be/vubir/the-vub-blizzard-challenge-2009-entry(91bf4fe9-0619-4284-ac47-fb7d9b7ef99b).html