Showing 1 - 5 of 5 for search: '"Hanbin Bae"'
Published in:
IEEE Access, Vol 8, Pp 161713-161719 (2020)
In this paper, we propose an effective technique to transplant a source speaker’s emotional expression to a new target speaker’s voice within an end-to-end text-to-speech (TTS) framework. We modify an expressive TTS model pre-trained using a sour
External link:
https://doaj.org/article/0c976761d225494ca487e81f335503d3
Methods for modeling and controlling prosody with acoustic features have been proposed for neural text-to-speech (TTS) models. Prosodic speech can be generated by conditioning acoustic features. However, synthesized speech with a large pitch-shift sc
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f235d5e28a6bc189bd93f846078e07c6
Recently, end-to-end Korean singing voice systems have been designed to generate realistic singing voices. However, these systems still suffer from a lack of robustness in terms of pronunciation accuracy. In this paper, we propose N-Singer, a non-aut
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d37f7097974d47e4ee87f93f7c424797
Published in:
ICASSP
Recently, it has become easier to obtain speech data from various media such as the internet or YouTube, but directly utilizing them to train a neural text-to-speech (TTS) model is difficult. The proportion of clean speech is insufficient and the rem
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::347866965a86a69d5eca7659604548ba
Published in:
INTERSPEECH
This paper proposes a controllable end-to-end text-to-speech (TTS) system to control the speaking speed (speed-controllable TTS; SCTTS) of synthesized speech with sentence-level speaking-rate value as an additional input. The speaking-rate value, the
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::147f9868d545a86b18708db1d00012c8