Výsledky vyhledávání

Akademický článek

Effective Emotion Transplantation in an End-to-End Text-to-Speech System

Autor: Young-Sun Joo, Hanbin Bae, Young-Ik Kim, Hoon-Young Cho, Hong-Goo Kang

Publikováno v: IEEE Access, Vol 8, Pp 161713-161719 (2020)

In this paper, we propose an effective technique to transplant a source speaker’s emotional expression to a new target speaker’s voice within an end-to-end text-to-speech (TTS) framework. We modify an expressive TTS model pre-trained using a sour

Externí odkaz: https://doaj.org/article/0c976761d225494ca487e81f335503d3

Zobrazit plný text záznamu

FastPitchFormant: Source-filter based Decomposed Modeling for Speech Synthesis

Autor: Jae-Sung Bae, Taejun Bak, Hoon-Young Cho, Hanbin Bae, Young-Ik Kim

Methods for modeling and controlling prosody with acoustic features have been proposed for neural text-to-speech (TTS) models. Prosodic speech can be generated by conditioning acoustic features. However, synthesized speech with a large pitch-shift sc

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::f235d5e28a6bc189bd93f846078e07c6

Zobrazit plný text záznamu

N-Singer: A Non-Autoregressive Korean Singing Voice Synthesis System for Pronunciation Enhancement

Autor: Hanbin Bae, Gyeong-Hoon Lee, Tae-Woo Kim, Min-Ji Lee, Hoon-Young Cho, Young-Ik Kim

Recently, end-to-end Korean singing voice systems have been designed to generate realistic singing voices. However, these systems still suffer from a lack of robustness in terms of pronunciation accuracy. In this paper, we propose N-Singer, a non-aut

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d37f7097974d47e4ee87f93f7c424797

Zobrazit plný text záznamu

A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music

Autor: Jae-Sung Bae, Young-Ik Kim, Young-Sun Joo, Hoon-Young Cho, Hanbin Bae

Publikováno v: ICASSP

Recently, it has become easier to obtain speech data from various media such as the internet or YouTube, but directly utilizing them to train a neural text-to-speech (TTS) model is difficult. The proportion of clean speech is insufficient and the rem

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::347866965a86a69d5eca7659604548ba

Zobrazit plný text záznamu

Speaking Speed Control of End-to-End Speech Synthesis using Sentence-Level Conditioning

Autor: Hanbin Bae, Gyeong-Hoon Lee, Jae-Sung Bae, Junmo Lee, Hoon-Young Cho, Young-Sun Joo

Publikováno v: INTERSPEECH

This paper proposes a controllable end-to-end text-to-speech (TTS) system to control the speaking speed (speed-controllable TTS; SCTTS) of synthesized speech with sentence-level speaking-rate value as an additional input. The speaking-rate value, the

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::147f9868d545a86b18708db1d00012c8

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání