Showing 1 - 6 of 6 for search: '"Detai Xin"'
Published in:
IEEE Access, Vol 12, Pp 19752-19764 (2024)
We present the JVNV, a Japanese emotional speech corpus with verbal content and nonverbal vocalizations whose scripts are generated by a large-scale language model. Existing emotional speech corpora lack not only proper emotional scripts but also non
External link:
https://doaj.org/article/b121501764a048fc8ac3fdaf60d2cfd7
Pause insertion, also known as phrase break prediction and phrasing, is an essential part of TTS systems because proper pauses with natural duration significantly enhance the rhythm and intelligibility of synthetic speech. However, conventional phras
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2cfcd44e1a66851b87c8700e9d7c7213
In this paper, we propose a method for intermediating multiple speakers' attributes and diversifying their voice characteristics in "speaker generation," an emerging task that aims to synthesize a nonexistent speaker's naturally sounding voice. The
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::eec0972e1974421c176eed162e5fa2a6
Published in:
Interspeech 2021.
Published in:
ICASSP
We propose a method for obtaining disentangled speaker and language representations via mutual information minimization and domain adaptation for cross-lingual text-to-speech (TTS) synthesis. The proposed method extracts speaker and language embeddin
Published in:
INTERSPEECH