Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Jawaid, Ahad"'
Recent advances in style transfer text-to-speech (TTS) have improved the expressiveness of synthesized speech. However, encoding stylistic information (e.g., timbre, emotion, and prosody) from diverse and unseen reference speech remains a challenge.
Externí odkaz:
http://arxiv.org/abs/2406.03637