Výsledky vyhledávání - "Du, Zongyang"

Report

Towards Naturalistic Voice Conversion: NaturalVoices Dataset with an Automatic Processing Pipeline

Autor: Salman, Ali N., Du, Zongyang, Chandra, Shreeram Suresh, Ulgen, Ismail Rasim, Busso, Carlos, Sisman, Berrak

Voice conversion (VC) research traditionally depends on scripted or acted speech, which lacks the natural spontaneity of real-life conversations. While natural speech data is limited for VC, our study focuses on filling in this gap. We introduce a no

Externí odkaz: http://arxiv.org/abs/2406.04494

Zobrazit plný text záznamu

Report

Exploring speech style spaces with language models: Emotional TTS without emotion labels

Autor: Chandra, Shreeram Suresh, Du, Zongyang, Sisman, Berrak

Many frameworks for emotional text-to-speech (E-TTS) rely on human-annotated emotion labels that are often inaccurate and difficult to obtain. Learning emotional prosody implicitly presents a tough challenge due to the subjective nature of emotions.

Externí odkaz: http://arxiv.org/abs/2405.11413

Zobrazit plný text záznamu

Report

Converting Anyone's Voice: End-to-End Expressive Voice Conversion with a Conditional Diffusion Model

Autor: Du, Zongyang, Lu, Junchen, Zhou, Kun, Kaushik, Lakshmish, Sisman, Berrak

Expressive voice conversion (VC) conducts speaker identity conversion for emotional speakers by jointly converting speaker identity and emotional style. Emotional style modeling for arbitrary speakers in expressive VC has not been extensively explore

Externí odkaz: http://arxiv.org/abs/2405.01730

Zobrazit plný text záznamu

Report

Revealing Emotional Clusters in Speaker Embeddings: A Contrastive Learning Strategy for Speech Emotion Recognition

Autor: Ulgen, Ismail Rasim, Du, Zongyang, Busso, Carlos, Sisman, Berrak

Speaker embeddings carry valuable emotion-related information, which makes them a promising resource for enhancing speech emotion recognition (SER), especially with limited labeled data. Traditionally, it has been assumed that emotion information is

Externí odkaz: http://arxiv.org/abs/2401.11017

Zobrazit plný text záznamu

Report

Disentanglement of Emotional Style and Speaker Identity for Expressive Voice Conversion

Autor: Du, Zongyang, Sisman, Berrak, Zhou, Kun, Li, Haizhou

Expressive voice conversion performs identity conversion for emotional speakers by jointly converting speaker identity and emotional style. Due to the hierarchical structure of speech emotion, it is challenging to disentangle the emotional style for

Externí odkaz: http://arxiv.org/abs/2110.10326

Zobrazit plný text záznamu

Report

Expressive Voice Conversion: A Joint Framework for Speaker Identity and Emotional Style Transfer

Autor: Du, Zongyang, Sisman, Berrak, Zhou, Kun, Li, Haizhou

Traditional voice conversion(VC) has been focused on speaker identity conversion for speech with a neutral expression. We note that emotional expression plays an essential role in daily communication, and the emotional style of speech can be speaker-

Externí odkaz: http://arxiv.org/abs/2107.03748

Zobrazit plný text záznamu

Report

Spectrum and Prosody Conversion for Cross-lingual Voice Conversion with CycleGAN

Autor: Du, Zongyang, Zhou, Kun, Sisman, Berrak, Li, Haizhou

Cross-lingual voice conversion aims to change source speaker's voice to sound like that of target speaker, when source and target speakers speak different languages. It relies on non-parallel training data from two different languages, hence, is more

Externí odkaz: http://arxiv.org/abs/2008.04562

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání