Showing 1 - 10
of 8,023
for search: '"VOICE CONVERSION"'
Author:
Li, Yuke, Zhu, Xinfa, Li, Hanzhao, Yao, JiXun, Tian, WenJie, Yang, XiPeng, Chen, YunLin, Li, Zhifei, Xie, Lei
Zero-shot voice conversion (VC) aims to convert the original speaker's timbre to any target speaker while keeping the linguistic content. Current mainstream zero-shot voice conversion approaches depend on pre-trained recognition models to disentangle
External link:
http://arxiv.org/abs/2411.18918
SKQVC: One-Shot Voice Conversion by K-Means Quantization with Self-Supervised Speech Representations
One-shot voice conversion (VC) is a method that enables the transformation between any two speakers using only a single target speaker utterance. Existing methods often rely on complex architectures and pre-trained speaker verification (SV) models to
External link:
http://arxiv.org/abs/2411.16147
Author:
Liu, Songting
Zero-shot voice conversion aims to transform a source speech utterance to match the timbre of a reference speech from an unseen speaker. Traditional approaches struggle with timbre leakage, insufficient timbre representation, and mismatches between t
External link:
http://arxiv.org/abs/2411.09943
Author:
Ghosh, Suhita, Jouaiti, Melanie, Das, Arnab, Sinha, Yamini, Polzehl, Tim, Siegert, Ingo, Stober, Sebastian
Speech anonymisation aims to protect speaker identity by changing personal identifiers in speech while retaining linguistic content. Current methods fail to retain prosody and unique speech patterns found in elderly and pathological speech domains, w
External link:
http://arxiv.org/abs/2410.15500
Author:
Hsu, Wen-Shin (wshsu@csmu.edu.tw), Lin, Guang-Tao (todlin89@gmail.com), Wang, Wei-Hsun (cmch10011@gmail.com)
Published in:
Diagnostics (2075-4418). Dec 2024, Vol. 14, Issue 23, p. 2693. 19 pp.
Zero-shot voice conversion (VC) aims to transfer the timbre from the source speaker to an arbitrary unseen speaker while preserving the original linguistic content. Despite recent advancements in zero-shot VC using language model-based or diffusion-b
External link:
http://arxiv.org/abs/2412.04724
Author:
He, Haorui, Song, Yuchen, Wang, Yuancheng, Li, Haoyang, Zhang, Xueyao, Wang, Li, Huang, Gongping, Chng, Eng Siong, Wu, Zhizheng
One-shot voice conversion (VC) aims to alter the timbre of speech from a source speaker to match that of a target speaker using just a single reference speech from the target, while preserving the semantic content of the original source speech. Despi
External link:
http://arxiv.org/abs/2411.19770
Zero-shot voice conversion (VC) aims to transform the timbre of a source speaker into any previously unseen target speaker, while preserving the original linguistic content. Despite notable progress, attaining a degree of speaker similarity and natur
External link:
http://arxiv.org/abs/2411.02026
Utterances by L2 speakers can be unintelligible due to mispronunciation and improper prosody. In computer-aided language learning systems, textual feedback is often provided using a speech recognition engine. However, an ideal form of feedback for L2
External link:
http://arxiv.org/abs/2410.02239
Author:
Yang, Yuguang, Pan, Yu, Yao, Jixun, Zhang, Xiang, Ye, Jianhao, Zhou, Hongbin, Xie, Lei, Ma, Lei, Zhao, Jianjun
Zero-shot voice conversion (VC) aims to transform the source speaker timbre into an arbitrary unseen one without altering the original speech content. While recent advancements in zero-shot VC methods have shown remarkable progress, there still remain
External link:
http://arxiv.org/abs/2410.01350