Showing 1 - 10 of 10 for search: '"Ju, Zeqian"'
Authors:
Ye, Zhen, Ju, Zeqian, Liu, Haohe, Tan, Xu, Chen, Jianyi, Lu, Yiwen, Sun, Peiwen, Pan, Jiahao, Bian, Weizhen, He, Shulin, Liu, Qifeng, Guo, Yike, Xue, Wei
Recent progress in large-scale zero-shot speech synthesis has been significantly advanced by language models and diffusion models. However, the generation process of both methods is slow and computationally intensive. Efficient speech synthesis using
External link:
http://arxiv.org/abs/2404.14700
Authors:
Xin, Detai, Tan, Xu, Shen, Kai, Ju, Zeqian, Yang, Dongchao, Wang, Yuancheng, Takamichi, Shinnosuke, Saruwatari, Hiroshi, Liu, Shujie, Li, Jinyu, Zhao, Sheng
We present RALL-E, a robust language modeling method for text-to-speech (TTS) synthesis. While previous work based on large language models (LLMs) shows impressive performance on zero-shot TTS, such methods often suffer from poor robustness, such as
External link:
http://arxiv.org/abs/2404.03204
Authors:
Ju, Zeqian, Wang, Yuancheng, Shen, Kai, Tan, Xu, Xin, Detai, Yang, Dongchao, Liu, Yanqing, Leng, Yichong, Song, Kaitao, Tang, Siliang, Wu, Zhizheng, Qin, Tao, Li, Xiang-Yang, Ye, Wei, Zhang, Shikun, Bian, Jiang, He, Lei, Li, Jinyu, Zhao, Sheng
While recent large-scale text-to-speech (TTS) models have achieved significant progress, they still fall short in speech quality, similarity, and prosody. Considering speech intricately encompasses various attributes (e.g., content, prosody, timbre,
External link:
http://arxiv.org/abs/2403.03100
Authors:
Leng, Yichong, Guo, Zhifang, Shen, Kai, Tan, Xu, Ju, Zeqian, Liu, Yanqing, Liu, Yufei, Yang, Dongchao, Zhang, Leying, Song, Kaitao, He, Lei, Li, Xiang-Yang, Zhao, Sheng, Qin, Tao, Bian, Jiang
Speech conveys more information than text, as the same word can be uttered in various voices to convey diverse information. Compared to traditional text-to-speech (TTS) methods relying on speech prompts (reference speech) for voice variability, using
External link:
http://arxiv.org/abs/2309.02285
Authors:
Shen, Kai, Ju, Zeqian, Tan, Xu, Liu, Yanqing, Leng, Yichong, He, Lei, Qin, Tao, Zhao, Sheng, Bian, Jiang
Scaling text-to-speech (TTS) to large-scale, multi-speaker, and in-the-wild datasets is important to capture the diversity in human speech such as speaker identities, prosodies, and styles (e.g., singing). Current large TTS systems usually quantize s
External link:
http://arxiv.org/abs/2304.09116
Audio editing is applicable for various purposes, such as adding background sound effects, replacing a musical instrument, and repairing damaged audio. Recently, some diffusion-based methods achieved zero-shot audio editing by using a diffusion and d
External link:
http://arxiv.org/abs/2304.00830
Authors:
Ju, Zeqian, Lu, Peiling, Tan, Xu, Wang, Rui, Zhang, Chen, Wu, Songruoyao, Zhang, Kejun, Li, Xiangyang, Qin, Tao, Liu, Tie-Yan
Lyric-to-melody generation is an important task in automatic songwriting. Previous lyric-to-melody generation systems usually adopt end-to-end models that directly generate melodies from lyrics, which suffer from several issues: 1) lack of paired lyr
External link:
http://arxiv.org/abs/2109.09617
Symbolic music understanding, which refers to the understanding of music from the symbolic data (e.g., MIDI format, but not audio), covers many music applications such as genre classification, emotion classification, and music pieces matching. While
External link:
http://arxiv.org/abs/2106.05630
Authors:
Yang, Wenmian, Zeng, Guangtao, Tan, Bowen, Ju, Zeqian, Chakravorty, Subrato, He, Xuehai, Chen, Shu, Yang, Xingyi, Wu, Qingyang, Yu, Zhou, Xing, Eric, Xie, Pengtao
Under the pandemic of COVID-19, people experiencing COVID-19-related symptoms or exposed to risk factors have a pressing need to consult doctors. Due to hospital closure, a lot of consulting services have been moved online. Because of the shortage of
External link:
http://arxiv.org/abs/2005.05442
Authors:
He, Xuehai, Chen, Shu, Ju, Zeqian, Dong, Xiangyu, Fang, Hongchao, Wang, Sicheng, Yang, Yue, Zeng, Jiaqi, Zhang, Ruisi, Zhang, Ruoyu, Zhou, Meng, Zhu, Penghui, Xie, Pengtao
Medical dialogue systems are promising in assisting in telemedicine to increase access to healthcare services, improve the quality of patient care, and reduce medical costs. To facilitate the research and development of medical dialogue systems, we b
External link:
http://arxiv.org/abs/2004.03329