Zobrazeno 1 - 10
of 434
pro vyhledávání: '"Wang, Yuejiao"'
Autor:
Kang, Jiawen, Meng, Lingwei, Cui, Mingyu, Wang, Yuejiao, Wu, Xixin, Liu, Xunying, Meng, Helen
Multi-talker speech recognition (MTASR) faces unique challenges in disentangling and transcribing overlapping speech. To address these challenges, this paper investigates the role of Connectionist Temporal Classification (CTC) in speaker disentanglem
Externí odkaz:
http://arxiv.org/abs/2409.12388
Autor:
Meng, Lingwei, Hu, Shujie, Kang, Jiawen, Li, Zhaoqing, Wang, Yuejiao, Wu, Wenxuan, Wu, Xixin, Liu, Xunying, Meng, Helen
Recent advancements in large language models (LLMs) have revolutionized various domains, bringing significant progress and new opportunities. Despite progress in speech-related tasks, LLMs have not been sufficiently explored in multi-talker scenarios
Externí odkaz:
http://arxiv.org/abs/2409.08596
Functional magnetic resonance imaging (fMRI) is essential for developing encoding models that identify functional changes in language-related brain areas of individuals with Neurocognitive Disorders (NCD). While large language model (LLM)-based fMRI
Externí odkaz:
http://arxiv.org/abs/2407.10376
Autor:
Meng, Lingwei, Kang, Jiawen, Wang, Yuejiao, Jin, Zengrui, Wu, Xixin, Liu, Xunying, Meng, Helen
Multi-talker speech recognition and target-talker speech recognition, both involve transcription in multi-talker contexts, remain significant challenges. However, existing methods rarely attempt to simultaneously address both tasks. In this study, we
Externí odkaz:
http://arxiv.org/abs/2407.09817
Autor:
Chen, Xueyuan, Wang, Yuejiao, Wu, Xixin, Wang, Disong, Wu, Zhiyong, Liu, Xunying, Meng, Helen
Dysarthric speech reconstruction (DSR) aims to transform dysarthric speech into normal speech by improving the intelligibility and naturalness. This is a challenging task especially for patients with severe dysarthria and speaking in complex, noisy a
Externí odkaz:
http://arxiv.org/abs/2401.17796
Dysarthric speech reconstruction (DSR) systems aim to automatically convert dysarthric speech into normal-sounding speech. The technology eases communication with speakers affected by the neuromotor disorder and enhances their social inclusion. NED-b
Externí odkaz:
http://arxiv.org/abs/2401.14664
Although automatic speech recognition (ASR) can perform well in common non-overlapping environments, sustaining performance in multi-talker overlapping speech recognition remains challenging. Recent research revealed that ASR model's encoder captures
Externí odkaz:
http://arxiv.org/abs/2302.09908
Autor:
Song, Xinlun, Cui, Junshuo, Lou, Zhenning, Shan, Weijun, Yu, Haibiao, Feng, Xiaogeng, Wang, Yuejiao, Xiong, Ying
Publikováno v:
In Journal of Alloys and Compounds 15 August 2024 995
Autor:
Wang, Yuejiao1 (AUTHOR) wangyuejiao88@hnfnu.edu.cn, Cai, Chenguang2 (AUTHOR) caichenguang@hufe.edu.cn
Publikováno v:
Axioms (2075-1680). May2024, Vol. 13 Issue 5, p319. 16p.
Publikováno v:
In Separation and Purification Technology 3 December 2024 349