Showing 1 - 4 of 4 for search: '"Yang, Gaobin"'
Although fully end-to-end speaker diarization systems have made significant progress in recent years, modular systems often achieve superior results in real-world scenarios due to their greater adaptability and robustness. Historically, modular speak
External link:
http://arxiv.org/abs/2409.16803
Authors:
Niu, Shutong, Wang, Ruoyu, Du, Jun, Yang, Gaobin, Tu, Yanhui, Wu, Siyuan, Qian, Shuangqing, Wu, Huaxin, Xu, Haitao, Zhang, Xueyang, Zhong, Guolong, Yu, Xindi, Chen, Jieru, Wang, Mengzhi, Cai, Di, Gao, Tian, Wan, Genshun, Ma, Feng, Pan, Jia, Gao, Jianqing
This technical report outlines our submission system for the CHiME-8 NOTSOFAR-1 Challenge. The primary difficulty of this challenge is the dataset recorded across various conference rooms, which captures real-world complexities such as high overlap r
External link:
http://arxiv.org/abs/2409.02041
Authors:
Yang, Gaobin, He, Maokui, Niu, Shutong, Wang, Ruoyu, Yue, Yanyan, Qian, Shuangqing, Wu, Shilong, Du, Jun, Lee, Chin-Hui
We propose a novel neural speaker diarization system using memory-aware multi-speaker embedding with sequence-to-sequence architecture (NSD-MS2S), which integrates the strengths of memory-aware multi-speaker embedding (MA-MSE) and sequence-to-sequenc
External link:
http://arxiv.org/abs/2309.09180
Authors:
Wang, Ruoyu, He, Maokui, Du, Jun, Zhou, Hengshun, Niu, Shutong, Chen, Hang, Yue, Yanyan, Yang, Gaobin, Wu, Shilong, Sun, Lei, Tu, Yanhui, Tang, Haitao, Qian, Shuangqing, Gao, Tian, Wang, Mengzhi, Wan, Genshun, Pan, Jia, Gao, Jianqing, Lee, Chin-Hui
This technical report details our submission system to the CHiME-7 DASR Challenge, which focuses on speaker diarization and speech recognition under complex multi-speaker scenarios. Additionally, it also evaluates the efficiency of systems in handlin
External link:
http://arxiv.org/abs/2308.14638