Showing 1 - 10 of 27 for search: '"NIU Shutong"'
Published in:
Fushe yanjiu yu fushe gongyi xuebao, Vol 42, Iss 2, Pp 020203-020203 (2024)
Borosilicate glass is currently the best comprehensive solidified material for deep disposal of high-level radioactive waste. The stability of borosilicate glass under irradiation is an important factor affecting the leakage of radioactive isotopes i
External link:
https://doaj.org/article/e70d0591a89a4d058492651bf1f04b35
In multimodal sentiment analysis, collecting text data is often more challenging than video or audio due to higher annotation costs and inconsistent automatic speech recognition (ASR) quality. To address this challenge, our study has developed a robu
External link:
http://arxiv.org/abs/2410.15029
In the two-person conversation scenario with one wearing smart glasses, transcribing and displaying the speaker's content in real-time is an intriguing application, providing a priori information for subsequent tasks such as translation and comprehen
External link:
http://arxiv.org/abs/2410.05986
Although fully end-to-end speaker diarization systems have made significant progress in recent years, modular systems often achieve superior results in real-world scenarios due to their greater adaptability and robustness. Historically, modular speak
External link:
http://arxiv.org/abs/2409.16803
Authors:
Niu, Shutong, Wang, Ruoyu, Du, Jun, Yang, Gaobin, Tu, Yanhui, Wu, Siyuan, Qian, Shuangqing, Wu, Huaxin, Xu, Haitao, Zhang, Xueyang, Zhong, Guolong, Yu, Xindi, Chen, Jieru, Wang, Mengzhi, Cai, Di, Gao, Tian, Wan, Genshun, Ma, Feng, Pan, Jia, Gao, Jianqing
This technical report outlines our submission system for the CHiME-8 NOTSOFAR-1 Challenge. The primary difficulty of this challenge is the dataset recorded across various conference rooms, which captures real-world complexities such as high overlap r
External link:
http://arxiv.org/abs/2409.02041
Authors:
Yang, Gaobin, He, Maokui, Niu, Shutong, Wang, Ruoyu, Yue, Yanyan, Qian, Shuangqing, Wu, Shilong, Du, Jun, Lee, Chin-Hui
We propose a novel neural speaker diarization system using memory-aware multi-speaker embedding with sequence-to-sequence architecture (NSD-MS2S), which integrates the strengths of memory-aware multi-speaker embedding (MA-MSE) and sequence-to-sequenc
External link:
http://arxiv.org/abs/2309.09180
Authors:
Wang, Ruoyu, He, Maokui, Du, Jun, Zhou, Hengshun, Niu, Shutong, Chen, Hang, Yue, Yanyan, Yang, Gaobin, Wu, Shilong, Sun, Lei, Tu, Yanhui, Tang, Haitao, Qian, Shuangqing, Gao, Tian, Wang, Mengzhi, Wan, Genshun, Pan, Jia, Gao, Jianqing, Lee, Chin-Hui
This technical report details our submission system to the CHiME-7 DASR Challenge, which focuses on speaker diarization and speech recognition under complex multi-speaker scenarios. It also evaluates the efficiency of systems in handlin
External link:
http://arxiv.org/abs/2308.14638
Most neural speaker diarization systems rely on sufficient manual training data labels, which are hard to collect under real-world scenarios. This paper proposes a semi-supervised speaker diarization system to utilize large-scale multi-channel traini
External link:
http://arxiv.org/abs/2307.08688
Authors:
He, Maokui, Lv, Xiang, Zhou, Weilin, Yin, JingJing, Zhang, Xiaoqi, Wang, Yuxuan, Niu, Shutong, Cao, Yuhang, Lu, Heng, Du, Jun, Lee, Chin-Hui
We propose two improvements to target-speaker voice activity detection (TS-VAD), the core component in our proposed speaker diarization system that was submitted to the 2022 Multi-Channel Multi-Party Meeting Transcription (M2MeT) challenge. These tec
External link:
http://arxiv.org/abs/2202.04855
Authors:
Wang, Yuxuan, He, Maokui, Niu, Shutong, Sun, Lei, Gao, Tian, Fang, Xin, Pan, Jia, Du, Jun, Lee, Chin-Hui
This system description describes our submission system to the Third DIHARD Speech Diarization Challenge. Besides the traditional clustering based system, the innovation of our system lies in the combination of various front-end techniques to solve t
External link:
http://arxiv.org/abs/2103.10661