Showing 1 - 10 of 24 for search: '"Liang, Chengdong"'
The performance of speaker verification degrades significantly in adverse acoustic environments with strong reverberation and noise. To address this issue, this paper proposes a spatial-temporal graph convolutional network (GCN) method for the multi-
External link:
http://arxiv.org/abs/2307.01386
Author:
Wang, Shuai, Liang, Chengdong, Xiang, Xu, Han, Bing, Chen, Zhengyang, Wang, Hongji, Ding, Wen
This report showcases the results achieved using the wespeaker toolkit for the VoxSRC2023 Challenge. Our aim is to provide participants, especially those with limited experience, with clear and straightforward guidelines to develop their initial syst
External link:
http://arxiv.org/abs/2306.15161
Author:
Liang, Chengdong, Zhang, Xiao-Lei, Zhang, BinBin, Wu, Di, Li, Shengqiang, Song, Xingchen, Peng, Zhendong, Pan, Fuping
Recently, the unified streaming and non-streaming two-pass (U2/U2++) end-to-end model for speech recognition has shown great performance in terms of streaming capability, accuracy and latency. In this paper, we present fast-U2++, an enhanced version
External link:
http://arxiv.org/abs/2211.00941
Author:
Wang, Hongji, Liang, Chengdong, Wang, Shuai, Chen, Zhengyang, Zhang, Binbin, Xiang, Xu, Deng, Yanlei, Qian, Yanmin
Speaker modeling is essential for many related tasks, such as speaker recognition and speaker diarization. The dominant modeling approach is fixed-dimensional vector representation, i.e., speaker embedding. This paper introduces a research and produc
External link:
http://arxiv.org/abs/2210.17016
Author:
Liu, Shupei, Feng, Linfeng, Gong, Yijun, Liang, Chengdong, Zhang, Chen, Zhang, Xiao-Lei, Li, Xuelong
While deep-learning-based speaker localization has shown advantages in challenging acoustic environments, it often yields only direction-of-arrival (DOA) cues rather than precise two-dimensional (2D) coordinates. To address this, we propose a novel d
External link:
http://arxiv.org/abs/2210.10265
Author:
Zhang, Hengtong, Gao, Yike, Zhu, Gaolong, Tan, Tiening, Liang, Chengdong, Hao, Shuai, Zhao, Chang, Chen, Wei, Ren, Dongsheng
Published in:
In Chemical Engineering Journal, 1 November 2024, 499
Speaker verification based on ad-hoc microphone arrays has the potential of reducing the error significantly in adverse acoustic environments. However, existing approaches extract utterance-level speaker embeddings from each channel of an ad-hoc micr
External link:
http://arxiv.org/abs/2110.05975
Author:
Wang, Shuai, Chen, Zhengyang, Han, Bing, Wang, Hongji, Liang, Chengdong, Zhang, Binbin, Xiang, Xu, Ding, Wen, Rohdin, Johan, Silnova, Anna, Qian, Yanmin, Li, Haizhou
Published in:
In Speech Communication, July 2024, 162
Deep neural networks provide effective solutions to small-footprint keyword spotting (KWS). However, if training data is limited, it remains challenging to achieve robust and highly accurate KWS in real-world scenarios where unseen sounds that are ou
External link:
http://arxiv.org/abs/2107.05859
Recently, ad-hoc microphone arrays have been widely studied. Unlike traditional microphone array settings, the spatial arrangement and number of microphones of ad-hoc microphone arrays are not known in advance, which hinders the adaptation of tradition
External link:
http://arxiv.org/abs/2107.00178