Showing 1 - 10 of 664 for search: '"Yan, Xiaopeng"'
Author:
Liu, Mingshuai, Chen, Zhuangqi, Yan, Xiaopeng, Lv, Yuanjun, Xia, Xianjun, Huang, Chuanzeng, Xiao, Yijian, Xie, Lei
In real-time speech communication systems, speech signals are often degraded by multiple distortions. Recently, a two-stage Repair-and-Denoising network (RaD-Net) was proposed with superior speech quality improvement in the ICASSP 2024 Speech Signal
External link:
http://arxiv.org/abs/2406.07498
Author:
Liu, Mingshuai, Chen, Zhuangqi, Yan, Xiaopeng, Lv, Yuanjun, Xia, Xianjun, Huang, Chuanzeng, Xiao, Yijian, Xie, Lei
This paper introduces our repairing and denoising network (RaD-Net) for the ICASSP 2024 Speech Signal Improvement (SSI) Challenge. We extend our previous framework based on a two-stage network and propose an upgraded model. Specifically, we replace t
External link:
http://arxiv.org/abs/2401.04389
Author:
Han, Runduo, Yan, Xiaopeng, Xu, Weiming, Guo, Pengcheng, Sun, Jiayao, Wang, He, Lu, Quan, Jiang, Ning, Xie, Lei
This paper describes our audio-quality-based multi-strategy approach for the audio-visual target speaker extraction (AVTSE) task in the Multi-modal Information based Speech Processing (MISP) 2023 Challenge. Specifically, our approach adopts different
External link:
http://arxiv.org/abs/2401.03697
Deep learning based techniques have been popularly adopted in acoustic echo cancellation (AEC). Utilization of speaker representation has extended the frontier of AEC, thus attracting many researchers' interest in personalized acoustic echo cancellat
External link:
http://arxiv.org/abs/2310.04715
This paper describes our NPU-Elevoc personalized speech enhancement system (NAPSE) for the 5th Deep Noise Suppression Challenge at ICASSP 2023. Based on the superior two-stage model TEA-PSE 2.0, our system particularly explores better strategy for sp
External link:
http://arxiv.org/abs/2303.06811
In the field of cross-modal retrieval, single encoder models tend to perform better than dual encoder models, but they suffer from high latency and low throughput. In this paper, we present a dual encoder model called BagFormer that utilizes a cross
External link:
http://arxiv.org/abs/2212.14322
Published in:
Engineering Applications of Artificial Intelligence October 2024 136 Part B
Author:
Yan, Xiaopeng, Xiao, Jin, Kiki, Claude, Zhang, Yuanyuan, Manzi, Habasi Patrick, Zhao, Guangpu, Wang, ShengDa, Sun, Qian
Published in:
Environment International September 2024 191
Published in:
Heliyon 15 August 2024 10(15)
Published in:
Chaos, Solitons and Fractals: the interdisciplinary journal of Nonlinear Science, and Nonequilibrium and Complex Phenomena August 2024 185