Zobrazeno 1 - 10
of 115
pro vyhledávání: '"fan, Cunhang"'
At a cocktail party, humans exhibit an impressive ability to direct their attention. The auditory attention detection (AAD) approach seeks to identify the attended speaker by analyzing brain signals, such as EEG signals. However, current AAD algorith
Externí odkaz:
http://arxiv.org/abs/2410.11181
Autor:
Qin, Zhanyue, Wang, Haochuan, Wang, Zecheng, Liu, Deyuan, Fan, Cunhang, Lv, Zhao, Tu, Zhiying, Chu, Dianhui, Sui, Dianbo
In recent years, with the maturation of large language model (LLM) technology and the emergence of high-quality programming code datasets, researchers have become increasingly confident in addressing the challenges of program synthesis automatically.
Externí odkaz:
http://arxiv.org/abs/2410.07820
Speech enhancement (SE) aims to extract the clean waveform from noise-contaminated measurements to improve the speech quality and intelligibility. Although learning-based methods can perform much better than traditional counterparts, the large comput
Externí odkaz:
http://arxiv.org/abs/2409.13285
Autor:
Qin, Zhanyue, Wang, Haochuan, Liu, Deyuan, Song, Ziyang, Fan, Cunhang, Lv, Zhao, Wu, Jinlin, Lei, Zhen, Tu, Zhiying, Chu, Dianhui, Yu, Xiaoyan, Sui, Dianbo
Sequential decision-making refers to algorithms that take into account the dynamics of the environment, where early decisions affect subsequent decisions. With large language models (LLMs) demonstrating powerful capabilities between tasks, we can't h
Externí odkaz:
http://arxiv.org/abs/2406.16382
Autor:
Liu, Deyuan, Qin, Zhanyue, Wang, Hairu, Yang, Zhao, Wang, Zecheng, Rong, Fangying, Liu, Qingbin, Hao, Yanchao, Chen, Xi, Fan, Cunhang, Lv, Zhao, Tu, Zhiying, Chu, Dianhui, Li, Bo, Sui, Dianbo
While large language models (LLMs) excel in many domains, their complexity and scale challenge deployment in resource-limited environments. Current compression techniques, such as parameter pruning, often fail to effectively utilize the knowledge fro
Externí odkaz:
http://arxiv.org/abs/2406.16330
In the telephony scenarios, the fake speech detection (FSD) task to combat speech spoofing attacks is challenging. Data augmentation (DA) methods are considered effective means to address the FSD task in telephony scenarios, typically divided into ti
Externí odkaz:
http://arxiv.org/abs/2406.09664
Autor:
Chen, Yujie, Yi, Jiangyan, Xue, Jun, Wang, Chenglong, Zhang, Xiaohui, Dong, Shunbo, Zeng, Siding, Tao, Jianhua, Zhao, Lv, Fan, Cunhang
Fake artefacts for discriminating between bonafide and fake audio can exist in both short- and long-range segments. Therefore, combining local and global feature information can effectively discriminate between bonafide and fake audio. This paper pro
Externí odkaz:
http://arxiv.org/abs/2406.06086
Publikováno v:
(2024) Vol. 38 No. 8: AAAI-24 Technical Tracks 8 Vol. 38 No. 8: AAAI-24 Technical Tracks 8 Vol. 38 No. 8: AAAI-24 Technical Tracks 8 Proceedings of the AAAI Conference on Artificial Intelligence, 38(8), 8380-8388
In recent years, knowledge graph completion (KGC) models based on pre-trained language model (PLM) have shown promising results. However, the large number of parameters and high computational cost of PLM models pose challenges for their application i
Externí odkaz:
http://arxiv.org/abs/2401.12997
Most research in synthetic speech detection (SSD) focuses on improving performance on standard noise-free datasets. However, in actual situations, noise interference is usually present, causing significant performance degradation in SSD systems. To i
Externí odkaz:
http://arxiv.org/abs/2310.08869
Autor:
Fan, Cunhang, Zhang, Hongyu, Huang, Wei, Xue, Jun, Tao, Jianhua, Yi, Jiangyan, Lv, Zhao, Wu, Xiaopei
Auditory Attention Detection (AAD) aims to detect target speaker from brain signals in a multi-speaker environment. Although EEG-based AAD methods have shown promising results in recent years, current approaches primarily rely on traditional convolut
Externí odkaz:
http://arxiv.org/abs/2309.07147