Zobrazeno 1 - 10
of 1 087
pro vyhledávání: '"XU, XIANGMIN"'
Currently, large language models (LLMs) have made significant progress in the field of psychological counseling. However, existing mental health LLMs overlook a critical issue where they do not consider the fact that different psychological counselor
Externí odkaz:
http://arxiv.org/abs/2412.13660
Autor:
Yan, Huachao, Guo, Kailing, Song, Shiwei, Dai, Yihai, Wei, Xiaoqiang, Xing, Xiaofen, Xu, Xiangmin
Diagnosing seizure onset zone (SOZ) is a challenge in neurosurgery, where stereoelectroencephalography (sEEG) serves as a critical technique. In sEEG SOZ identification, the existing studies focus solely on the intra-patient representation of epilept
Externí odkaz:
http://arxiv.org/abs/2412.12651
We present a novel approach to personalized sleep health management using few-shot Chain-of-Thought (CoT) distillation, enabling small-scale language models (> 2B parameters) to rival the performance of large language models (LLMs) in specialized hea
Externí odkaz:
http://arxiv.org/abs/2410.16924
This paper explores the growing need for task-oriented communications in warehouse logistics, where traditional communication Key Performance Indicators (KPIs)-such as latency, reliability, and throughput-often do not fully meet task requirements. As
Externí odkaz:
http://arxiv.org/abs/2410.01515
Speech emotion recognition plays a crucial role in human-machine interaction systems. Recently various optimized Transformers have been successfully applied to speech emotion recognition. However, the existing Transformer architectures focus more on
Externí odkaz:
http://arxiv.org/abs/2410.00390
Utilizing functional near-infrared spectroscopy (fNIRS) signals for emotion recognition is a significant advancement in understanding human emotions. However, due to the lack of artificial intelligence data and algorithms in this field, current resea
Externí odkaz:
http://arxiv.org/abs/2409.16081
Autor:
Li, Zhipeng, Xing, Xiaofen, Wang, Jun, Chen, Shuaiqi, Yu, Guoqiao, Wan, Guanglu, Xu, Xiangmin
In recent years, there has been significant progress in Text-to-Speech (TTS) synthesis technology, enabling the high-quality synthesis of voices in common scenarios. In unseen situations, adaptive TTS requires a strong generalization capability to sp
Externí odkaz:
http://arxiv.org/abs/2409.05730
Supporting real-time interactions between human controllers and remote devices remains a challenging goal in the Metaverse due to the stringent requirements on computing workload, communication throughput, and round-trip latency. In this paper, we es
Externí odkaz:
http://arxiv.org/abs/2407.16591
Real-time three-dimensional (3D) scene representations serve as one of the building blocks that bolster various innovative applications, e.g., digital manufacturing, Virtual/Augmented/Extended/Mixed Reality (VR/AR/XR/MR), and the metaverse. Despite s
Externí odkaz:
http://arxiv.org/abs/2407.16575
Multimodal large language models (MLLMs) are flourishing, but mainly focus on images with less attention than videos, especially in sub-fields such as prompt engineering, video chain-of-thought (CoT), and instruction tuning on videos. Therefore, we t
Externí odkaz:
http://arxiv.org/abs/2407.05355