Search results - "XU, XIANGMIN"

Report

PsyDT: Using LLMs to Construct the Digital Twin of Psychological Counselor with Personalized Counseling Style for Psychological Counseling

Authors: Xie, Haojie; Chen, Yirong; Xing, Xiaofen; Lin, Jingkai; Xu, Xiangmin

Currently, large language models (LLMs) have made significant progress in the field of psychological counseling. However, existing mental health LLMs overlook a critical issue where they do not consider the fact that different psychological counselor

External link: http://arxiv.org/abs/2412.13660

Report

Shared Attention-based Autoencoder with Hierarchical Fusion-based Graph Convolution Network for sEEG SOZ Identification

Authors: Yan, Huachao; Guo, Kailing; Song, Shiwei; Dai, Yihai; Wei, Xiaoqiang; Xing, Xiaofen; Xu, Xiangmin

Diagnosing seizure onset zone (SOZ) is a challenge in neurosurgery, where stereoelectroencephalography (sEEG) serves as a critical technique. In sEEG SOZ identification, the existing studies focus solely on the intra-patient representation of epilept

External link: http://arxiv.org/abs/2412.12651

Report

SleepCoT: A Lightweight Personalized Sleep Health Model via Chain-of-Thought Distillation

Authors: Zheng, Huimin; Xing, Xiaofeng; Xu, Xiangmin

We present a novel approach to personalized sleep health management using few-shot Chain-of-Thought (CoT) distillation, enabling small-scale language models (> 2B parameters) to rival the performance of large language models (LLMs) in specialized hea

External link: http://arxiv.org/abs/2410.16924

Report

Task-Oriented Edge-Assisted Cooperative Data Compression, Communications and Computing for UGV-Enhanced Warehouse Logistics

Authors: Yang, Jiaming; Meng, Zhen; Xu, Xiangmin; Chen, Kan; Li, Emma Liying; Zhao, Philip Guodong G.

This paper explores the growing need for task-oriented communications in warehouse logistics, where traditional communication Key Performance Indicators (KPIs), such as latency, reliability, and throughput, often do not fully meet task requirements. As

External link: http://arxiv.org/abs/2410.01515

Report

Multi-Scale Temporal Transformer For Speech Emotion Recognition

Authors: Li, Zhipeng; Xing, Xiaofen; Fang, Yuanbo; Zhang, Weibin; Fan, Hengsheng; Xu, Xiangmin

Speech emotion recognition plays a crucial role in human-machine interaction systems. Recently various optimized Transformers have been successfully applied to speech emotion recognition. However, the existing Transformer architectures focus more on

External link: http://arxiv.org/abs/2410.00390

Report

Online Multi-level Contrastive Representation Distillation for Cross-Subject fNIRS Emotion Recognition

Authors: Lai, Zhili; Qing, Chunmei; Tan, Junpeng; Luo, Wanxiang; Xu, Xiangmin

Utilizing functional near-infrared spectroscopy (fNIRS) signals for emotion recognition is a significant advancement in understanding human emotions. However, due to the lack of artificial intelligence data and algorithms in this field, current resea

External link: http://arxiv.org/abs/2409.16081

Report

AS-Speech: Adaptive Style For Speech Synthesis

Authors: Li, Zhipeng; Xing, Xiaofen; Wang, Jun; Chen, Shuaiqi; Yu, Guoqiao; Wan, Guanglu; Xu, Xiangmin

In recent years, there has been significant progress in Text-to-Speech (TTS) synthesis technology, enabling the high-quality synthesis of voices in common scenarios. In unseen situations, adaptive TTS requires a strong generalization capability to sp

External link: http://arxiv.org/abs/2409.05730

Report

Real-Time Interactions Between Human Controllers and Remote Devices in Metaverse

Authors: Chen, Kan; Meng, Zhen; Xu, Xiangmin; She, Changyang; Zhao, Philip G.

Supporting real-time interactions between human controllers and remote devices remains a challenging goal in the Metaverse due to the stringent requirements on computing workload, communication throughput, and round-trip latency. In this paper, we es

External link: http://arxiv.org/abs/2407.16591

Report

Timeliness-Fidelity Tradeoff in 3D Scene Representations

Authors: Xu, Xiangmin; Meng, Zhen; Zhang, Yichi; She, Changyang; Zhao, Philip G.

Real-time three-dimensional (3D) scene representations serve as one of the building blocks that bolster various innovative applications, e.g., digital manufacturing, Virtual/Augmented/Extended/Mixed Reality (VR/AR/XR/MR), and the metaverse. Despite s

External link: http://arxiv.org/abs/2407.16575

Report

VideoCoT: A Video Chain-of-Thought Dataset with Active Annotation Tool

Authors: Wang, Yan; Zeng, Yawen; Zheng, Jingsheng; Xing, Xiaofen; Xu, Jin; Xu, Xiangmin

Multimodal large language models (MLLMs) are flourishing, but mainly focus on images, with less attention paid to videos, especially in sub-fields such as prompt engineering, video chain-of-thought (CoT), and instruction tuning on videos. Therefore, we t

External link: http://arxiv.org/abs/2407.05355
