Zobrazeno 1 - 10
of 167
pro vyhledávání: '"Li Mingcheng"'
Autor:
Yang, Dingkang, Xiao, Dongling, Wei, Jinjie, Li, Mingcheng, Chen, Zhaoyu, Li, Ke, Zhang, Lihua
Despite their remarkable capabilities, Large Language Models (LLMs) are prone to generate responses that contradict verifiable facts, i.e., unfaithful hallucination content. Existing efforts generally focus on optimizing model parameters or editing s
Externí odkaz:
http://arxiv.org/abs/2408.12325
Autor:
Zhao, Xiao, Zhang, Xukun, Yang, Dingkang, Sun, Mingyang, Li, Mingcheng, Wang, Shunli, Zhang, Lihua
Accurate and robust multimodal multi-task perception is crucial for modern autonomous driving systems. However, current multimodal perception research follows independent paradigms designed for specific perception tasks, leading to a lack of compleme
Externí odkaz:
http://arxiv.org/abs/2408.09122
Autor:
Zhao, Xiao, Chen, Bo, Sun, Mingyang, Yang, Dingkang, Wang, Youxing, Zhang, Xukun, Li, Mingcheng, Kou, Dongliang, Wei, Xiaoyi, Zhang, Lihua
Vision-based 3D semantic scene completion (SSC) describes autonomous driving scenes through 3D volume representations. However, the occlusion of invisible voxels by scene surfaces poses challenges to current SSC methods in hallucinating refined 3D ge
Externí odkaz:
http://arxiv.org/abs/2408.09104
Autor:
Wang, Shuaibing, Wang, Shunli, Li, Mingcheng, Yang, Dingkang, Kuang, Haopeng, Qian, Ziyun, Zhang, Lihua
Temporal Action Segmentation (TAS) is an essential task in video analysis, aiming to segment and classify continuous frames into distinct action segments. However, the ambiguous boundaries between actions pose a significant challenge for high-precisi
Externí odkaz:
http://arxiv.org/abs/2408.02024
Understanding human intentions (e.g., emotions) from videos has received considerable attention recently. Video streams generally constitute a blend of temporal data stemming from distinct modalities, including natural language, facial expressions, a
Externí odkaz:
http://arxiv.org/abs/2407.04955
Autor:
Jiang, Yue, Chen, Jiawei, Yang, Dingkang, Li, Mingcheng, Wang, Shunli, Wu, Tong, Li, Ke, Zhang, Lihua
Automatic medical report generation (MRG), which possesses significant research value as it can aid radiologists in clinical diagnosis and report composition, has garnered increasing attention. Despite recent progress, generating accurate reports rem
Externí odkaz:
http://arxiv.org/abs/2406.11451
Autor:
Chen, Jiawei, Yang, Dingkang, Wu, Tong, Jiang, Yue, Hou, Xiaolu, Li, Mingcheng, Wang, Shunli, Xiao, Dongling, Li, Ke, Zhang, Lihua
Large Vision Language Models (LVLMs) are increasingly integral to healthcare applications, including medical visual question answering and imaging report generation. While these models inherit the robust capabilities of foundational Large Language Mo
Externí odkaz:
http://arxiv.org/abs/2406.10185
Autor:
Yang, Dingkang, Wei, Jinjie, Xiao, Dongling, Wang, Shunli, Wu, Tong, Li, Gang, Li, Mingcheng, Wang, Shuaibing, Chen, Jiawei, Jiang, Yue, Xu, Qingyao, Li, Ke, Zhai, Peng, Zhang, Lihua
Developing intelligent pediatric consultation systems offers promising prospects for improving diagnostic efficiency, especially in China, where healthcare resources are scarce. Despite recent advances in Large Language Models (LLMs) for Chinese medi
Externí odkaz:
http://arxiv.org/abs/2405.19266
Autor:
Qian, Ziyun, Xiao, Zeyu, Wu, Zhenyi, Yang, Dingkang, Li, Mingcheng, Wang, Shunli, Wang, Shuaibing, Kou, Dongliang, Zhang, Lihua
Motion style transfer is a significant research direction in multimedia applications. It enables the rapid switching of different styles of the same motion for virtual digital humans, thus vastly increasing the diversity and realism of movements. It
Externí odkaz:
http://arxiv.org/abs/2405.02844
Autor:
Li, Mingcheng, Yang, Dingkang, Zhao, Xiao, Wang, Shuaibing, Wang, Yan, Yang, Kun, Sun, Mingyang, Kou, Dongliang, Qian, Ziyun, Zhang, Lihua
Multimodal sentiment analysis (MSA) aims to understand human sentiment through multimodal data. Most MSA efforts are based on the assumption of modality completeness. However, in real-world applications, some practical factors cause uncertain modalit
Externí odkaz:
http://arxiv.org/abs/2404.16456