Výsledky vyhledávání - "Li Mingcheng"

Report

Improving Factuality in Large Language Models via Decoding-Time Hallucinatory and Truthful Comparators

Autor: Yang, Dingkang, Xiao, Dongling, Wei, Jinjie, Li, Mingcheng, Chen, Zhaoyu, Li, Ke, Zhang, Lihua

Despite their remarkable capabilities, Large Language Models (LLMs) are prone to generate responses that contradict verifiable facts, i.e., unfaithful hallucination content. Existing efforts generally focus on optimizing model parameters or editing s

Externí odkaz: http://arxiv.org/abs/2408.12325

Zobrazit plný text záznamu

Report

MaskBEV: Towards A Unified Framework for BEV Detection and Map Segmentation

Autor: Zhao, Xiao, Zhang, Xukun, Yang, Dingkang, Sun, Mingyang, Li, Mingcheng, Wang, Shunli, Zhang, Lihua

Accurate and robust multimodal multi-task perception is crucial for modern autonomous driving systems. However, current multimodal perception research follows independent paradigms designed for specific perception tasks, leading to a lack of compleme

Externí odkaz: http://arxiv.org/abs/2408.09122

Zobrazit plný text záznamu

Report

HybridOcc: NeRF Enhanced Transformer-based Multi-Camera 3D Occupancy Prediction

Autor: Zhao, Xiao, Chen, Bo, Sun, Mingyang, Yang, Dingkang, Wang, Youxing, Zhang, Xukun, Li, Mingcheng, Kou, Dongliang, Wei, Xiaoyi, Zhang, Lihua

Vision-based 3D semantic scene completion (SSC) describes autonomous driving scenes through 3D volume representations. However, the occlusion of invisible voxels by scene surfaces poses challenges to current SSC methods in hallucinating refined 3D ge

Externí odkaz: http://arxiv.org/abs/2408.09104

Zobrazit plný text záznamu

Report

Faster Diffusion Action Segmentation

Autor: Wang, Shuaibing, Wang, Shunli, Li, Mingcheng, Yang, Dingkang, Kuang, Haopeng, Qian, Ziyun, Zhang, Lihua

Temporal Action Segmentation (TAS) is an essential task in video analysis, aiming to segment and classify continuous frames into distinct action segments. However, the ambiguous boundaries between actions pose a significant challenge for high-precisi

Externí odkaz: http://arxiv.org/abs/2408.02024

Zobrazit plný text záznamu

Report

Asynchronous Multimodal Video Sequence Fusion via Learning Modality-Exclusive and -Agnostic Representations

Autor: Yang, Dingkang, Li, Mingcheng, Qu, Linhao, Yang, Kun, Zhai, Peng, Wang, Song, Zhang, Lihua

Understanding human intentions (e.g., emotions) from videos has received considerable attention recently. Video streams generally constitute a blend of temporal data stemming from distinct modalities, including natural language, facial expressions, a

Externí odkaz: http://arxiv.org/abs/2407.04955

Zobrazit plný text záznamu

Report

CoMT: Chain-of-Medical-Thought Reduces Hallucination in Medical Report Generation

Autor: Jiang, Yue, Chen, Jiawei, Yang, Dingkang, Li, Mingcheng, Wang, Shunli, Wu, Tong, Li, Ke, Zhang, Lihua

Automatic medical report generation (MRG), which possesses significant research value as it can aid radiologists in clinical diagnosis and report composition, has garnered increasing attention. Despite recent progress, generating accurate reports rem

Externí odkaz: http://arxiv.org/abs/2406.11451

Zobrazit plný text záznamu

Report

Detecting and Evaluating Medical Hallucinations in Large Vision Language Models

Autor: Chen, Jiawei, Yang, Dingkang, Wu, Tong, Jiang, Yue, Hou, Xiaolu, Li, Mingcheng, Wang, Shunli, Xiao, Dongling, Li, Ke, Zhang, Lihua

Large Vision Language Models (LVLMs) are increasingly integral to healthcare applications, including medical visual question answering and imaging report generation. While these models inherit the robust capabilities of foundational Large Language Mo

Externí odkaz: http://arxiv.org/abs/2406.10185

Zobrazit plný text záznamu

Report

PediatricsGPT: Large Language Models as Chinese Medical Assistants for Pediatric Applications

Autor: Yang, Dingkang, Wei, Jinjie, Xiao, Dongling, Wang, Shunli, Wu, Tong, Li, Gang, Li, Mingcheng, Wang, Shuaibing, Chen, Jiawei, Jiang, Yue, Xu, Qingyao, Li, Ke, Zhai, Peng, Zhang, Lihua

Developing intelligent pediatric consultation systems offers promising prospects for improving diagnostic efficiency, especially in China, where healthcare resources are scarce. Despite recent advances in Large Language Models (LLMs) for Chinese medi

Externí odkaz: http://arxiv.org/abs/2405.19266

Zobrazit plný text záznamu

Report

SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion

Autor: Qian, Ziyun, Xiao, Zeyu, Wu, Zhenyi, Yang, Dingkang, Li, Mingcheng, Wang, Shunli, Wang, Shuaibing, Kou, Dongliang, Zhang, Lihua

Motion style transfer is a significant research direction in multimedia applications. It enables the rapid switching of different styles of the same motion for virtual digital humans, thus vastly increasing the diversity and realism of movements. It

Externí odkaz: http://arxiv.org/abs/2405.02844

Zobrazit plný text záznamu

Report

Correlation-Decoupled Knowledge Distillation for Multimodal Sentiment Analysis with Incomplete Modalities

Autor: Li, Mingcheng, Yang, Dingkang, Zhao, Xiao, Wang, Shuaibing, Wang, Yan, Yang, Kun, Sun, Mingyang, Kou, Dongliang, Qian, Ziyun, Zhang, Lihua

Multimodal sentiment analysis (MSA) aims to understand human sentiment through multimodal data. Most MSA efforts are based on the assumption of modality completeness. However, in real-world applications, some practical factors cause uncertain modalit

Externí odkaz: http://arxiv.org/abs/2404.16456

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání