Výsledky vyhledávání

Report

DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination

Autor: Gong, Xuan, Ming, Tianshi, Wang, Xinpeng, Wei, Zhihua

Despite the great success of Large Vision-Language Models (LVLMs), they inevitably suffer from hallucination. As we know, both the visual encoder and the Large Language Model (LLM) decoder in LVLMs are Transformer-based, allowing the model to extract

Externí odkaz: http://arxiv.org/abs/2410.04514

Zobrazit plný text záznamu

Report

Self-supervised 3D Patient Modeling with Multi-modal Attentive Fusion

Autor: Zheng, Meng, Planche, Benjamin, Gong, Xuan, Yang, Fan, Chen, Terrence, Wu, Ziyan

3D patient body modeling is critical to the success of automated patient positioning for smart medical scanning and operating rooms. Existing CNN-based end-to-end patient modeling solutions typically require a) customized network designs demanding la

Externí odkaz: http://arxiv.org/abs/2403.03217

Zobrazit plný text záznamu

Report

Spectrum AUC Difference (SAUCD): Human-aligned 3D Shape Evaluation

Autor: Luan, Tianyu, Li, Zhong, Chen, Lele, Gong, Xuan, Chen, Lichang, Xu, Yi, Yuan, Junsong

Existing 3D mesh shape evaluation metrics mainly focus on the overall shape but are usually less sensitive to local details. This makes them inconsistent with human evaluation, as human perception cares about both overall and detailed shape. In this

Externí odkaz: http://arxiv.org/abs/2403.01619

Zobrazit plný text záznamu

Report

Federated Learning via Input-Output Collaborative Distillation

Autor: Gong, Xuan, Li, Shanglin, Bao, Yuxiang, Yao, Barry, Huang, Yawen, Wu, Ziyan, Zhang, Baochang, Zheng, Yefeng, Doermann, David

Federated learning (FL) is a machine learning paradigm in which distributed local nodes collaboratively train a central model without sharing individually held private data. Existing FL methods either iteratively share local model parameters or deplo

Externí odkaz: http://arxiv.org/abs/2312.14478

Zobrazit plný text záznamu

Report

Decom--CAM: Tell Me What You See, In Details! Feature-Level Interpretation via Decomposition Class Activation Map

Autor: Yang, Yuguang, Guo, Runtang, Wu, Sheng, Wang, Yimi, Zhang, Juan, Gong, Xuan, Zhang, Baochang

Interpretation of deep learning remains a very challenging problem. Although the Class Activation Map (CAM) is widely used to interpret deep model predictions by highlighting object location, it fails to provide insight into the salient features used

Externí odkaz: http://arxiv.org/abs/2306.04644

Zobrazit plný text záznamu

Akademický článek

Study on the Preparation and Performance of Ni-P/Diamond Composite Coatings on 2014 Aluminum Alloy Surfaces

Autor: GONG Xuan, LIU Jiachen, CUI Yan, LI Qiang

Publikováno v: Cailiao Baohu, Vol 57, Iss 9, Pp 148-153 (2024)

In order to enhance the wear resistance of aluminum alloy surfaces，Ni-P/diamond composite coatings were prepared on 2014 aluminum alloy surfaces using electroplating method.The effects of varying diamond particle content in the coating on the coati

Externí odkaz: https://doaj.org/article/00ca426dea224dbab1fdd7058a0817a4

Zobrazit plný text záznamu

Report

Harnessing Low-Frequency Neural Fields for Few-Shot View Synthesis

Autor: Song, Liangchen, Li, Zhong, Gong, Xuan, Chen, Lele, Chen, Zhang, Xu, Yi, Yuan, Junsong

Neural Radiance Fields (NeRF) have led to breakthroughs in the novel view synthesis problem. Positional Encoding (P.E.) is a critical factor that brings the impressive performance of NeRF, where low-dimensional coordinates are mapped to high-dimensio

Externí odkaz: http://arxiv.org/abs/2303.08370

Zobrazit plný text záznamu

Report

Progressive Multi-view Human Mesh Recovery with Self-Supervision

Autor: Gong, Xuan, Song, Liangchen, Zheng, Meng, Planche, Benjamin, Chen, Terrence, Yuan, Junsong, Doermann, David, Wu, Ziyan

To date, little attention has been given to multi-view 3D human mesh estimation, despite real-life applicability (e.g., motion capture, sport analysis) and robustness to single-view ambiguities. Existing solutions typically suffer from poor generaliz

Externí odkaz: http://arxiv.org/abs/2212.05223

Zobrazit plný text záznamu

Report

Federated Learning with Privacy-Preserving Ensemble Attention Distillation

Autor: Gong, Xuan, Song, Liangchen, Vedula, Rishi, Sharma, Abhishek, Zheng, Meng, Planche, Benjamin, Innanje, Arun, Chen, Terrence, Yuan, Junsong, Doermann, David, Wu, Ziyan

Federated Learning (FL) is a machine learning paradigm where many local nodes collaboratively train a central model while keeping the training data decentralized. This is particularly relevant for clinical applications since patient data are usually

Externí odkaz: http://arxiv.org/abs/2210.08464

Zobrazit plný text záznamu

Report

PREF: Predictability Regularized Neural Motion Fields

Autor: Song, Liangchen, Gong, Xuan, Planche, Benjamin, Zheng, Meng, Doermann, David, Yuan, Junsong, Chen, Terrence, Wu, Ziyan

Knowing the 3D motions in a dynamic scene is essential to many vision applications. Recent progress is mainly focused on estimating the activity of some specific elements like humans. In this paper, we leverage a neural motion field for estimating th

Externí odkaz: http://arxiv.org/abs/2209.10691

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání