Zobrazeno 1 - 10
of 752
pro vyhledávání: '"GONG Xuan"'
Despite the great success of Large Vision-Language Models (LVLMs), they inevitably suffer from hallucination. As we know, both the visual encoder and the Large Language Model (LLM) decoder in LVLMs are Transformer-based, allowing the model to extract
Externí odkaz:
http://arxiv.org/abs/2410.04514
3D patient body modeling is critical to the success of automated patient positioning for smart medical scanning and operating rooms. Existing CNN-based end-to-end patient modeling solutions typically require a) customized network designs demanding la
Externí odkaz:
http://arxiv.org/abs/2403.03217
Existing 3D mesh shape evaluation metrics mainly focus on the overall shape but are usually less sensitive to local details. This makes them inconsistent with human evaluation, as human perception cares about both overall and detailed shape. In this
Externí odkaz:
http://arxiv.org/abs/2403.01619
Autor:
Gong, Xuan, Li, Shanglin, Bao, Yuxiang, Yao, Barry, Huang, Yawen, Wu, Ziyan, Zhang, Baochang, Zheng, Yefeng, Doermann, David
Federated learning (FL) is a machine learning paradigm in which distributed local nodes collaboratively train a central model without sharing individually held private data. Existing FL methods either iteratively share local model parameters or deplo
Externí odkaz:
http://arxiv.org/abs/2312.14478
Interpretation of deep learning remains a very challenging problem. Although the Class Activation Map (CAM) is widely used to interpret deep model predictions by highlighting object location, it fails to provide insight into the salient features used
Externí odkaz:
http://arxiv.org/abs/2306.04644
Publikováno v:
Cailiao Baohu, Vol 57, Iss 9, Pp 148-153 (2024)
In order to enhance the wear resistance of aluminum alloy surfaces,Ni-P/diamond composite coatings were prepared on 2014 aluminum alloy surfaces using electroplating method.The effects of varying diamond particle content in the coating on the coati
Externí odkaz:
https://doaj.org/article/00ca426dea224dbab1fdd7058a0817a4
Neural Radiance Fields (NeRF) have led to breakthroughs in the novel view synthesis problem. Positional Encoding (P.E.) is a critical factor that brings the impressive performance of NeRF, where low-dimensional coordinates are mapped to high-dimensio
Externí odkaz:
http://arxiv.org/abs/2303.08370
Autor:
Gong, Xuan, Song, Liangchen, Zheng, Meng, Planche, Benjamin, Chen, Terrence, Yuan, Junsong, Doermann, David, Wu, Ziyan
To date, little attention has been given to multi-view 3D human mesh estimation, despite real-life applicability (e.g., motion capture, sport analysis) and robustness to single-view ambiguities. Existing solutions typically suffer from poor generaliz
Externí odkaz:
http://arxiv.org/abs/2212.05223
Autor:
Gong, Xuan, Song, Liangchen, Vedula, Rishi, Sharma, Abhishek, Zheng, Meng, Planche, Benjamin, Innanje, Arun, Chen, Terrence, Yuan, Junsong, Doermann, David, Wu, Ziyan
Federated Learning (FL) is a machine learning paradigm where many local nodes collaboratively train a central model while keeping the training data decentralized. This is particularly relevant for clinical applications since patient data are usually
Externí odkaz:
http://arxiv.org/abs/2210.08464
Autor:
Song, Liangchen, Gong, Xuan, Planche, Benjamin, Zheng, Meng, Doermann, David, Yuan, Junsong, Chen, Terrence, Wu, Ziyan
Knowing the 3D motions in a dynamic scene is essential to many vision applications. Recent progress is mainly focused on estimating the activity of some specific elements like humans. In this paper, we leverage a neural motion field for estimating th
Externí odkaz:
http://arxiv.org/abs/2209.10691