Zobrazeno 1 - 10
of 482
pro vyhledávání: '"Li, Pengxiang"'
Autor:
Zhang, Zhuofan, Zhu, Ziyu, Li, Pengxiang, Liu, Tengyu, Ma, Xiaojian, Chen, Yixin, Jia, Baoxiong, Huang, Siyuan, Li, Qing
Grounding natural language in physical 3D environments is essential for the advancement of embodied artificial intelligence. Current datasets and models for 3D visual grounding predominantly focus on identifying and localizing objects from static, ob
Externí odkaz:
http://arxiv.org/abs/2408.04034
Autor:
Li, Pengxiang, Gao, Zhi, Zhang, Bofei, Yuan, Tao, Wu, Yuwei, Harandi, Mehrtash, Jia, Yunde, Zhu, Song-Chun, Li, Qing
Vision language models (VLMs) have achieved impressive progress in diverse applications, becoming a prevalent research direction. In this paper, we build FIRE, a feedback-refinement dataset, consisting of 1.1M multi-turn conversations that are derive
Externí odkaz:
http://arxiv.org/abs/2407.11522
Text-to-image generation models often struggle with key element loss or semantic confusion in tasks involving Chinese classical poetry.Addressing this issue through fine-tuning models needs considerable training costs. Additionally, manual prompts fo
Externí odkaz:
http://arxiv.org/abs/2407.06196
The rapid advancements in Large Language Models (LLMs) have revolutionized various natural language processing tasks. However, the substantial size of LLMs presents significant challenges in training or fine-tuning. While parameter-efficient approach
Externí odkaz:
http://arxiv.org/abs/2405.18380
Autor:
Chen, Kai, Li, Yanze, Zhang, Wenhua, Liu, Yanxin, Li, Pengxiang, Gao, Ruiyuan, Hong, Lanqing, Tian, Meng, Zhao, Xinhai, Li, Zhenguo, Yeung, Dit-Yan, Lu, Huchuan, Jia, Xu
Large Vision-Language Models (LVLMs) have received widespread attention in advancing the interpretable self-driving. Existing evaluations of LVLMs primarily focus on the multi-faceted capabilities in natural circumstances, lacking automated and quant
Externí odkaz:
http://arxiv.org/abs/2404.10595
Autor:
Li, Pengxiang, Chen, Kai, Liu, Zhili, Gao, Ruiyuan, Hong, Lanqing, Zhou, Guo, Yao, Hua, Yeung, Dit-Yan, Lu, Huchuan, Jia, Xu
Despite remarkable achievements in video synthesis, achieving granular control over complex dynamics, such as nuanced movement among multiple interacting objects, still presents a significant hurdle for dynamic world modeling, compounded by the neces
Externí odkaz:
http://arxiv.org/abs/2312.00651
Publikováno v:
In Journal of Energy Storage 1 October 2024 99 Part A
Autor:
Cui, Yifang, Zhu, Jiajia, Li, Pengxiang *, Guo, Fangfang *, Yang, Bing *, Su, Xia *, Zhou, Hongzhuan *, Zhu, Kui, Xu, Fuzhou *
Publikováno v:
In Poultry Science August 2024 103(8)
In this paper, we present a decomposition model for stereo matching to solve the problem of excessive growth in computational cost (time and memory cost) as the resolution increases. In order to reduce the huge cost of stereo matching at the original
Externí odkaz:
http://arxiv.org/abs/2104.07516
Autor:
Ning, Zigong, Zhou, Shuang, Li, Pengxiang, Li, Rong, Liu, Feihua, Zhao, Zilong, Ren, Nanqi, Lu, Lu
Publikováno v:
In Chemosphere December 2023 345