Výsledky vyhledávání - "Li, Pengxiang"

Report

Task-oriented Sequential Grounding in 3D Scenes

Autor: Zhang, Zhuofan, Zhu, Ziyu, Li, Pengxiang, Liu, Tengyu, Ma, Xiaojian, Chen, Yixin, Jia, Baoxiong, Huang, Siyuan, Li, Qing

Grounding natural language in physical 3D environments is essential for the advancement of embodied artificial intelligence. Current datasets and models for 3D visual grounding predominantly focus on identifying and localizing objects from static, ob

Externí odkaz: http://arxiv.org/abs/2408.04034

Zobrazit plný text záznamu

Report

FIRE: A Dataset for Feedback Integration and Refinement Evaluation of Multimodal Models

Autor: Li, Pengxiang, Gao, Zhi, Zhang, Bofei, Yuan, Tao, Wu, Yuwei, Harandi, Mehrtash, Jia, Yunde, Zhu, Song-Chun, Li, Qing

Vision language models (VLMs) have achieved impressive progress in diverse applications, becoming a prevalent research direction. In this paper, we build FIRE, a feedback-refinement dataset, consisting of 1.1M multi-turn conversations that are derive

Externí odkaz: http://arxiv.org/abs/2407.11522

Zobrazit plný text záznamu

Report

Poetry2Image: An Iterative Correction Framework for Images Generated from Chinese Classical Poetry

Autor: Jiang, Jing, Ling, Yiran, Li, Binzhu, Li, Pengxiang, Piao, Junming, Zhang, Yu

Text-to-image generation models often struggle with key element loss or semantic confusion in tasks involving Chinese classical poetry.Addressing this issue through fine-tuning models needs considerable training costs. Additionally, manual prompts fo

Externí odkaz: http://arxiv.org/abs/2407.06196

Zobrazit plný text záznamu

Report

OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection for Memory-Efficient LLM Fine-tuning

Autor: Li, Pengxiang, Yin, Lu, Gao, Xiaowei, Liu, Shiwei

The rapid advancements in Large Language Models (LLMs) have revolutionized various natural language processing tasks. However, the substantial size of LLMs presents significant challenges in training or fine-tuning. While parameter-efficient approach

Externí odkaz: http://arxiv.org/abs/2405.18380

Zobrazit plný text záznamu

Report

Automated Evaluation of Large Vision-Language Models on Self-driving Corner Cases

Autor: Chen, Kai, Li, Yanze, Zhang, Wenhua, Liu, Yanxin, Li, Pengxiang, Gao, Ruiyuan, Hong, Lanqing, Tian, Meng, Zhao, Xinhai, Li, Zhenguo, Yeung, Dit-Yan, Lu, Huchuan, Jia, Xu

Large Vision-Language Models (LVLMs) have received widespread attention in advancing the interpretable self-driving. Existing evaluations of LVLMs primarily focus on the multi-faceted capabilities in natural circumstances, lacking automated and quant

Externí odkaz: http://arxiv.org/abs/2404.10595

Zobrazit plný text záznamu

Report

TrackDiffusion: Tracklet-Conditioned Video Generation via Diffusion Models

Autor: Li, Pengxiang, Chen, Kai, Liu, Zhili, Gao, Ruiyuan, Hong, Lanqing, Zhou, Guo, Yao, Hua, Yeung, Dit-Yan, Lu, Huchuan, Jia, Xu

Despite remarkable achievements in video synthesis, achieving granular control over complex dynamics, such as nuanced movement among multiple interacting objects, still presents a significant hurdle for dynamic world modeling, compounded by the neces

Externí odkaz: http://arxiv.org/abs/2312.00651

Zobrazit plný text záznamu

Akademický článek

Characterization study on external short circuit for lithium-ion battery safety management: From single cell to module

Autor: Zhang, Bo, Chen, Zeyu, Tao, Qingyi, Jiao, Meng, Li, Pengxiang, Zhou, Nan

Publikováno v: In Journal of Energy Storage 1 October 2024 99 Part A

Zobrazit plný text záznamu

Akademický článek

Assessment of probiotic Bacillus velezensis supplementation to reduce Campylobacter jejuni colonization in chickens

Autor: Cui, Yifang, Zhu, Jiajia, Li, Pengxiang ^*, Guo, Fangfang ^*, Yang, Bing ^*, Su, Xia ^*, Zhou, Hongzhuan ^*, Zhu, Kui, Xu, Fuzhou ^*

Publikováno v: In Poultry Science August 2024 103(8)

Zobrazit plný text záznamu

Report

A Decomposition Model for Stereo Matching

Autor: Yao, Chengtang, Jia, Yunde, Di, Huijun, Li, Pengxiang, Wu, Yuwei

In this paper, we present a decomposition model for stereo matching to solve the problem of excessive growth in computational cost (time and memory cost) as the resolution increases. In order to reduce the huge cost of stereo matching at the original

Externí odkaz: http://arxiv.org/abs/2104.07516

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání