Search results - "WANG Hanqing"

Report

SeqAfford: Sequential 3D Affordance Reasoning via Multimodal Large Language Model

Authors: Yu, Chunlin; Wang, Hanqing; Shi, Ye; Luo, Haoyang; Yang, Sibei; Yu, Jingyi; Wang, Jingya

3D affordance segmentation aims to link human instructions to touchable regions of 3D objects for embodied manipulations. Existing efforts typically adhere to single-object, single-affordance paradigms, where each affordance type or explicit instruct

External link: http://arxiv.org/abs/2412.01550

Report

MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning

Authors: Wang, Xujia; Zhao, Haiyan; Wang, Shuo; Wang, Hanqing; Liu, Zhiyuan

Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA have significantly improved the adaptation of LLMs to downstream tasks in a resource-efficient manner. However, in multi-task scenarios, challenges such as training imbalance and the seesaw eff

External link: http://arxiv.org/abs/2410.22782

Report

GRUtopia: Dream General Robots in a City at Scale

Recent works have been exploring the scaling laws in the field of Embodied AI. Given the prohibitive costs of collecting real-world data, we believe the Simulation-to-Real (Sim2Real) paradigm is a crucial step for scaling the learning of embodied mod

External link: http://arxiv.org/abs/2407.10943

Report

OVExp: Open Vocabulary Exploration for Object-Oriented Navigation

Authors: Wei, Meng; Wang, Tai; Chen, Yilun; Wang, Hanqing; Pang, Jiangmiao; Liu, Xihui

Object-oriented embodied navigation aims to locate specific objects, defined by category or depicted in images. Existing methods often struggle to generalize to open vocabulary goals without extensive training data. While recent advances in Vision-La

External link: http://arxiv.org/abs/2407.09016

Report

MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning

Authors: Wang, Hanqing; Li, Yixia; Wang, Shuo; Chen, Guanhua; Chen, Yun

Efficient finetuning of large language models (LLMs) aims to adapt the LLMs with reduced computational and memory cost. Previous LoRA-based approaches initialize the low-rank matrices with Gaussian distribution and zero values while keeping the origi

External link: http://arxiv.org/abs/2406.09044

Report

Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models

Authors: Ping, Bowen; Wang, Shuo; Wang, Hanqing; Han, Xu; Xu, Yuzhuang; Yan, Yukun; Chen, Yun; Chang, Baobao; Liu, Zhiyuan; Sun, Maosong

Fine-tuning is a crucial process for adapting large language models (LLMs) to diverse applications. In certain scenarios, such as multi-tenant serving, deploying multiple LLMs becomes necessary to meet complex demands. Recent studies suggest decompos

External link: http://arxiv.org/abs/2406.08903

Report

LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks

Authors: Wang, Hanqing; Ping, Bowen; Wang, Shuo; Han, Xu; Chen, Yun; Liu, Zhiyuan; Sun, Maosong

LoRA employs lightweight modules to customize large language models (LLMs) for each downstream task or domain, where different learned additional modules represent diverse skills. Combining existing LoRAs to address new tasks can enhance the reusabil

External link: http://arxiv.org/abs/2402.11455

Report

StyleBART: Decorate Pretrained Model with Style Adapters for Unsupervised Stylistic Headline Generation

Authors: Wang, Hanqing; Luo, Yajing; Xiong, Boya; Chen, Guanhua; Chen, Yun

Stylistic headline generation is the task to generate a headline that not only summarizes the content of an article, but also reflects a desired style that attracts users. As style-specific article-headline pairs are scarce, previous researches focus

External link: http://arxiv.org/abs/2310.17743

Report

DREAMWALKER: Mental Planning for Continuous Vision-Language Navigation

Authors: Wang, Hanqing; Liang, Wei; Van Gool, Luc; Wang, Wenguan

VLN-CE is a recently released embodied task, where AI agents need to navigate a freely traversable environment to reach a distant target location, given language instructions. It poses great challenges due to the huge space of possible strategies. Dr

External link: http://arxiv.org/abs/2308.07498

Report

ETPNav: Evolving Topological Planning for Vision-Language Navigation in Continuous Environments

Authors: An, Dong; Wang, Hanqing; Wang, Wenguan; Wang, Zun; Huang, Yan; He, Keji; Wang, Liang

Vision-language navigation is a task that requires an agent to follow instructions to navigate in environments. It becomes increasingly crucial in the field of embodied AI, with potential applications in autonomous navigation, search and rescue, and

External link: http://arxiv.org/abs/2304.03047
