Showing 1 - 10 of 20 for search: '"Ni, Bolin"'
In this work, we present Xwin-LM, a comprehensive suite of alignment methodologies for large language models (LLMs). This suite encompasses several key techniques, including supervised finetuning (SFT), reward modeling (RM), rejection sampling finetuning…
External link:
http://arxiv.org/abs/2405.20335
Author:
Ni, Bolin, Zhao, Hongbo, Zhang, Chenghao, Hu, Ke, Meng, Gaofeng, Zhang, Zhaoxiang, Xiang, Shiming
Continual learning (CL) aims to empower models to learn new tasks without forgetting previously acquired knowledge. Most prior works concentrate on techniques of architectures, replay data, regularization, etc. However, the category name of each…
External link:
http://arxiv.org/abs/2403.16124
For the first time, we observe a high level of imbalance in the accuracy of different classes within the same old task. This intriguing phenomenon, discovered in replay-based Class Incremental Learning (CIL), highlights the imbalanced forgetting of learned…
External link:
http://arxiv.org/abs/2403.14910
Author:
Zhao, Hongbo, Ni, Bolin, Wang, Haochen, Fan, Junsong, Zhu, Fei, Wang, Yuxi, Chen, Yuntao, Meng, Gaofeng, Zhang, Zhaoxiang
For privacy and security concerns, the need to erase unwanted information from pre-trained vision models is becoming evident nowadays. In real-world scenarios, erasure requests originate at any time from both users and model owners. These requests us…
External link:
http://arxiv.org/abs/2403.11530
Author:
Peng, Houwen, Wu, Kan, Wei, Yixuan, Zhao, Guoshuai, Yang, Yuxiang, Liu, Ze, Xiong, Yifan, Yang, Ziyue, Ni, Bolin, Hu, Jingcheng, Li, Ruihang, Zhang, Miaosen, Li, Chen, Ning, Jia, Wang, Ruizhe, Zhang, Zheng, Liu, Shuguang, Chau, Joe, Hu, Han, Cheng, Peng
In this paper, we explore FP8 low-bit data formats for efficient training of large language models (LLMs). Our key insight is that most variables, such as gradients and optimizer states, in LLM training can employ low-precision data formats without c…
External link:
http://arxiv.org/abs/2310.18313
Author:
Ni, Bolin, Peng, Houwen, Chen, Minghao, Zhang, Songyang, Meng, Gaofeng, Fu, Jianlong, Xiang, Shiming, Ling, Haibin
Contrastive language-image pretraining has shown great success in learning visual-textual joint representation from web-scale data, demonstrating remarkable "zero-shot" generalization ability for various image tasks. However, how to effectively expand…
External link:
http://arxiv.org/abs/2208.02816
Author:
Nie, Xing, Ni, Bolin, Chang, Jianlong, Meng, Gaofeng, Huo, Chunlei, Zhang, Zhaoxiang, Xiang, Shiming, Tian, Qi, Pan, Chunhong
In computer vision, fine-tuning is the de-facto approach to leveraging pre-trained vision models for downstream tasks. However, deploying it in practice is quite challenging, due to adopting parameter-inefficient global updates and heavily relying…
External link:
http://arxiv.org/abs/2207.14381
Author:
Chen, Minghao, Wu, Kan, Ni, Bolin, Peng, Houwen, Liu, Bei, Fu, Jianlong, Chao, Hongyang, Ling, Haibin
Vision Transformer has shown great visual representation power in substantial vision tasks such as recognition and detection, and has thus been attracting fast-growing efforts on manually designing more effective architectures. In this paper, we propose…
External link:
http://arxiv.org/abs/2111.14725