Výsledky vyhledávání

Report

Wings: Learning Multimodal LLMs without Text-only Forgetting

Autor: Zhang, Yi-Kai, Lu, Shiyin, Li, Yang, Ma, Yanqing, Chen, Qing-Guo, Xu, Zhao, Luo, Weihua, Zhang, Kaifu, Zhan, De-Chuan, Ye, Han-Jia

Multimodal large language models (MLLMs), initiated with a trained LLM, first align images with text and then fine-tune on multimodal mixed inputs. However, the MLLM catastrophically forgets the text-only instructions, which do not include images and

Externí odkaz: http://arxiv.org/abs/2406.03496

Zobrazit plný text záznamu

Report

Parrot: Multilingual Visual Instruction Tuning

Autor: Sun, Hai-Long, Zhou, Da-Wei, Li, Yang, Lu, Shiyin, Yi, Chao, Chen, Qing-Guo, Xu, Zhao, Luo, Weihua, Zhang, Kaifu, Zhan, De-Chuan, Ye, Han-Jia

The rapid development of Multimodal Large Language Models (MLLMs) like GPT-4V has marked a significant step towards artificial general intelligence. Existing methods mainly focus on aligning vision encoders with LLMs through supervised fine-tuning (S

Externí odkaz: http://arxiv.org/abs/2406.02539

Zobrazit plný text záznamu

Report

Ovis: Structural Embedding Alignment for Multimodal Large Language Model

Autor: Lu, Shiyin, Li, Yang, Chen, Qing-Guo, Xu, Zhao, Luo, Weihua, Zhang, Kaifu, Ye, Han-Jia

Current Multimodal Large Language Models (MLLMs) typically integrate a pre-trained LLM with another pre-trained vision transformer through a connector, such as an MLP, endowing the LLM with visual capabilities. However, the misalignment between two e

Externí odkaz: http://arxiv.org/abs/2405.20797

Zobrazit plný text záznamu

Report

Non-stationary Projection-free Online Learning with Dynamic and Adaptive Regret Guarantees

Autor: Wang, Yibo, Yang, Wenhao, Jiang, Wei, Lu, Shiyin, Wang, Bing, Tang, Haihong, Wan, Yuanyu, Zhang, Lijun

Projection-free online learning has drawn increasing interest due to its efficiency in solving high-dimensional problems with complicated constraints. However, most existing projection-free online methods focus on minimizing the static regret, which

Externí odkaz: http://arxiv.org/abs/2305.11726

Zobrazit plný text záznamu

Report

Revisiting Smoothed Online Learning

Autor: Zhang, Lijun, Jiang, Wei, Lu, Shiyin, Yang, Tianbao

In this paper, we revisit the problem of smoothed online learning, in which the online learner suffers both a hitting cost and a switching cost, and target two performance metrics: competitive ratio and dynamic regret with switching cost. To bound th

Externí odkaz: http://arxiv.org/abs/2102.06933

Zobrazit plný text záznamu

Report

Minimizing Dynamic Regret and Adaptive Regret Simultaneously

Autor: Zhang, Lijun, Lu, Shiyin, Yang, Tianbao

Regret minimization is treated as the golden rule in the traditional study of online learning. However, regret minimization algorithms tend to converge to the static optimum, thus being suboptimal for changing environments. To address this limitation

Externí odkaz: http://arxiv.org/abs/2002.02085

Zobrazit plný text záznamu

Report

Adaptive and Efficient Algorithms for Tracking the Best Expert

Autor: Lu, Shiyin, Zhang, Lijun

In this paper, we consider the problem of prediction with expert advice in dynamic environments. We choose tracking regret as the performance metric and develop two adaptive and efficient algorithms with data-dependent tracking regret bounds. The fir

Externí odkaz: http://arxiv.org/abs/1909.02187

Zobrazit plný text záznamu

Report

Multi-Objective Generalized Linear Bandits

Autor: Lu, Shiyin, Wang, Guanghui, Hu, Yao, Zhang, Lijun

In this paper, we study the multi-objective bandits (MOB) problem, where a learner repeatedly selects one arm to play and then receives a reward vector consisting of multiple objectives. MOB has found many real-world applications as varied as online

Externí odkaz: http://arxiv.org/abs/1905.12879

Zobrazit plný text záznamu

Report

Adaptivity and Optimality: A Universal Algorithm for Online Convex Optimization

Autor: Wang, Guanghui, Lu, Shiyin, Zhang, Lijun

In this paper, we study adaptive online convex optimization, and aim to design a universal algorithm that achieves optimal regret bounds for multiple common types of loss functions. Existing universal methods are limited in the sense that they are op

Externí odkaz: http://arxiv.org/abs/1905.05917

Zobrazit plný text záznamu

Report

SAdam: A Variant of Adam for Strongly Convex Functions

Autor: Wang, Guanghui, Lu, Shiyin, Tu, Weiwei, Zhang, Lijun

The Adam algorithm has become extremely popular for large-scale machine learning. Under convexity condition, it has been proved to enjoy a data-dependant $O(\sqrt{T})$ regret bound where $T$ is the time horizon. However, whether strong convexity can

Externí odkaz: http://arxiv.org/abs/1905.02957

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání