Showing 1 - 10 of 1,120
for search: '"Wang, Zhendong"'
Aligning large language models with human preferences has emerged as a critical focus in language modeling research. Yet, integrating preference learning into Text-to-Image (T2I) generative models is still relatively uncharted territory. The Diffusion…
External link:
http://arxiv.org/abs/2406.06382
Diffusion-based text-to-image generation models trained on extensive text-image pairs have shown the capacity to generate photorealistic images consistent with textual descriptions. However, a significant limitation of these models is their slow sampling…
External link:
http://arxiv.org/abs/2406.01561
Traditional language model alignment methods, such as Direct Preference Optimization (DPO), are limited by their dependence on static, pre-collected paired preference data, which hampers their adaptability and practical applicability. To overcome this…
External link:
http://arxiv.org/abs/2405.20830
Offline reinforcement learning (RL) leverages pre-collected datasets to train optimal policies. Diffusion Q-Learning (DQL), introducing diffusion models as a powerful and expressive policy class, significantly boosts the performance of offline RL. However…
External link:
http://arxiv.org/abs/2405.19690
We introduce Score identity Distillation (SiD), an innovative data-free method that distills the generative capabilities of pretrained diffusion models into a single-step generator. SiD not only facilitates an exponentially fast reduction in Fréchet…
External link:
http://arxiv.org/abs/2404.04057
Author:
Chen, Xuxi, Wang, Zhendong, Sow, Daouda, Yang, Junjie, Chen, Tianlong, Liang, Yingbin, Zhou, Mingyuan, Wang, Zhangyang
In the rapidly advancing arena of large language models (LLMs), a key challenge is to enhance their capabilities amid a looming shortage of high-quality training data. Our study starts from an empirical strategy for the light continual training of LLMs…
External link:
http://arxiv.org/abs/2402.14270
In the field of large language models (LLMs), aligning models with the diverse preferences of users is a critical challenge. Direct Preference Optimization (DPO) has played a key role in this area. It works by using pairs of preferences derived from…
External link:
http://arxiv.org/abs/2402.10958
Author:
Bai, Chen, Shao, Zeman, Zhang, Guoxiang, Liang, Di, Yang, Jie, Zhang, Zhuorui, Guo, Yujian, Zhong, Chengzhang, Qiu, Yiqiao, Wang, Zhendong, Guan, Yichen, Zheng, Xiaoyin, Wang, Tao, Lu, Cheng
Realistic video simulation has shown significant potential across diverse applications, from virtual reality to film production. This is particularly true for scenarios where capturing videos in real-world settings is either impractical or expensive.
External link:
http://arxiv.org/abs/2401.17509
Author:
Chen, Tianqi, Liu, Yongfei, Wang, Zhendong, Yuan, Jianbo, You, Quanzeng, Yang, Hongxia, Zhou, Mingyuan
In light of the remarkable success of in-context learning in large language models, its potential extension to the vision domain, particularly with visual foundation models like Stable Diffusion, has sparked considerable interest. Existing approaches…
External link:
http://arxiv.org/abs/2312.01408
Among recent developments in time series forecasting methods, deep forecasting models have gained popularity as they can utilize hidden feature patterns in time series to improve forecasting performance. Nevertheless, the majority of current deep forecasting…
External link:
http://arxiv.org/abs/2310.08137