Výsledky vyhledávání - "Fan, Mingyuan"

Report

Autor: Fei, Zhengcong, Fan, Mingyuan, Yu, Changqian, Huang, Junshi

This paper explores a simple extension of diffusion-based rectified flow Transformers for text-to-music generation, termed as FluxMusic. Generally, along with design in advanced Flux\footnote{https://github.com/black-forest-labs/flux} model, we trans

Externí odkaz: http://arxiv.org/abs/2409.00587

Zobrazit plný text záznamu

Report

FedMCP: Parameter-Efficient Federated Learning with Model-Contrastive Personalization

Autor: Zhao, Qianyi, Qu, Chen, Chen, Cen, Fan, Mingyuan, Wang, Yanhao

With increasing concerns and regulations on data privacy, fine-tuning pretrained language models (PLMs) in federated learning (FL) has become a common paradigm for NLP tasks. Despite being extensively studied, the existing methods for this problem st

Externí odkaz: http://arxiv.org/abs/2409.00116

Zobrazit plný text záznamu

Report

EIUP: A Training-Free Approach to Erase Non-Compliant Concepts Conditioned on Implicit Unsafe Prompts

Autor: Chen, Die, Li, Zhiwen, Fan, Mingyuan, Chen, Cen, Zhou, Wenmeng, Li, Yaliang

Text-to-image diffusion models have shown the ability to learn a diverse range of concepts. However, it is worth noting that they may also generate undesirable outputs, consequently giving rise to significant security concerns. Specifically, issues s

Externí odkaz: http://arxiv.org/abs/2408.01014

Zobrazit plný text záznamu

Report

Scaling Diffusion Transformers to 16 Billion Parameters

Autor: Fei, Zhengcong, Fan, Mingyuan, Yu, Changqian, Li, Debang, Huang, Junshi

In this paper, we present DiT-MoE, a sparse version of the diffusion Transformer, that is scalable and competitive with dense networks while exhibiting highly optimized inference. The DiT-MoE includes two simple designs: shared expert routing and exp

Externí odkaz: http://arxiv.org/abs/2407.11633

Zobrazit plný text záznamu

Report

SemiAdv: Query-Efficient Black-Box Adversarial Attack with Unlabeled Images

Autor: Fan, Mingyuan, Liu, Yang, Chen, Cen, Liu, Ximeng

Adversarial attack has garnered considerable attention due to its profound implications for the secure deployment of robots in sensitive security scenarios. To potentially push for advances in the field, this paper studies the adversarial attack in t

Externí odkaz: http://arxiv.org/abs/2407.11073

Zobrazit plný text záznamu

Report

Dimba: Transformer-Mamba Diffusion Models

Autor: Fei, Zhengcong, Fan, Mingyuan, Yu, Changqian, Li, Debang, Zhang, Youqiang, Huang, Junshi

This paper unveils Dimba, a new text-to-image diffusion model that employs a distinctive hybrid architecture combining Transformer and Mamba elements. Specifically, Dimba sequentially stacked blocks alternate between Transformer and Mamba layers, and

Externí odkaz: http://arxiv.org/abs/2406.01159

Zobrazit plný text záznamu

Report

Music Consistency Models

Autor: Fei, Zhengcong, Fan, Mingyuan, Huang, Junshi

Consistency models have exhibited remarkable capabilities in facilitating efficient image/video generation, enabling synthesis with minimal sampling steps. It has proven to be advantageous in mitigating the computational burdens associated with diffu

Externí odkaz: http://arxiv.org/abs/2404.13358

Zobrazit plný text záznamu

Report

Diffusion-RWKV: Scaling RWKV-Like Architectures for Diffusion Models

Autor: Fei, Zhengcong, Fan, Mingyuan, Yu, Changqian, Li, Debang, Huang, Junshi

Transformers have catalyzed advancements in computer vision and natural language processing (NLP) fields. However, substantial computational complexity poses limitations for their application in long-context tasks, such as high-resolution image gener

Externí odkaz: http://arxiv.org/abs/2404.04478

Zobrazit plný text záznamu

Report

Scalable Diffusion Models with State Space Backbone

Autor: Fei, Zhengcong, Fan, Mingyuan, Yu, Changqian, Huang, Junshi

This paper presents a new exploration into a category of diffusion models built upon state space architecture. We endeavor to train diffusion models for image data, wherein the traditional U-Net backbone is supplanted by a state space backbone, funct

Externí odkaz: http://arxiv.org/abs/2402.05608

Zobrazit plný text záznamu

Report

Tuning-Free Inversion-Enhanced Control for Consistent Image Editing

Autor: Duan, Xiaoyue, Cui, Shuhao, Kang, Guoliang, Zhang, Baochang, Fei, Zhengcong, Fan, Mingyuan, Huang, Junshi

Consistent editing of real images is a challenging task, as it requires performing non-rigid edits (e.g., changing postures) to the main objects in the input image without changing their identity or attributes. To guarantee consistent attributes, som

Externí odkaz: http://arxiv.org/abs/2312.14611

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání