Showing 1 - 10 of 31 for search: '"Go, Hyojun"'
Author:
Lee, Hyeongmin, Kim, Jin-Young, Baek, Kyungjune, Kim, Jihwan, Go, Hyojun, Ha, Seongsu, Han, Seokjin, Jang, Jiho, Jung, Raehyuk, Kim, Daewoo, Kim, GeunOh, Kim, JongMok, Kim, Jongseok, Kim, Junwan, Kwon, Soonwoo, Lee, Jangwon, Park, Seungjoon, Seo, Minjoon, Suh, Jay, Yi, Jaehyuk, Lee, Aiden
In this work, we discuss evaluating video foundation models in a fair and robust manner. Unlike language or image foundation models, many video foundation models are evaluated with differing parameters (such as sampling rate, number of frames, pretraining …)
External link:
http://arxiv.org/abs/2408.11318
We present Diffusion Model Patching (DMP), a simple method to boost the performance of pre-trained diffusion models that have already reached convergence, with a negligible increase in parameters. DMP inserts a small, learnable set of prompts into the …
External link:
http://arxiv.org/abs/2405.17825
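The prompt-insertion idea summarized above can be sketched in isolation. The snippet below is a minimal illustration under assumed details, not the DMP implementation itself: a frozen single-head attention block is "patched" by prepending a few trainable prompt tokens that the original tokens can attend to, while the frozen weights stay untouched. All names, shapes, and the toy attention block are invented for the example.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 16  # hypothetical feature dimension

# Frozen single-head attention weights (stand-ins for a pretrained block).
Wq, Wk, Wv = (rng.normal(size=(d, d)) / np.sqrt(d) for _ in range(3))

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(x):
    # Plain frozen attention over a sequence of token vectors.
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    return softmax(q @ k.T / np.sqrt(d)) @ v

def patched_attention(tokens, prompts):
    # Prompt patching (sketch): prepend a few learnable prompt tokens,
    # let the frozen attention attend to them, then discard their outputs.
    x = np.concatenate([prompts, tokens], axis=0)
    return attention(x)[len(prompts):]

prompts = rng.normal(size=(2, d))   # the only new, trainable parameters
tokens = rng.normal(size=(8, d))
plain = attention(tokens)
patched = patched_attention(tokens, prompts)
assert patched.shape == plain.shape
assert not np.allclose(plain, patched)  # prompts change the computation
```

Because only the prompt tokens would be optimized, the parameter increase over the frozen model is tiny, which matches the "negligible increase in parameters" claim in the abstract.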
Author:
Jung, Raehyuk, Go, Hyojun, Yi, Jaehyuk, Jang, Jiho, Kim, Daniel, Suh, Jay, Lee, Aiden, Han, Cooper, Lee, Jae, Kim, Jeff, Kim, Jin-Young, Kim, Junwan, Park, Kyle, Lee, Lucas, Ha, Mars, Seo, Minjoon, Jo, Abraham, Park, Ed, Kianinejad, Hassan, Kim, SJ, Moon, Tony, Jeong, Wade, Popescu, Andrei, Kim, Esther, Yoon, EK, Heo, Genie, Choi, Henry, Kang, Jenna, Han, Kevin, Seo, Noah, Nguyen, Sunny, Won, Ryan, Park, Yeonhoo, Giuliani, Anthony, Chung, Dave, Yoon, Hans, Le, James, Ahn, Jenny, Lee, June, Saini, Maninder, Sanders, Meredith, Lee, Soyoung, Kim, Sue, Couture, Travis
This technical report introduces Pegasus-1, a multimodal language model specialized in video content understanding and interaction through natural language. Pegasus-1 is designed to address the unique challenges posed by video data, such as interpreting …
External link:
http://arxiv.org/abs/2404.14687
Diffusion-based generative models have emerged as powerful tools in the realm of generative modeling. Despite extensive research on denoising across various timesteps and noise levels, a conflict persists regarding the relative difficulties of the denoising …
External link:
http://arxiv.org/abs/2403.10348
Diffusion models have achieved remarkable success across a range of generative tasks. Recent efforts to enhance diffusion model architectures have reimagined them as a form of multi-task learning, where each task corresponds to a denoising task at a …
External link:
http://arxiv.org/abs/2403.09176
Recent progress in single-image 3D generation highlights the importance of multi-view coherency, leveraging 3D priors from large-scale diffusion models pretrained on Internet-scale images. However, the aspect of novel-view diversity remains underexplored …
External link:
http://arxiv.org/abs/2312.15980
Diffusion models generate highly realistic images by learning a multi-step denoising process, naturally embodying the principles of multi-task learning (MTL). Despite the inherent connection between diffusion models and MTL, there remains an unexplored …
External link:
http://arxiv.org/abs/2310.07138
In this paper, we address the performance degradation of efficient diffusion models by introducing Multi-architecturE Multi-Expert diffusion models (MEME). We identify the need for tailored operations at different time-steps in diffusion processes and …
External link:
http://arxiv.org/abs/2306.04990
Self-supervised contrastive learning (CL) has achieved state-of-the-art performance in representation learning by minimizing the distance between positive pairs while maximizing that of negative ones. Recently, it has been verified that the model learns …
External link:
http://arxiv.org/abs/2306.04175
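The contrastive objective described in that abstract (pulling positive pairs together while pushing negatives apart) is commonly instantiated as an InfoNCE-style loss. The following is a minimal NumPy sketch of that general formulation, not this paper's specific method; the temperature value and all names are chosen for illustration.

```python
import numpy as np

def info_nce_loss(z1, z2, temperature=0.5):
    """InfoNCE-style loss: each row of z1 is a positive pair with the
    matching row of z2; all other rows in the batch act as negatives."""
    z1 = z1 / np.linalg.norm(z1, axis=1, keepdims=True)
    z2 = z2 / np.linalg.norm(z2, axis=1, keepdims=True)
    logits = z1 @ z2.T / temperature            # (N, N) cosine similarities
    logits -= logits.max(axis=1, keepdims=True)  # numerical stability
    log_prob = logits - np.log(np.exp(logits).sum(axis=1, keepdims=True))
    # Diagonal entries are the positive pairs; minimize their negative
    # log-probability relative to the in-batch negatives.
    return -np.mean(np.diag(log_prob))

rng = np.random.default_rng(0)
z = rng.normal(size=(4, 8))
aligned = info_nce_loss(z, z)        # identical views: positives match
shuffled = info_nce_loss(z, z[::-1])  # mismatched views: positives broken
assert aligned < shuffled
```

The assertion reflects the core behavior: the loss is lower when each embedding's designated positive really is its most similar counterpart in the batch.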
Author:
Go, Hyojun, Kim, JinYoung, Lee, Yunsung, Lee, Seunghyun, Oh, Shinhyeok, Moon, Hyeongdon, Choi, Seungtaek
Diffusion-based generative models have achieved remarkable success in various domains. They train a shared model on denoising tasks that encompass different noise levels simultaneously, representing a form of multi-task learning (MTL). However, analyzing …
External link:
http://arxiv.org/abs/2306.00354