Showing 1 - 10 of 286 for search: '"Liu, Ming Yu"'
Meshes are fundamental representations of 3D surfaces. However, creating high-quality meshes is a labor-intensive task that requires significant time and expertise in 3D modeling. While a delicate object often requires over $10^4$ faces to be accurately …
External link:
http://arxiv.org/abs/2412.09548
Author:
NVIDIA, Bala, Maciej, Cui, Yin, Ding, Yifan, Ge, Yunhao, Hao, Zekun, Hasselgren, Jon, Huffman, Jacob, Jin, Jingyi, Lewis, J. P., Li, Zhaoshuo, Lin, Chen-Hsuan, Lin, Yen-Chen, Lin, Tsung-Yi, Liu, Ming-Yu, Luo, Alice, Ma, Qianli, Munkberg, Jacob, Shi, Stella, Wei, Fangyin, Xiang, Donglai, Xu, Jiashu, Zeng, Xiaohui, Zhang, Qinsheng
We introduce Edify 3D, an advanced solution designed for high-quality 3D asset generation. Our method first synthesizes RGB and surface normal images of the described object at multiple viewpoints using a diffusion model. The multi-view observations …
External link:
http://arxiv.org/abs/2411.07135
Author:
NVIDIA, Atzmon, Yuval, Bala, Maciej, Balaji, Yogesh, Cai, Tiffany, Cui, Yin, Fan, Jiaojiao, Ge, Yunhao, Gururani, Siddharth, Huffman, Jacob, Isaac, Ronald, Jannaty, Pooya, Karras, Tero, Lam, Grace, Lewis, J. P., Licata, Aaron, Lin, Yen-Chen, Liu, Ming-Yu, Ma, Qianli, Mallya, Arun, Martino-Tarr, Ashlee, Mendez, Doug, Nah, Seungjun, Pruett, Chris, Reda, Fitsum, Song, Jiaming, Wang, Ting-Chun, Wei, Fangyin, Zeng, Xiaohui, Zeng, Yu, Zhang, Qinsheng
We introduce Edify Image, a family of diffusion models capable of generating photorealistic image content with pixel-perfect accuracy. Edify Image utilizes cascaded pixel-space diffusion models trained using a novel Laplacian diffusion process, in which …
External link:
http://arxiv.org/abs/2411.07126
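The Edify Image snippet above mentions a Laplacian diffusion process in which image signals at different frequency bands are attenuated at varying rates. The band-split-and-attenuate idea can be sketched with a simple Laplacian pyramid; this is an illustrative sketch only, and the `levels`, `rates`, and nearest-neighbor resampling here are assumptions, not the paper's actual noising schedule:

```python
import numpy as np
from scipy.ndimage import gaussian_filter

def laplacian_pyramid(img, levels=2):
    """Split an image into detail (high-frequency) bands plus a coarse residual."""
    bands, current = [], img.astype(float)
    for _ in range(levels):
        low = gaussian_filter(current, sigma=1.0)
        small = low[::2, ::2]                                   # downsample
        up = np.repeat(np.repeat(small, 2, axis=0), 2, axis=1)  # upsample back
        bands.append(current - up)                              # detail band
        current = small
    bands.append(current)                                       # coarse band
    return bands

def reconstruct(bands):
    """Invert the pyramid: upsample the coarse band and add the details back."""
    current = bands[-1]
    for band in reversed(bands[:-1]):
        current = np.repeat(np.repeat(current, 2, axis=0), 2, axis=1) + band
    return current

def attenuate(bands, t, rates):
    """Scale each band by exp(-rate * t); assigning larger rates to finer
    bands makes high frequencies decay faster as the noise level t grows."""
    return [b * np.exp(-r * t) for b, r in zip(bands, rates)]
```

The pyramid is exactly invertible (`reconstruct` undoes `laplacian_pyramid`), so per-band attenuation is a well-defined operation on the image itself.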
Author:
Wang, Zhendong, Li, Zhaoshuo, Mandlekar, Ajay, Xu, Zhenjia, Fan, Jiaojiao, Narang, Yashraj, Fan, Linxi, Zhu, Yuke, Balaji, Yogesh, Zhou, Mingyuan, Liu, Ming-Yu, Zeng, Yu
Diffusion models, praised for their success in generative tasks, are increasingly being applied to robotics, demonstrating exceptional performance in behavior cloning. However, their slow generation process, stemming from iterative denoising steps, poses …
External link:
http://arxiv.org/abs/2410.21257
Author:
Tang, Jiaxiang, Li, Zhaoshuo, Hao, Zekun, Liu, Xian, Zeng, Gang, Liu, Ming-Yu, Zhang, Qinsheng
Current auto-regressive mesh generation methods suffer from issues such as incompleteness, insufficient detail, and poor generalization. In this paper, we propose an Auto-regressive Auto-encoder (ArAE) model capable of generating high-quality 3D meshes …
External link:
http://arxiv.org/abs/2409.18114
Masked diffusion models (MDMs) have emerged as a popular research topic for generative modeling of discrete data, thanks to their superior performance over other discrete diffusion models, and are rivaling the auto-regressive models (ARMs) for language …
External link:
http://arxiv.org/abs/2409.02908
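The masked-diffusion snippet above can be illustrated with the simplest form of the forward corruption process over discrete data: each token is independently replaced by a mask symbol with probability equal to the noise level t, and the reverse model learns to predict the originals. This is a generic sketch of the idea, not the specific parameterization from the paper:

```python
import random

MASK = "<mask>"

def mask_tokens(tokens, t, rng=random):
    """Forward process sketch for a masked diffusion model over discrete
    sequences: each token is independently masked with probability t in [0, 1]."""
    return [MASK if rng.random() < t else tok for tok in tokens]
```

At t = 0 the sequence is untouched and at t = 1 it is fully masked; training typically samples t at random and asks the model to recover the tokens at the masked positions.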
Author:
Li, Boyi, Zhu, Ligeng, Tian, Ran, Tan, Shuhan, Chen, Yuxiao, Lu, Yao, Cui, Yin, Veer, Sushant, Ehrlich, Max, Philion, Jonah, Weng, Xinshuo, Xue, Fuzhao, Tao, Andrew, Liu, Ming-Yu, Fidler, Sanja, Ivanovic, Boris, Darrell, Trevor, Malik, Jitendra, Han, Song, Pavone, Marco
We propose Wolf, a WOrLd summarization Framework for accurate video captioning. Wolf is an automated captioning framework that adopts a mixture-of-experts approach, leveraging the complementary strengths of Vision Language Models (VLMs). By utilizing both …
External link:
http://arxiv.org/abs/2407.18908
Author:
Zeng, Yu, Patel, Vishal M., Wang, Haochen, Huang, Xun, Wang, Ting-Chun, Liu, Ming-Yu, Balaji, Yogesh
Personalized text-to-image generation models enable users to create images that depict their individual possessions in diverse scenes, finding applications in various domains. To achieve the personalization capability, existing methods rely on finetuning …
External link:
http://arxiv.org/abs/2407.06187
Existing automatic captioning methods for visual content face challenges such as lack of detail, content hallucination, and poor instruction following. In this work, we propose VisualFactChecker (VFC), a flexible training-free pipeline that generates …
External link:
http://arxiv.org/abs/2404.19752
We present Condition-Aware Neural Network (CAN), a new method for adding control to image generative models. In parallel to prior conditional control methods, CAN controls the image generation process by dynamically manipulating the weight of the neural network …
External link:
http://arxiv.org/abs/2404.01143
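The CAN snippet above describes controlling generation by dynamically manipulating network weights rather than feeding the condition in as an extra input. A minimal sketch of that idea, assuming a simple per-output-row multiplicative modulation; the class name, shapes, and modulation rule here are illustrative assumptions, not the paper's architecture:

```python
import numpy as np

class ConditionAwareLinear:
    """Linear layer whose effective weights depend on a condition vector:
    a small generator maps the condition to per-row scales of the base weight."""
    def __init__(self, d_in, d_out, d_cond, seed=0):
        rng = np.random.default_rng(seed)
        self.base_w = rng.standard_normal((d_out, d_in)) * 0.1
        self.gen = rng.standard_normal((d_out, d_cond)) * 0.1  # weight generator

    def __call__(self, x, cond):
        scale = 1.0 + self.gen @ cond      # condition-dependent modulation
        w = self.base_w * scale[:, None]   # dynamic weights for this sample
        return w @ x
```

With a zero condition the layer reduces to its static base weights; a nonzero condition reshapes the weights themselves, which is the contrast with methods that concatenate the condition to the activations.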