Výsledky vyhledávání - "Voleti, Vikram"

Report

SV4D: Dynamic 3D Content Generation with Multi-Frame and Multi-View Consistency

Autor: Xie, Yiming, Yao, Chun-Han, Voleti, Vikram, Jiang, Huaizu, Jampani, Varun

We present Stable Video 4D (SV4D), a latent video diffusion model for multi-frame and multi-view consistent dynamic 3D content generation. Unlike previous methods that rely on separately trained generative models for video generation and novel view s

Externí odkaz: http://arxiv.org/abs/2407.17470

Zobrazit plný text záznamu

Report

HouseCrafter: Lifting Floorplans to 3D Scenes with 2D Diffusion Model

Autor: Nguyen, Hieu T., Chen, Yiwen, Voleti, Vikram, Jampani, Varun, Jiang, Huaizu

We introduce HouseCrafter, a novel approach that can lift a floorplan into a complete large 3D indoor scene (e.g., a house). Our key insight is to adapt a 2D diffusion model, which is trained on web-scale images, to generate consistent multi-view col

Externí odkaz: http://arxiv.org/abs/2406.20077

Zobrazit plný text záznamu

Dissertation/ Thesis

Conditional generative modeling for images, 3D animations, and video

Autor: Voleti, Vikram

Generative modeling for computer vision has shown immense progress in the last few years, revolutionizing the way we perceive, understand, and manipulate visual data. This rapidly evolving field has witnessed advancements in image generation, 3D anim

Externí odkaz: http://hdl.handle.net/1866/32123
https://orcid.org/0000-0003-0941-7227

Zobrazit plný text záznamu

Report

SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion

Autor: Voleti, Vikram, Yao, Chun-Han, Boss, Mark, Letts, Adam, Pankratz, David, Tochilkin, Dmitry, Laforte, Christian, Rombach, Robin, Jampani, Varun

We present Stable Video 3D (SV3D) -- a latent video diffusion model for high-resolution, image-to-multi-view generation of orbital videos around a 3D object. Recent work on 3D generation propose techniques to adapt 2D generative models for novel view

Externí odkaz: http://arxiv.org/abs/2403.12008

Zobrazit plný text záznamu

Report

Objaverse-XL: A Universe of 10M+ 3D Objects

Autor: Deitke, Matt, Liu, Ruoshi, Wallingford, Matthew, Ngo, Huong, Michel, Oscar, Kusupati, Aditya, Fan, Alan, Laforte, Christian, Voleti, Vikram, Gadre, Samir Yitzhak, VanderBilt, Eli, Kembhavi, Aniruddha, Vondrick, Carl, Gkioxari, Georgia, Ehsani, Kiana, Schmidt, Ludwig, Farhadi, Ali

Natural language processing and 2D vision models have attained remarkable proficiency on many tasks primarily by escalating the scale of training data. However, 3D vision tasks have not seen the same progress, in part due to the challenges of acquiri

Externí odkaz: http://arxiv.org/abs/2307.05663

Zobrazit plný text záznamu

Report

Are Diffusion Models Vision-And-Language Reasoners?

Autor: Krojer, Benno, Poole-Dayan, Elinor, Voleti, Vikram, Pal, Christopher, Reddy, Siva

Text-conditioned image generation models have recently shown immense qualitative success using denoising diffusion processes. However, unlike discriminative vision-and-language models, it is a non-trivial task to subject these diffusion-based generat

Externí odkaz: http://arxiv.org/abs/2305.16397

Zobrazit plný text záznamu

Report

Score-based Diffusion Models in Function Space

Autor: Lim, Jae Hyun, Kovachki, Nikola B., Baptista, Ricardo, Beckham, Christopher, Azizzadenesheli, Kamyar, Kossaifi, Jean, Voleti, Vikram, Song, Jiaming, Kreis, Karsten, Kautz, Jan, Pal, Christopher, Vahdat, Arash, Anandkumar, Anima

Diffusion models have recently emerged as a powerful framework for generative modeling. They consist of a forward process that perturbs input data with Gaussian white noise and a reverse process that learns a score function to generate samples by den

Externí odkaz: http://arxiv.org/abs/2302.07400

Zobrazit plný text záznamu

Report

Plankton-FL: Exploration of Federated Learning for Privacy-Preserving Training of Deep Neural Networks for Phytoplankton Classification

Autor: Zhang, Daniel, Voleti, Vikram, Wong, Alexander, Deglint, Jason

Creating high-performance generalizable deep neural networks for phytoplankton monitoring requires utilizing large-scale data coming from diverse global water sources. A major challenge to training such networks lies in data privacy, where data colle

Externí odkaz: http://arxiv.org/abs/2212.08990

Zobrazit plný text záznamu

Report

Score-based Denoising Diffusion with Non-Isotropic Gaussian Noise Models

Autor: Voleti, Vikram, Pal, Christopher, Oberman, Adam

Publikováno v: NeurIPS 2022 Workshop on Score-Based Methods

Generative models based on denoising diffusion techniques have led to an unprecedented increase in the quality and diversity of imagery that is now possible to create with neural generative models. However, most contemporary state-of-the-art methods

Externí odkaz: http://arxiv.org/abs/2210.12254

Zobrazit plný text záznamu

Report

SMPL-IK: Learned Morphology-Aware Inverse Kinematics for AI Driven Artistic Workflows

Autor: Voleti, Vikram, Oreshkin, Boris N., Bocquelet, Florent, Harvey, Félix G., Ménard, Louis-Simon, Pal, Christopher

Inverse Kinematics (IK) systems are often rigid with respect to their input character, thus requiring user intervention to be adapted to new skeletons. In this paper we aim at creating a flexible, learned IK solver applicable to a wide variety of hum

Externí odkaz: http://arxiv.org/abs/2208.08274

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání