Výsledky vyhledávání

Report

Autor: Chen, Ziyu, Yang, Jiawei, Huang, Jiahui, de Lutio, Riccardo, Esturo, Janick Martinez, Ivanovic, Boris, Litany, Or, Gojcic, Zan, Fidler, Sanja, Pavone, Marco, Song, Li, Wang, Yue

We introduce OmniRe, a holistic approach for efficiently reconstructing high-fidelity dynamic urban scenes from on-device logs. Recent methods for modeling driving sequences using neural radiance fields or Gaussian Splatting have demonstrated the pot

Externí odkaz: http://arxiv.org/abs/2408.16760

Zobrazit plný text záznamu

Report

Photorealistic Object Insertion with Diffusion-Guided Inverse Rendering

Autor: Liang, Ruofan, Gojcic, Zan, Nimier-David, Merlin, Acuna, David, Vijaykumar, Nandita, Fidler, Sanja, Wang, Zian

The correct insertion of virtual objects in images of real-world scenes requires a deep understanding of the scene's lighting, geometry and materials, as well as the image formation process. While recent large-scale diffusion models have shown strong

Externí odkaz: http://arxiv.org/abs/2408.09702

Zobrazit plný text záznamu

Report

Wolf: Captioning Everything with a World Summarization Framework

We propose Wolf, a WOrLd summarization Framework for accurate video captioning. Wolf is an automated captioning framework that adopts a mixture-of-experts approach, leveraging complementary strengths of Vision Language Models (VLMs). By utilizing bot

Externí odkaz: http://arxiv.org/abs/2407.18908

Zobrazit plný text záznamu

Report

SuperPADL: Scaling Language-Directed Physics-Based Control with Progressive Supervised Distillation

Autor: Juravsky, Jordan, Guo, Yunrong, Fidler, Sanja, Peng, Xue Bin

Physically-simulated models for human motion can generate high-quality responsive character animations, often in real-time. Natural language serves as a flexible interface for controlling these models, allowing expert and non-expert users to quickly

Externí odkaz: http://arxiv.org/abs/2407.10481

Zobrazit plný text záznamu

Report

3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes

Autor: Moenne-Loccoz, Nicolas, Mirzaei, Ashkan, Perel, Or, de Lutio, Riccardo, Esturo, Janick Martinez, State, Gavriel, Fidler, Sanja, Sharp, Nicholas, Gojcic, Zan

Particle-based representations of radiance fields such as 3D Gaussian Splatting have found great success for reconstructing and re-rendering of complex scenes. Most existing methods render particles via rasterization, projecting them to screen space

Externí odkaz: http://arxiv.org/abs/2407.07090

Zobrazit plný text záznamu

Report

fVDB: A Deep-Learning Framework for Sparse, Large-Scale, and High-Performance Spatial Intelligence

Autor: Williams, Francis, Huang, Jiahui, Swartz, Jonathan, Klár, Gergely, Thakkar, Vijay, Cong, Matthew, Ren, Xuanchi, Li, Ruilong, Fuji-Tsang, Clement, Fidler, Sanja, Sifakis, Eftychios, Museth, Ken

We present fVDB, a novel GPU-optimized framework for deep learning on large-scale 3D data. fVDB provides a complete set of differentiable primitives to build deep learning architectures for common tasks in 3D learning such as convolution, pooling, at

Externí odkaz: http://arxiv.org/abs/2407.01781

Zobrazit plný text záznamu

Report

DistillNeRF: Perceiving 3D Scenes from Single-Glance Images by Distilling Neural Fields and Foundation Model Features

Autor: Wang, Letian, Kim, Seung Wook, Yang, Jiawei, Yu, Cunjun, Ivanovic, Boris, Waslander, Steven L., Wang, Yue, Fidler, Sanja, Pavone, Marco, Karkus, Peter

We propose DistillNeRF, a self-supervised learning framework addressing the challenge of understanding 3D environments from limited 2D observations in autonomous driving. Our method is a generalizable feedforward model that predicts a rich neural sce

Externí odkaz: http://arxiv.org/abs/2406.12095

Zobrazit plný text záznamu

Report

L4GM: Large 4D Gaussian Reconstruction Model

Autor: Ren, Jiawei, Xie, Kevin, Mirzaei, Ashkan, Liang, Hanxue, Zeng, Xiaohui, Kreis, Karsten, Liu, Ziwei, Torralba, Antonio, Fidler, Sanja, Kim, Seung Wook, Ling, Huan

We present L4GM, the first 4D Large Reconstruction Model that produces animated objects from a single-view video input -- in a single feed-forward pass that takes only a second. Key to our success is a novel dataset of multiview videos containing cur

Externí odkaz: http://arxiv.org/abs/2406.10324

Zobrazit plný text záznamu

Report

Outdoor Scene Extrapolation with Hierarchical Generative Cellular Automata

Autor: Zhang, Dongsu, Williams, Francis, Gojcic, Zan, Kreis, Karsten, Fidler, Sanja, Kim, Young Min, Kar, Amlan

We aim to generate fine-grained 3D geometry from large-scale sparse LiDAR scans, abundantly captured by autonomous vehicles (AV). Contrary to prior work on AV scene completion, we aim to extrapolate fine geometry from unlabeled and beyond spatial lim

Externí odkaz: http://arxiv.org/abs/2406.08292

Zobrazit plný text záznamu

Report

NeRF-XL: Scaling NeRFs with Multiple GPUs

Autor: Li, Ruilong, Fidler, Sanja, Kanazawa, Angjoo, Williams, Francis

We present NeRF-XL, a principled method for distributing Neural Radiance Fields (NeRFs) across multiple GPUs, thus enabling the training and rendering of NeRFs with an arbitrarily large capacity. We begin by revisiting existing multi-GPU approaches,

Externí odkaz: http://arxiv.org/abs/2404.16221

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání