Výsledky vyhledávání - "Herrmann, Charles"

Report

WonderWorld: Interactive 3D Scene Generation from a Single Image

Autor: Yu, Hong-Xing, Duan, Haoyi, Herrmann, Charles, Freeman, William T., Wu, Jiajun

We present WonderWorld, a novel framework for interactive 3D scene extrapolation that enables users to explore and shape virtual environments based on a single input image and user-specified text. While significant improvements have been made to the

Externí odkaz: http://arxiv.org/abs/2406.09394

Zobrazit plný text záznamu

Report

DreamWalk: Style Space Exploration using Diffusion Guidance

Autor: Shu, Michelle, Herrmann, Charles, Bowen, Richard Strong, Cole, Forrester, Zabih, Ramin

Text-conditioned diffusion models can generate impressive images, but fall short when it comes to fine-grained control. Unlike direct-editing tools like Photoshop, text conditioned models require the artist to perform "prompt engineering," constructi

Externí odkaz: http://arxiv.org/abs/2404.03145

Zobrazit plný text záznamu

Report

Lumiere: A Space-Time Diffusion Model for Video Generation

Autor: Bar-Tal, Omer, Chefer, Hila, Tov, Omer, Herrmann, Charles, Paiss, Roni, Zada, Shiran, Ephrat, Ariel, Hur, Junhwa, Liu, Guanghui, Raj, Amit, Li, Yuanzhen, Rubinstein, Michael, Michaeli, Tomer, Wang, Oliver, Sun, Deqing, Dekel, Tali, Mosseri, Inbar

We introduce Lumiere -- a text-to-video diffusion model designed for synthesizing videos that portray realistic, diverse and coherent motion -- a pivotal challenge in video synthesis. To this end, we introduce a Space-Time U-Net architecture that gen

Externí odkaz: http://arxiv.org/abs/2401.12945

Zobrazit plný text záznamu

Report

Efficient Hybrid Zoom using Camera Fusion on Mobile Phones

Autor: Wu, Xiaotong, Lai, Wei-Sheng, Shih, YiChang, Herrmann, Charles, Krainin, Michael, Sun, Deqing, Liang, Chia-Kai

DSLR cameras can achieve multiple zoom levels via shifting lens distances or swapping lens types. However, these techniques are not possible on smartphone devices due to space constraints. Most smartphone manufacturers adopt a hybrid zoom system: com

Externí odkaz: http://arxiv.org/abs/2401.01461

Zobrazit plný text záznamu

Report

Boundary Attention: Learning to Localize Boundaries under High Noise

Autor: Polansky, Mia Gaia, Herrmann, Charles, Hur, Junhwa, Sun, Deqing, Verbin, Dor, Zickler, Todd

We present a differentiable model that infers explicit boundaries, including curves, corners and junctions, using a mechanism that we call boundary attention. Boundary attention is a boundary-aware local attention operation that, when applied densely

Externí odkaz: http://arxiv.org/abs/2401.00935

Zobrazit plný text záznamu

Report

Zero-Shot Metric Depth with a Field-of-View Conditioned Diffusion Model

Autor: Saxena, Saurabh, Hur, Junhwa, Herrmann, Charles, Sun, Deqing, Fleet, David J.

While methods for monocular depth estimation have made significant strides on standard benchmarks, zero-shot metric depth estimation remains unsolved. Challenges include the joint modeling of indoor and outdoor scenes, which often exhibit significant

Externí odkaz: http://arxiv.org/abs/2312.13252

Zobrazit plný text záznamu

Report

WonderJourney: Going from Anywhere to Everywhere

Autor: Yu, Hong-Xing, Duan, Haoyi, Hur, Junhwa, Sargent, Kyle, Rubinstein, Michael, Freeman, William T., Cole, Forrester, Sun, Deqing, Snavely, Noah, Wu, Jiajun, Herrmann, Charles

We introduce WonderJourney, a modularized framework for perpetual 3D scene generation. Unlike prior work on view generation that focuses on a single type of scenes, we start at any user-provided location (by a text description or an image) and genera

Externí odkaz: http://arxiv.org/abs/2312.03884

Zobrazit plný text záznamu

Report

Telling Left from Right: Identifying Geometry-Aware Semantic Correspondence

Autor: Zhang, Junyi, Herrmann, Charles, Hur, Junhwa, Chen, Eric, Jampani, Varun, Sun, Deqing, Yang, Ming-Hsuan

While pre-trained large-scale vision models have shown significant promise for semantic correspondence, their features often struggle to grasp the geometry and orientation of instances. This paper identifies the importance of being geometry-aware for

Externí odkaz: http://arxiv.org/abs/2311.17034

Zobrazit plný text záznamu

Report

ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Image

Autor: Sargent, Kyle, Li, Zizhang, Shah, Tanmay, Herrmann, Charles, Yu, Hong-Xing, Zhang, Yunzhi, Chan, Eric Ryan, Lagun, Dmitry, Fei-Fei, Li, Sun, Deqing, Wu, Jiajun

We introduce a 3D-aware diffusion model, ZeroNVS, for single-image novel view synthesis for in-the-wild scenes. While existing methods are designed for single objects with masked backgrounds, we propose new techniques to address challenges introduced

Externí odkaz: http://arxiv.org/abs/2310.17994

Zobrazit plný text záznamu

Report

Substance or Style: What Does Your Image Embedding Know?

Autor: Rashtchian, Cyrus, Herrmann, Charles, Ferng, Chun-Sung, Chakrabarti, Ayan, Krishnan, Dilip, Sun, Deqing, Juan, Da-Cheng, Tomkins, Andrew

Probes are small networks that predict properties of underlying data from embeddings, and they provide a targeted, effective way to illuminate the information contained in embeddings. While analysis through the use of probes has become standard in NL

Externí odkaz: http://arxiv.org/abs/2307.05610

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání