Výsledky vyhledávání

Report

AuroraCap: Efficient, Performant Video Detailed Captioning and a New Benchmark

Autor: Chai, Wenhao, Song, Enxin, Du, Yilun, Meng, Chenlin, Madhavan, Vashisht, Bar-Tal, Omer, Hwang, Jeng-Neng, Xie, Saining, Manning, Christopher D.

Video detailed captioning is a key task which aims to generate comprehensive and coherent textual descriptions of video content, benefiting both video understanding and generation. In this paper, we propose AuroraCap, a video captioner based on a lar

Externí odkaz: http://arxiv.org/abs/2410.03051

Zobrazit plný text záznamu

Report

Lumiere: A Space-Time Diffusion Model for Video Generation

Autor: Bar-Tal, Omer, Chefer, Hila, Tov, Omer, Herrmann, Charles, Paiss, Roni, Zada, Shiran, Ephrat, Ariel, Hur, Junhwa, Liu, Guanghui, Raj, Amit, Li, Yuanzhen, Rubinstein, Michael, Michaeli, Tomer, Wang, Oliver, Sun, Deqing, Dekel, Tali, Mosseri, Inbar

We introduce Lumiere -- a text-to-video diffusion model designed for synthesizing videos that portray realistic, diverse and coherent motion -- a pivotal challenge in video synthesis. To this end, we introduce a Space-Time U-Net architecture that gen

Externí odkaz: http://arxiv.org/abs/2401.12945

Zobrazit plný text záznamu

Report

Space-Time Diffusion Features for Zero-Shot Text-Driven Motion Transfer

Autor: Yatim, Danah, Fridman, Rafail, Bar-Tal, Omer, Kasten, Yoni, Dekel, Tali

We present a new method for text-driven motion transfer - synthesizing a video that complies with an input text prompt describing the target objects and scene while maintaining an input video's motion and scene layout. Prior methods are confined to t

Externí odkaz: http://arxiv.org/abs/2311.17009

Zobrazit plný text záznamu

Report

Disentangling Structure and Appearance in ViT Feature Space

Autor: Tumanyan, Narek, Bar-Tal, Omer, Amir, Shir, Bagon, Shai, Dekel, Tali

We present a method for semantically transferring the visual appearance of one natural image to another. Specifically, our goal is to generate an image in which objects in a source structure image are "painted" with the visual appearance of their sem

Externí odkaz: http://arxiv.org/abs/2311.12193

Zobrazit plný text záznamu

Report

TokenFlow: Consistent Diffusion Features for Consistent Video Editing

Autor: Geyer, Michal, Bar-Tal, Omer, Bagon, Shai, Dekel, Tali

The generative AI revolution has recently expanded to videos. Nevertheless, current state-of-the-art video models are still lagging behind image models in terms of visual quality and user control over the generated content. In this work, we present a

Externí odkaz: http://arxiv.org/abs/2307.10373

Zobrazit plný text záznamu

Report

Geometrical optics of large deviations of Brownian motion in inhomogeneous media

Autor: Bar, Tal, Meerson, Baruch

Publikováno v: J. Stat. Mech. (2023) 093301

Geometrical optics provides an instructive insight into Brownian motion, ``pushed" into a large-deviations regime by imposed constraints. Here we extend geometrical optics of Brownian motion by accounting for diffusion inhomogeneity in space. We cons

Externí odkaz: http://arxiv.org/abs/2305.05942

Zobrazit plný text záznamu

Report

MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation

Autor: Bar-Tal, Omer, Yariv, Lior, Lipman, Yaron, Dekel, Tali

Recent advances in text-to-image generation with diffusion models present transformative capabilities in image quality. However, user controllability of the generated image, and fast adaptation to new tasks still remains an open challenge, currently

Externí odkaz: http://arxiv.org/abs/2302.08113

Zobrazit plný text záznamu

Report

Compatible director fields in $\mathbb{R}^3$

Autor: da Silva, Luiz C. B., Bar, Tal, Efrati, Efi

Publikováno v: Journal of Elasticity (2023)

The geometry and interactions between the constituents of a liquid crystal, which are responsible for inducing the partial order in the fluid, may locally favor an attempted phase that could not be realized in $\mathbb{R}^3$. While states that are in

Externí odkaz: http://arxiv.org/abs/2211.08598

Zobrazit plný text záznamu

Report

Text2LIVE: Text-Driven Layered Image and Video Editing

Autor: Bar-Tal, Omer, Ofri-Amar, Dolev, Fridman, Rafail, Kasten, Yoni, Dekel, Tali

We present a method for zero-shot, text-driven appearance manipulation in natural images and videos. Given an input image or video and a target text prompt, our goal is to edit the appearance of existing objects (e.g., object's texture) or augment th

Externí odkaz: http://arxiv.org/abs/2204.02491

Zobrazit plný text záznamu

Report

Splicing ViT Features for Semantic Appearance Transfer

Autor: Tumanyan, Narek, Bar-Tal, Omer, Bagon, Shai, Dekel, Tali

Externí odkaz: http://arxiv.org/abs/2201.00424

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání