Showing 1 - 10 of 12 for search: '"Materzynska, Joanna"'
We introduce AirLetters, a new video dataset consisting of real-world videos of human-generated, articulated motions. Specifically, our dataset requires a vision model to predict letters that humans draw in the air. Unlike existing video datasets, …
External link:
http://arxiv.org/abs/2410.02921
Author:
Longpre, Shayne, Mahari, Robert, Lee, Ariel, Lund, Campbell, Oderinwale, Hamidah, Brannon, William, Saxena, Nayan, Obeng-Marnu, Naana, South, Tobin, Hunter, Cole, Klyman, Kevin, Klamm, Christopher, Schoelkopf, Hailey, Singh, Nikhil, Cherep, Manuel, Anis, Ahmad, Dinh, An, Chitongo, Caroline, Yin, Da, Sileo, Damien, Mataciunas, Deividas, Misra, Diganta, Alghamdi, Emad, Shippole, Enrico, Zhang, Jianguo, Materzynska, Joanna, Qian, Kun, Tiwary, Kush, Miranda, Lester, Dey, Manan, Liang, Minnie, Hamdy, Mohammed, Muennighoff, Niklas, Ye, Seonghyeon, Kim, Seungone, Mohanty, Shrestha, Gupta, Vipul, Sharma, Vivek, Chien, Vu Minh, Zhou, Xuhui, Li, Yizhi, Xiong, Caiming, Villa, Luis, Biderman, Stella, Li, Hanlin, Ippolito, Daphne, Hooker, Sara, Kabbara, Jad, Pentland, Sandy
General-purpose artificial intelligence (AI) systems are built on massive swathes of public web data, assembled into corpora such as C4, RefinedWeb, and Dolma. To our knowledge, we conduct the first large-scale, longitudinal audit of the consent …
External link:
http://arxiv.org/abs/2407.14933
Author:
Materzynska, Joanna, Sivic, Josef, Shechtman, Eli, Torralba, Antonio, Zhang, Richard, Russell, Bryan
We introduce an approach for augmenting text-to-video generation models with customized motions, extending their capabilities beyond the motions depicted in the original training data. By leveraging a few video samples demonstrating specific movement…
External link:
http://arxiv.org/abs/2312.04966
We present a method to create interpretable concept sliders that enable precise control over attributes in image generations from diffusion models. Our approach identifies a low-rank parameter direction corresponding to one concept while minimizing …
External link:
http://arxiv.org/abs/2311.12092
Author:
Schwettmann, Sarah, Shaham, Tamar Rott, Materzynska, Joanna, Chowdhury, Neil, Li, Shuang, Andreas, Jacob, Bau, David, Torralba, Antonio
Published in:
NeurIPS 2023
Labeling neural network submodules with human-legible descriptions is useful for many downstream tasks: such descriptions can surface failures, guide interventions, and perhaps even explain important model behaviors. To date, most mechanistic …
External link:
http://arxiv.org/abs/2309.03886
Text-to-image models suffer from various safety issues that may limit their suitability for deployment. Previous methods have separately addressed individual issues of bias, copyright, and offensive content in text-to-image models. However, in the …
External link:
http://arxiv.org/abs/2308.14761
Motivated by recent advancements in text-to-image diffusion, we study erasure of specific concepts from the model's weights. While Stable Diffusion has shown promise in producing explicit or realistic artwork, it has raised concerns regarding its …
External link:
http://arxiv.org/abs/2303.07345
The CLIP network measures the similarity between natural text and images; in this work, we investigate the entanglement of the representation of word images and natural images in its image encoder. First, we find that the image encoder has an ability…
External link:
http://arxiv.org/abs/2206.07835
Human action is naturally compositional: humans can easily recognize and perform actions with objects that are different from those used in training demonstrations. In this paper, we study the compositionality of action by looking into the dynamics …
External link:
http://arxiv.org/abs/1912.09930
Author:
Goyal, Raghav, Kahou, Samira Ebrahimi, Michalski, Vincent, Materzyńska, Joanna, Westphal, Susanne, Kim, Heuna, Haenel, Valentin, Fruend, Ingo, Yianilos, Peter, Mueller-Freitag, Moritz, Hoppe, Florian, Thurau, Christian, Bax, Ingo, Memisevic, Roland
Neural networks trained on datasets such as ImageNet have led to major advances in visual object classification. One obstacle that prevents networks from reasoning more deeply about complex scenes and situations, and from integrating visual knowledge…
External link:
http://arxiv.org/abs/1706.04261