Showing 1 - 10 of 396 for the search: '"Kolter, J."'
We introduce a novel, training-free method for sampling differentiable representations (diffreps) using pretrained diffusion models. Rather than merely mode-seeking, our method achieves sampling by "pulling back" the dynamics of the reverse-time process … (a rough illustrative sketch of such a pulled-back update follows the link below)
External link:
http://arxiv.org/abs/2412.06981
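The snippet above only hints at the mechanism, so here is a rough, illustrative sketch of what "pulling back" an image-space reverse-diffusion target onto the parameters of a differentiable representation could look like. The names `eps_model` (a pretrained noise predictor), `render` (a differentiable representation decoder), and the single gradient step are assumptions made for illustration, not the paper's actual algorithm.

```python
import torch

# Hypothetical components, not taken from the paper:
#   eps_model(x_t, t) -> predicted noise from a pretrained diffusion model
#   render(theta)     -> image produced by a differentiable representation
def pulled_back_step(theta, eps_model, render, t, alpha_bar, lr=1e-2):
    """One illustrative update: build a denoised image-space target with the
    pretrained model, then pull it back onto the representation parameters
    theta by differentiating through the renderer."""
    theta = theta.clone().requires_grad_(True)
    x = render(theta)
    noise = torch.randn_like(x)
    x_t = alpha_bar[t].sqrt() * x + (1 - alpha_bar[t]).sqrt() * noise
    with torch.no_grad():
        eps_hat = eps_model(x_t, t)                     # pretrained noise estimate
        x0_hat = (x_t - (1 - alpha_bar[t]).sqrt() * eps_hat) / alpha_bar[t].sqrt()
    loss = ((render(theta) - x0_hat) ** 2).mean()       # image-space target pulled back to theta
    loss.backward()
    with torch.no_grad():
        theta -= lr * theta.grad
    return theta.detach()
```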
Author:
Dontas, Michail, He, Yutong, Murata, Naoki, Mitsufuji, Yuki, Kolter, J. Zico, Salakhutdinov, Ruslan
Blind inverse problems, where both the target data and forward operator are unknown, are crucial to many computer vision applications. Existing methods often depend on restrictive assumptions such as additional training, operator linearity, or narrow …
External link:
http://arxiv.org/abs/2412.00557
Vision Language Models (VLMs) have demonstrated strong capabilities across various visual understanding and reasoning tasks. However, their real-world deployment is often constrained by high latency during inference due to substantial compute requirements …
External link:
http://arxiv.org/abs/2411.03312
Published in:
NeurIPS 2024
Despite their strong performance on many generative tasks, diffusion models require a large number of sampling steps in order to generate realistic samples. This has motivated the community to develop effective methods to distill pre-trained diffusion models … (a plain many-step sampling loop, for contrast, is sketched after the link below)
External link:
http://arxiv.org/abs/2410.16794
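For context on why distillation matters here, below is a minimal, generic DDIM-style sampling loop (not the distillation method itself): every step costs one forward pass through the denoiser, so naive sampling with hundreds or thousands of steps is slow. `eps_model` and the noise schedule `alpha_bar` are assumed inputs.

```python
import torch

@torch.no_grad()
def ddim_sample(eps_model, shape, alpha_bar, steps=1000):
    """Plain deterministic DDIM-style sampler: one eps_model evaluation per
    step, which is the cost that step-distillation methods try to remove."""
    x = torch.randn(shape)
    ts = torch.linspace(len(alpha_bar) - 1, 0, steps).long()
    for i in range(len(ts) - 1):
        t, t_prev = ts[i], ts[i + 1]
        eps = eps_model(x, t)
        x0 = (x - (1 - alpha_bar[t]).sqrt() * eps) / alpha_bar[t].sqrt()   # predicted clean sample
        x = alpha_bar[t_prev].sqrt() * x0 + (1 - alpha_bar[t_prev]).sqrt() * eps
    return x
```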
Counterfactual explanations have been a popular method of post-hoc explainability for a variety of settings in Machine Learning. Such methods focus on explaining classifiers by generating new data points that are similar to a given reference, while r… (a generic gradient-based counterfactual search is sketched after the link below)
External link:
http://arxiv.org/abs/2410.14522
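As a point of reference for what counterfactual explanation methods typically compute, here is a generic gradient-based search, an illustrative assumption rather than this paper's approach: find a point close to the reference that the classifier `clf` assigns to a chosen target class.

```python
import torch
import torch.nn.functional as F

def counterfactual(clf, x_ref, target_class, dist_weight=0.1, steps=200, lr=0.05):
    """Generic counterfactual search: minimize classification loss toward the
    target class plus a penalty for straying far from the reference point.
    x_ref is expected to have a leading batch dimension, e.g. shape (1, d)."""
    x = x_ref.clone().requires_grad_(True)
    opt = torch.optim.Adam([x], lr=lr)
    target = torch.tensor([target_class])
    for _ in range(steps):
        opt.zero_grad()
        loss = F.cross_entropy(clf(x), target) + dist_weight * (x - x_ref).pow(2).sum()
        loss.backward()
        opt.step()
    return x.detach()
```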
The composition of pretraining data is a key determinant of foundation models' performance, but there is no standard guideline for allocating a limited computational budget across different data sources. Most current approaches either rely on extensive …
External link:
http://arxiv.org/abs/2410.11820
Recent work has shown that state space models such as Mamba are significantly worse than Transformers on recall-based tasks because their state size is constant with respect to their input sequence length. But in practice, state space models … (a toy fixed-state recurrence is sketched after the link below)
External link:
http://arxiv.org/abs/2410.11135
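The constant-state limitation mentioned above is easiest to see in a toy linear state-space recurrence: however long the input is, everything the model can recall must be squeezed into a fixed-size state vector, whereas attention keeps a cache that grows with sequence length. The matrices below are illustrative placeholders, not Mamba's actual parameterization.

```python
import numpy as np

def ssm_scan(A, B, C, inputs):
    """Toy linear SSM: the hidden state h has fixed size d_state no matter how
    many inputs arrive, so memory of the sequence is lossy by construction."""
    d_state = A.shape[0]
    h = np.zeros(d_state)
    outputs = []
    for u in inputs:            # u: scalar input at one time step
        h = A @ h + B * u       # all history is compressed into d_state numbers
        outputs.append(C @ h)   # readout from the compressed state
    return np.array(outputs)
```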
A standard practice when using large language models is for users to supplement their instruction with an input context containing new information for the model to process. However, models struggle to reliably follow the input context, especially when …
External link:
http://arxiv.org/abs/2410.10796
Vision-language models (VLMs) such as CLIP are trained via contrastive learning between text and image pairs, resulting in aligned image and text embeddings that are useful for many downstream tasks. A notable drawback of CLIP, however, is that the r… (the standard CLIP-style contrastive loss is sketched after the link below)
External link:
http://arxiv.org/abs/2409.09721
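For readers unfamiliar with the training objective referenced above, this is the standard symmetric contrastive (InfoNCE) loss used in CLIP-style training: matched image/text pairs share an index and are pulled together while mismatched pairs are pushed apart. It illustrates the general setup only, not this paper's contribution.

```python
import torch
import torch.nn.functional as F

def clip_contrastive_loss(image_emb, text_emb, temperature=0.07):
    """Symmetric CLIP-style contrastive loss over a batch of N matched pairs."""
    image_emb = F.normalize(image_emb, dim=-1)
    text_emb = F.normalize(text_emb, dim=-1)
    logits = image_emb @ text_emb.t() / temperature   # (N, N) similarity matrix
    targets = torch.arange(len(logits))               # image i matches text i
    loss_i2t = F.cross_entropy(logits, targets)       # image -> matching text
    loss_t2i = F.cross_entropy(logits.t(), targets)   # text -> matching image
    return (loss_i2t + loss_t2i) / 2
```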
Transformer architectures have become a dominant paradigm for domains like language modeling but suffer in many inference settings due to their quadratic-time self-attention. Recently proposed subquadratic architectures, such as Mamba, have shown promise … (a single-head attention sketch illustrating the quadratic cost follows the link below)
External link:
http://arxiv.org/abs/2408.10189
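The quadratic cost mentioned above comes from the pairwise score matrix in self-attention, sketched below for a single head; the (n, n) matrix is exactly what subquadratic architectures avoid materializing. Weight shapes and naming are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def self_attention(x, Wq, Wk, Wv):
    """Single-head self-attention over a length-n sequence x of shape (n, d):
    the (n, n) score matrix makes time and memory scale quadratically in n."""
    q, k, v = x @ Wq, x @ Wk, x @ Wv
    scores = q @ k.transpose(-2, -1) / (q.shape[-1] ** 0.5)  # (n, n) pairwise scores
    return F.softmax(scores, dim=-1) @ v
```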