Showing 1 - 10 of 9,148 for search: '"A Feizi"'
Image-text contrastive models such as CLIP learn transferable and robust representations for zero-shot transfer to a variety of downstream tasks. However, to obtain strong downstream performances, prompts need to be carefully curated, which can be a …
External link:
http://arxiv.org/abs/2406.13683
The increasing size of large language models (LLMs) challenges their usage on resource-constrained platforms. For example, memory on modern GPUs is insufficient to hold LLMs that are hundreds of gigabytes in size. Offloading is a popular method to …
External link:
http://arxiv.org/abs/2406.11674
Author:
Zarei, Arman, Rezaei, Keivan, Basu, Samyadeep, Saberi, Mehrdad, Moayeri, Mazda, Kattakinda, Priyatham, Feizi, Soheil
Recent text-to-image diffusion-based generative models have the stunning ability to generate highly detailed and photo-realistic images and achieve state-of-the-art low FID scores on challenging image generation benchmarks. However, one of the primary …
External link:
http://arxiv.org/abs/2406.07844
Author:
Basu, Samyadeep, Grayson, Martin, Morrison, Cecily, Nushi, Besmira, Feizi, Soheil, Massiceti, Daniela
Understanding the mechanisms of information storage and transfer in Transformer-based models is important for driving model understanding progress. Recent work has studied these mechanisms for Large Language Models (LLMs), revealing insights on how …
External link:
http://arxiv.org/abs/2406.04236
Identifying the origin of data is crucial for data provenance, with applications including data ownership protection, media forensics, and detecting AI-generated content. A standard approach involves embedding-based retrieval techniques that match …
External link:
http://arxiv.org/abs/2406.02836
Inference on large language models can be expensive in terms of the compute and memory costs involved, especially when long sequence lengths are used. In particular, the self-attention mechanism used in such models contributes significantly to these …
External link:
http://arxiv.org/abs/2406.02542
Recent works have explored how individual components of the CLIP-ViT model contribute to the final representation by leveraging the shared image-text representation space of CLIP. These components, such as attention heads and MLPs, have been shown to …
External link:
http://arxiv.org/abs/2406.01583
Author:
Kalibhat, Neha, Kattakinda, Priyatham, Zarei, Arman, Seleznev, Nikita, Sharpe, Samuel, Kumar, Senthil, Feizi, Soheil
Vision transformers have established a precedent of patchifying images into uniformly-sized chunks before processing. We hypothesize that this design choice may limit models in learning comprehensive and compositional representations from visual data …
External link:
http://arxiv.org/abs/2405.16401
Author:
Christodorescu, Mihai, Craven, Ryan, Feizi, Soheil, Gong, Neil, Hoffmann, Mia, Jha, Somesh, Jiang, Zhengyuan, Kamarposhti, Mehrdad Saberi, Mitchell, John, Newman, Jessica, Probasco, Emelia, Qi, Yanjun, Shams, Khawaja, Turek, Matthew
The rise of Generative AI (GenAI) brings about transformative potential across sectors, but its dual-use nature also amplifies risks. Governments globally are grappling with the challenge of regulating GenAI, balancing innovation against safety. …
External link:
http://arxiv.org/abs/2407.12999
Author:
Meymandi, Arash Rasti, Hosseini, Zahra, Davari, Sina, Moshiri, Abolfazl, Rahimi-Golkhandan, Shabnam, Namdar, Khashayar, Feizi, Nikta, Tavakoli-Targhi, Mohamad, Khalvati, Farzad
This study explores the integration of advanced Natural Language Processing (NLP) and Artificial Intelligence (AI) techniques to analyze and interpret Persian literature, focusing on the poetry of Forough Farrokhzad. Utilizing computational methods, …
External link:
http://arxiv.org/abs/2405.06760