Showing 1 - 10 of 2,237
for the search: '"A P, Molchanov"'
Author:
Heinrich, Greg, Ranzinger, Mike, Yin, Hongxu, Lu, Yao, Kautz, Jan, Tao, Andrew, Catanzaro, Bryan, Molchanov, Pavlo
Agglomerative models have recently emerged as a powerful approach to training vision foundation models, leveraging multi-teacher distillation from existing models such as CLIP, DINO, and SAM. This strategy enables the efficient creation of robust models… (a toy sketch of multi-teacher distillation follows this record)
External link:
http://arxiv.org/abs/2412.07679
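The snippet above describes distilling several frozen teachers into one student backbone without labels. Below is a minimal sketch of that idea, assuming per-teacher projection heads and a plain MSE feature-matching loss; all dimensions, module names, and the loss choice are illustrative assumptions, not the paper's recipe.

```python
# Hedged sketch of label-free multi-teacher feature distillation.
import torch
import torch.nn as nn

d_student = 256
d_teachers = {"clip": 512, "dino": 384, "sam": 256}  # assumed feature widths

# Toy student backbone; a real agglomerative model would be a ViT.
student = nn.Sequential(
    nn.Linear(768, d_student), nn.GELU(), nn.Linear(d_student, d_student)
)
# One projection head per teacher, mapping student features to each teacher's space.
heads = nn.ModuleDict({k: nn.Linear(d_student, d) for k, d in d_teachers.items()})

def distill_loss(x: torch.Tensor, teacher_feats: dict) -> torch.Tensor:
    # teacher_feats[k]: features produced by frozen teacher k on the same batch.
    z = student(x)
    return sum(nn.functional.mse_loss(heads[k](z), f)
               for k, f in teacher_feats.items())

x = torch.randn(4, 768)  # stand-in for image embeddings
teacher_feats = {k: torch.randn(4, d) for k, d in d_teachers.items()}
loss = distill_loss(x, teacher_feats)
loss.backward()
```

In practice such models also balance the per-teacher losses and match spatial feature maps rather than pooled vectors, but the pattern above is the core of the setup.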
Author:
Liu, Zhijian, Zhu, Ligeng, Shi, Baifeng, Zhang, Zhuoyang, Lou, Yuming, Yang, Shang, Xi, Haocheng, Cao, Shiyi, Gu, Yuxian, Li, Dacheng, Li, Xiuyu, Fang, Yunhao, Chen, Yukang, Hsieh, Cheng-Yu, Huang, De-An, Cheng, An-Chieh, Nath, Vishwesh, Hu, Jinyi, Liu, Sifei, Krishna, Ranjay, Xu, Daguang, Wang, Xiaolong, Molchanov, Pavlo, Kautz, Jan, Yin, Hongxu, Han, Song, Lu, Yao
Visual language models (VLMs) have made significant advances in accuracy in recent years. However, their efficiency has received much less attention. This paper introduces NVILA, a family of open VLMs designed to optimize both efficiency and accuracy…
External link:
http://arxiv.org/abs/2412.04468
Author:
Bercovich, Akhiad, Ronen, Tomer, Abramovich, Talor, Ailon, Nir, Assaf, Nave, Dabbah, Mohammad, Galil, Ido, Geifman, Amnon, Geifman, Yonatan, Golan, Izhak, Haber, Netanel, Karpas, Ehud, Koren, Roi, Levy, Itay, Molchanov, Pavlo, Mor, Shahar, Moshe, Zach, Nabwani, Najeeb, Puny, Omri, Rubin, Ran, Schen, Itamar, Shahaf, Ido, Tropp, Oren, Argov, Omer Ullman, Zilberstein, Ran, El-Yaniv, Ran
Large language models (LLMs) have demonstrated remarkable capabilities, but their adoption is limited by high computational costs during inference. While increasing parameter counts enhances accuracy, it also widens the gap between state-of-the-art capabilities…
External link:
http://arxiv.org/abs/2411.19146
Author:
Dong, Xin, Fu, Yonggan, Diao, Shizhe, Byeon, Wonmin, Chen, Zijia, Mahabaleshwarkar, Ameya Sunil, Liu, Shih-Yang, Van Keirsbilck, Matthijs, Chen, Min-Hung, Suhara, Yoshi, Lin, Yingyan, Kautz, Jan, Molchanov, Pavlo
We propose Hymba, a family of small language models featuring a hybrid-head parallel architecture that integrates transformer attention mechanisms with state space models (SSMs) for enhanced efficiency. Attention heads provide high-resolution recall, while SSM heads enable efficient context summarization… (a toy sketch of the hybrid layout follows this record)
External link:
http://arxiv.org/abs/2411.13676
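The Hymba snippet describes attention and SSM heads operating in parallel within a layer. Below is a toy sketch of that layout only, assuming a crude diagonal linear SSM and an even averaging of the two branches; the actual Hymba heads are Mamba-style SSMs and the paper's design is considerably more involved.

```python
# Toy sketch of a hybrid attention + SSM block (not Hymba's real architecture).
import torch
import torch.nn as nn

class HybridHeadBlock(nn.Module):
    def __init__(self, d_model: int, n_heads: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        # Per-channel diagonal SSM parameters (a crude stand-in for Mamba-style heads).
        self.A = nn.Parameter(torch.full((d_model,), -0.5))  # decay per channel
        self.B = nn.Parameter(torch.ones(d_model))
        self.C = nn.Parameter(torch.ones(d_model))

    def ssm(self, x: torch.Tensor) -> torch.Tensor:
        # x: (batch, seq, d_model); scan h_t = a*h_{t-1} + b*x_t, y_t = c*h_t.
        a = torch.sigmoid(self.A)            # keep the recurrence stable in (0, 1)
        h = torch.zeros_like(x[:, 0])
        ys = []
        for t in range(x.size(1)):
            h = a * h + self.B * x[:, t]
            ys.append(self.C * h)
        return torch.stack(ys, dim=1)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        attn_out, _ = self.attn(x, x, x)     # high-resolution recall over the sequence
        ssm_out = self.ssm(x)                # cheap running summary of the context
        return x + 0.5 * (attn_out + ssm_out)

x = torch.randn(2, 10, 64)
print(HybridHeadBlock(64, 4)(x).shape)       # torch.Size([2, 10, 64])
```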
Author:
Nath, Vishwesh, Li, Wenqi, Yang, Dong, Myronenko, Andriy, Zheng, Mingxin, Lu, Yao, Liu, Zhijian, Yin, Hongxu, Law, Yee Man, Tang, Yucheng, Guo, Pengfei, Zhao, Can, Xu, Ziyue, He, Yufan, Heinrich, Greg, Aylward, Stephen, Edgar, Marc, Zephyr, Michael, Molchanov, Pavlo, Turkbey, Baris, Roth, Holger, Xu, Daguang
Generalist vision language models (VLMs) have made significant strides in computer vision, but they fall short in specialized fields like healthcare, where expert knowledge is essential. In traditional computer vision tasks, creative or approximate answers…
External link:
http://arxiv.org/abs/2411.12915
Author:
Liu, Shih-Yang, Yang, Huck, Wang, Chien-Yi, Fung, Nai Chit, Yin, Hongxu, Sakr, Charbel, Muralidharan, Saurav, Cheng, Kwang-Ting, Kautz, Jan, Wang, Yu-Chiang Frank, Molchanov, Pavlo, Chen, Min-Hung
In this work, we re-formulate the model compression problem into the customized compensation problem: given a compressed model, we aim to introduce residual low-rank paths to compensate for compression errors under customized requirements from users… (a toy sketch of the residual path follows this record)
External link:
http://arxiv.org/abs/2410.21271
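To make the compensation idea concrete, here is a hedged sketch: the error of a compressed weight matrix is approximated by a rank-r factorization and added back as a residual path. Plain truncated SVD is used here as a stand-in; EoRA itself builds the low-rank path in an eigenspace tied to calibration activations, which this sketch does not attempt.

```python
# Hedged sketch of residual low-rank compensation for compression error.
import torch

def low_rank_compensation(W: torch.Tensor, W_compressed: torch.Tensor, r: int):
    # Best rank-r approximation (in Frobenius norm) of the compression error.
    E = W - W_compressed
    U, S, Vh = torch.linalg.svd(E, full_matrices=False)
    B = U[:, :r] * S[:r]          # (out, r)
    A = Vh[:r, :]                 # (r, in)
    return B, A

W = torch.randn(256, 256)
W_c = torch.round(W * 4) / 4      # toy "compression": coarse quantization
B, A = low_rank_compensation(W, W_c, r=16)
err_before = (W - W_c).norm()
err_after = (W - (W_c + B @ A)).norm()
print(err_before.item(), err_after.item())  # the residual path shrinks the error
```

At inference the model then uses W_c plus the cheap B @ A path, so the compensation cost scales with the chosen rank r rather than with the full weight matrix.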
Patients with schizophrenia often present with cognitive impairments that may hinder their ability to learn about their condition. These individuals could benefit greatly from education platforms that leverage the adaptability of Large Language Models…
External link:
http://arxiv.org/abs/2410.12848
Author:
Margarint, Vlad, Molchanov, Stanislav
The first step in the formulation and study of the Riemann Hypothesis is the analytic continuation of the Riemann Zeta Function (RZF) to the full complex plane with a pole at $s=1$. In the current work, we study the analytic continuation of two random… (the classical continuation step is recalled after this record)
External link:
http://arxiv.org/abs/2410.03044
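As background for the snippet above, the classical first continuation step it alludes to can be written down directly: the alternating (Dirichlet eta) series converges for Re(s) > 0, and the prefactor exhibits the simple pole at s = 1. This is standard material, not the paper's randomized construction.

```latex
% Analytic continuation of zeta to Re(s) > 0 via the Dirichlet eta series;
% the simple pole at s = 1 comes from the vanishing prefactor denominator.
\zeta(s) = \frac{1}{1 - 2^{1-s}} \sum_{n=1}^{\infty} \frac{(-1)^{n-1}}{n^{s}},
\qquad \operatorname{Re}(s) > 0, \ s \neq 1.
```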
Author:
Ranzinger, Mike, Barker, Jon, Heinrich, Greg, Molchanov, Pavlo, Catanzaro, Bryan, Tao, Andrew
Various visual foundation models have distinct strengths and weaknesses, both of which can be improved through heterogeneous multi-teacher knowledge distillation without labels, termed "agglomerative models." We build upon this body of work by studying…
External link:
http://arxiv.org/abs/2410.01680
Author:
Fang, Gongfan, Yin, Hongxu, Muralidharan, Saurav, Heinrich, Greg, Pool, Jeff, Kautz, Jan, Molchanov, Pavlo, Wang, Xinchao
Large Language Models (LLMs) are distinguished by their massive parameter counts, which typically result in significant redundancy. This work introduces MaskLLM, a learnable pruning method that establishes Semi-structured (or "N:M") Sparsity in LLMs… (a toy 2:4 mask follows this record)
External link:
http://arxiv.org/abs/2409.17481
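The MaskLLM snippet names "N:M" sparsity: in every contiguous group of M weights, at most N may be nonzero. The sketch below builds a 2:4 mask by weight magnitude, purely to illustrate the constraint; per the abstract, MaskLLM's contribution is to learn these masks end-to-end rather than pick them by magnitude.

```python
# Hedged illustration of the 2:4 sparsity pattern (not MaskLLM's learned masks).
import torch

def two_four_mask(W: torch.Tensor) -> torch.Tensor:
    # W: (out, in) with `in` divisible by 4.
    groups = W.abs().reshape(W.shape[0], -1, 4)          # (out, in//4, 4)
    idx = groups.topk(2, dim=-1).indices                 # keep the 2 largest per group
    mask = torch.zeros_like(groups).scatter_(-1, idx, 1.0)
    return mask.reshape_as(W)

W = torch.randn(8, 16)
mask = two_four_mask(W)
assert mask.reshape(8, -1, 4).sum(-1).eq(2).all()        # exactly 2 of every 4 kept
W_sparse = W * mask
```

The regular 2-of-4 layout is what lets sparse tensor hardware skip the zeroed weights at inference, which is the computational payoff the abstract refers to.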