Search results - "Jain, Swayambhoo"

Report

Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge

Authors: Raju, Ravi; Jain, Swayambhoo; Li, Bo; Li, Jonathan; Thakker, Urmish

Large Language Models (LLMs) have revolutionized the landscape of machine learning, yet current benchmarks often fall short in capturing the diverse behavior of these models in real-world applications. A benchmark's usefulness is determined by its ab

External link: http://arxiv.org/abs/2408.08808


Report

SambaNova SN40L: Scaling the AI Memory Wall with Dataflow and Composition of Experts

Monolithic large language models (LLMs) like GPT-4 have paved the way for modern generative AI applications. Training, serving, and maintaining monolithic LLMs at scale, however, remains prohibitively expensive and challenging. The disproportionate i

External link: http://arxiv.org/abs/2405.07518


Report

Data-Driven Low-Rank Neural Network Compression

Authors: Papadimitriou, Dimitris; Jain, Swayambhoo

Despite many modern applications of Deep Neural Networks (DNNs), the large number of parameters in the hidden layers makes them unattractive for deployment on devices with storage capacity constraints. In this paper we propose a Data-Driven Low-rank

External link: http://arxiv.org/abs/2107.05787


Report

Efficacy of Bayesian Neural Networks in Active Learning

Authors: Rakesh, Vineeth; Jain, Swayambhoo

Obtaining labeled data for machine learning tasks can be prohibitively expensive. Active learning mitigates this issue by exploring the unlabeled data space and prioritizing the selection of data that can best improve the model performance. A common

External link: http://arxiv.org/abs/2104.00896


Report

Matrix Completion in the Unit Hypercube via Structured Matrix Factorization

Authors: Bugliarello, Emanuele; Jain, Swayambhoo; Rakesh, Vineeth

Several complex tasks that arise in organizations can be simplified by mapping them into a matrix completion problem. In this paper, we address a key challenge faced by our company: predicting the efficiency of artists in rendering visual effects (VF

External link: http://arxiv.org/abs/1905.12881


Report

Minimum Uncertainty Based Detection of Adversaries in Deep Neural Networks

Authors: Sheikholeslami, Fatemeh; Jain, Swayambhoo; Giannakis, Georgios B.

Despite their unprecedented performance in various domains, utilization of Deep Neural Networks (DNNs) in safety-critical environments is severely limited in the presence of even small adversarial perturbations. The present work develops a randomized

External link: http://arxiv.org/abs/1904.02841


Report

Learning Generative Models of Structured Signals from Their Superposition Using GANs with Application to Denoising and Demixing

Authors: Soltani, Mohammadreza; Jain, Swayambhoo; Sambasivan, Abhinav

Recently, Generative Adversarial Networks (GANs) have emerged as a popular alternative for modeling complex high dimensional distributions. Most of the existing works implicitly assume that the clean samples from the target distribution are easily av

External link: http://arxiv.org/abs/1902.04664


Report

Improved Support Recovery Guarantees for the Group Lasso With Applications to Structural Health Monitoring

Authors: Elyaderani, Mojtaba Kadkhodaie; Jain, Swayambhoo; Druce, Jeffrey; Gonella, Stefano; Haupt, Jarvis

This paper considers the problem of estimating an unknown high dimensional signal from noisy linear measurements, when the signal is assumed to possess a group-sparse structure in a known, fixed dictionary. We consider signals generated ac

External link: http://arxiv.org/abs/1708.08826


Report

Noisy Tensor Completion for Tensors with a Sparse Canonical Polyadic Factor

Authors: Jain, Swayambhoo; Gutierrez, Alexander; Haupt, Jarvis

In this paper we study the problem of noisy tensor completion for tensors that admit a canonical polyadic or CANDECOMP/PARAFAC (CP) decomposition with one of the factors being sparse. We present general theoretical error bounds for an estimate obtain

External link: http://arxiv.org/abs/1704.02534


Report

Block CUR: Decomposing Matrices using Groups of Columns

Authors: Oswal, Urvashi; Jain, Swayambhoo; Xu, Kevin S.; Eriksson, Brian

A common problem in large-scale data analysis is to approximate a matrix using a combination of specifically sampled rows and columns, known as CUR decomposition. Unfortunately, in many real-world environments, the ability to sample specific individu

External link: http://arxiv.org/abs/1703.06065
