Showing 1 - 10 of 499 for search: '"Gupta, Ashim"'
Large language models (LLMs) are increasingly deployed in real-world scenarios with the help of recent model compression techniques. Such momentum towards local deployment means the use of compressed LLMs will widely impact a large population. However, …
External link:
http://arxiv.org/abs/2407.04965
The $\texttt{Hi-COLA}$ code is an efficient dark matter simulation suite that flexibly handles the Horndeski family of modified gravity models. In this work we extend the scope of $\texttt{Hi-COLA}$ to accommodate Horndeski theories with K-mouflage …
External link:
http://arxiv.org/abs/2407.00855
Author:
Bose, Benjamin, Gupta, Ashim Sen, Fiorini, Bartolomeo, Brando, Guilherme, Hassani, Farbod, Baker, Tessa, Lombriser, Lucas, Li, Baojiu, Ruan, Cheng-Zong, Hernandez-Aguayo, Cesar, Atayde, Luis, Frusciante, Noemi
Testing gravity and the concordance model of cosmology, $\Lambda$CDM, at large scales is a key goal of this decade's largest galaxy surveys. Here we present a comparative study of dark matter power spectrum predictions from different numerical codes …
External link:
http://arxiv.org/abs/2406.13667
The increasing size of transformer-based models in NLP makes the question of compressing them important. In this work, we present a comprehensive analysis of factorization-based model compression techniques. Specifically, we focus on comparing …
External link:
http://arxiv.org/abs/2406.11307
To completely understand a document, the use of textual information is not enough. Understanding visual cues, such as layouts and charts, is also required. While the current state-of-the-art approaches for document understanding …
External link:
http://arxiv.org/abs/2406.10085
Do larger and more performant models resolve NLP's longstanding robustness issues? We investigate this question using over 20 models of different sizes spanning different architectural choices and pretraining objectives. We conduct evaluations using …
External link:
http://arxiv.org/abs/2311.09694
Identifying intents from dialogue utterances forms an integral component of task-oriented dialogue systems. Intent-related tasks are typically formulated either as a classification task, where the utterances are classified into predefined categories …
External link:
http://arxiv.org/abs/2310.16761
Author:
Gupta, Ashim, Krishna, Amrith
A clean-label (CL) attack is a form of data poisoning attack in which an adversary modifies only the textual input of the training data, without requiring access to the labeling function. CL attacks are relatively unexplored in NLP, as compared to label …
External link:
http://arxiv.org/abs/2305.19607
Author:
Gupta, Ashim, Blum, Carter Wood, Choji, Temma, Fei, Yingjie, Shah, Shalin, Vempala, Alakananda, Srikumar, Vivek
Can language models transform inputs to protect text classifiers against adversarial attacks? In this work, we present ATINTER, a model that intercepts and learns to rewrite adversarial inputs to make them non-adversarial for a downstream text classifier …
External link:
http://arxiv.org/abs/2305.16444
Author:
Maheshwari, Ayush, Gupta, Ashim, Krishna, Amrith, Singh, Atul Kumar, Ramakrishnan, Ganesh, Kumar, G. Anil, Singla, Jitin
We release Sāmayik, a dataset of around 53,000 parallel English-Sanskrit sentences, written in contemporary prose. Sanskrit is a classical language still in sustenance and has a rich documented heritage. However, due to the limited availability of …
External link:
http://arxiv.org/abs/2305.14004