Search results - "Bhardwaj, Rishabh"

Report

Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse

Authors: Song, Maojia; Sim, Shang Hong; Bhardwaj, Rishabh; Chieu, Hai Leong; Majumder, Navonil; Poria, Soujanya

LLMs are an integral component of retrieval-augmented generation (RAG) systems. While many studies focus on evaluating the overall quality of end-to-end RAG systems, there is a gap in understanding the appropriateness of LLMs for the RAG task. To add

External link: http://arxiv.org/abs/2409.11242

Report

Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique

Authors: Pala, Tej Deep; Toh, Vernon Y. H.; Bhardwaj, Rishabh; Poria, Soujanya

In today's era, where large language models (LLMs) are integrated into numerous real-world applications, ensuring their safety and robustness is crucial for responsible AI usage. Automated red-teaming methods play a key role in this process by genera

External link: http://arxiv.org/abs/2408.10701

Report

WalledEval: A Comprehensive Safety Evaluation Toolkit for Large Language Models

Authors: Gupta, Prannaya; Yau, Le Qi; Low, Hao Han; Lee, I-Shiang; Lim, Hugo Maximus; Teoh, Yu Xin; Koh, Jia Hng; Liew, Dar Win; Bhardwaj, Rishabh; Bhardwaj, Rajat; Poria, Soujanya

WalledEval is a comprehensive AI safety testing toolkit designed to evaluate large language models (LLMs). It accommodates a diverse range of models, including both open-weight and API-based ones, and features over 35 safety benchmarks covering areas

External link: http://arxiv.org/abs/2408.03837

Report

Ruby Teaming: Improving Quality Diversity Search with Memory for Automated Red Teaming

Authors: Han, Vernon Toh Yan; Bhardwaj, Rishabh; Poria, Soujanya

We propose Ruby Teaming, a method that improves on Rainbow Teaming by including a memory cache as its third dimension. The memory dimension provides cues to the mutator to yield better-quality prompts, both in terms of attack success rate (ASR) and q

External link: http://arxiv.org/abs/2406.11654

Report

DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling

Authors: Deep, Pala Tej; Bhardwaj, Rishabh; Poria, Soujanya

With the proliferation of domain-specific models, model merging has emerged as a set of techniques that combine the capabilities of multiple models into one that can multitask without the cost of additional training. In this paper, we propose a new m

External link: http://arxiv.org/abs/2406.11617

Report

On Unitarity of Bespoke Amplitudes

Authors: Bhardwaj, Rishabh; Spradlin, Marcus; Volovich, Anastasia; Weng, He-Chen

We use partial wave unitarity to constrain various bespoke four-point amplitudes. We start by constructing bespoke generalizations of the type I superstring amplitude, which we show satisfy dual resonance and have suitable high-energy limits. By anal

External link: http://arxiv.org/abs/2406.04410

Report

HyperTTS: Parameter Efficient Adaptation in Text to Speech using Hypernetworks

Authors: Li, Yingting; Bhardwaj, Rishabh; Mehrish, Ambuj; Cheng, Bo; Poria, Soujanya

Neural speech synthesis, or text-to-speech (TTS), aims to transform a signal from the text domain to the speech domain. While developing TTS architectures that train and test on the same set of speakers has seen significant improvements, out-of-domai

External link: http://arxiv.org/abs/2404.04645

Report

Celestial soft currents at one-loop and their OPEs

Authors: Bhardwaj, Rishabh; Srikant, Akshay Yelleshpur

Conformally soft operators and their associated soft theorems on the celestial sphere encode the low energy behaviour of bulk scattering amplitudes. They lead to an infinite dimensional symmetry algebra of the celestial CFT at tree-level. In this pap

External link: http://arxiv.org/abs/2403.10443

Report

Language Models are Homer Simpson! Safety Re-Alignment of Fine-tuned Language Models through Task Arithmetic

Authors: Bhardwaj, Rishabh; Anh, Do Duc; Poria, Soujanya

Aligned language models face a significant limitation as their fine-tuning often results in compromised safety. To tackle this, we propose a simple method RESTA that performs LLM safety realignment. RESTA stands for REstoring Safety through Task Arit

External link: http://arxiv.org/abs/2402.11746

Report

A double copy from twisted (co)homology at genus one

Authors: Bhardwaj, Rishabh; Pokraka, Andrzej; Ren, Lecheng; Rodriguez, Carlos

We study the twisted (co)homology of a family of genus-one integrals -- the so called Riemann-Wirtinger integrals. These integrals are closely related to one-loop string amplitudes in chiral splitting where one leaves the loop-momentum, modulus and a

External link: http://arxiv.org/abs/2312.02148
