Showing 1 - 10 of 46,088 for search: '"Ramakrishna, A."'
Author:
Meng, Tao, Mehrabi, Ninareh, Goyal, Palash, Ramakrishna, Anil, Galstyan, Aram, Zemel, Richard, Chang, Kai-Wei, Gupta, Rahul, Peris, Charith
We propose a constraint learning schema for fine-tuning Large Language Models (LLMs) with attribute control. Given a training corpus and control criteria formulated as a sequence-level constraint on model outputs, our method fine-tunes the LLM on the …
External link:
http://arxiv.org/abs/2410.05559
Understanding the electrical conductivity of warm dense hydrogen is critical for both fundamental physics and applications in planetary science and inertial confinement fusion. We demonstrate how to calculate the electrical conductivity using the con…
External link:
http://arxiv.org/abs/2409.15160
Let $n\geq 3$. We show that for every number field $K$ with $\zeta_p \notin K$, the absolute and tame Galois groups of $K$ satisfy the strong $n$-fold Massey property relative to $p$. Our work is based on an adapted version of the proof of the Theore…
External link:
http://arxiv.org/abs/2409.01028
Author:
Phogat, Karmvir Singh, Puranam, Sai Akhil, Dasaratha, Sridhar, Harsha, Chetan, Ramakrishna, Shashishekar
Recent research has shown that smaller language models can acquire substantial reasoning abilities when fine-tuned with reasoning exemplars crafted by a significantly larger teacher model. We explore this paradigm for the financial domain, focusing o…
External link:
http://arxiv.org/abs/2408.12337
Author:
Markowitz, Elan, Ramakrishna, Anil, Dhamala, Jwala, Mehrabi, Ninareh, Peris, Charith, Gupta, Rahul, Chang, Kai-Wei, Galstyan, Aram
Knowledge graphs (KGs) complement Large Language Models (LLMs) by providing reliable, structured, domain-specific, and up-to-date external knowledge. However, KGs and LLMs are often developed separately and must be integrated after training. We intro…
External link:
http://arxiv.org/abs/2407.21358
Author:
Ramakrishna, Shreyas, Schmidt, Riaan P., Peshkov, Anton A., Franke-Arnold, Sonja, Surzhykov, Andrey, Fritzsche, Stephan
In recent years, interest has been rising in applications of vector light beams for magnetic field sensing. In particular, a series of experiments were performed to extract information about properties of static magnetic fields from absorptio…
External link:
http://arxiv.org/abs/2407.17991
In document-level neural machine translation (DocNMT), multi-encoder approaches are common in encoding context and source sentences. Recent studies \cite{li-etal-2020-multi-encoder} have shown that the context encoder generates noise and makes the mo…
External link:
http://arxiv.org/abs/2407.03076
Author:
Yaldiz, Duygu Nur, Bakman, Yavuz Faruk, Buyukates, Baturalp, Tao, Chenyang, Ramakrishna, Anil, Dimitriadis, Dimitrios, Zhao, Jieyu, Avestimehr, Salman
Uncertainty estimation (UE) of generative large language models (LLMs) is crucial for evaluating the reliability of generated sequences. A significant subset of UE methods utilize token probabilities to assess uncertainty, aggregating multiple token…
External link:
http://arxiv.org/abs/2406.11278
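The entry above concerns token-probability-based uncertainty estimation. As a minimal sketch of one common aggregation — mean negative log-probability over the generated tokens — the following is an illustration only; the cited paper may propose a different scheme:

```python
import math

def sequence_uncertainty(token_probs):
    """Length-normalized negative log-likelihood of a generated sequence.

    token_probs: probabilities the model assigned to each generated token.
    Higher return value = the model was less confident in its output.
    """
    if not token_probs:
        raise ValueError("empty sequence")
    return -sum(math.log(p) for p in token_probs) / len(token_probs)
```

Length normalization keeps the score comparable across sequences of different lengths; without it, longer outputs would always look less certain.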
Author:
Arora, Daman, Sonwane, Atharv, Wadhwa, Nalin, Mehrotra, Abhav, Utpala, Saiteja, Bairi, Ramakrishna, Kanade, Aditya, Natarajan, Nagarajan
A common method to solve complex problems in software engineering is to divide the problem into multiple sub-problems. Inspired by this, we propose a Modular Architecture for Software-engineering AI (MASAI) agents, where different LLM-powered sub-ag…
External link:
http://arxiv.org/abs/2406.11638
Decoding methods for large language models (LLMs) usually struggle with the tradeoff between ensuring factuality and maintaining diversity. For example, a higher p threshold in the nucleus (top-p) sampling increases the diversity but decreases the fa…
External link:
http://arxiv.org/abs/2406.07735
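The entry above refers to nucleus (top-p) sampling, where the p threshold controls the diversity/factuality tradeoff. A minimal pure-Python sketch of standard top-p sampling (not the paper's proposed method):

```python
import random

def top_p_sample(probs, p=0.9, rng=None):
    """Nucleus (top-p) sampling: keep the smallest set of highest-probability
    tokens whose cumulative mass reaches p, renormalize, and sample from it.

    probs: list of token probabilities summing to ~1.0.
    Returns the index of the sampled token.
    """
    rng = rng or random.Random()
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    cum, nucleus = 0.0, []
    for i in order:                      # grow nucleus until mass >= p
        nucleus.append(i)
        cum += probs[i]
        if cum >= p:
            break
    total = sum(probs[i] for i in nucleus)
    weights = [probs[i] / total for i in nucleus]
    return rng.choices(nucleus, weights=weights, k=1)[0]
```

With a small p the nucleus shrinks toward the single most likely token (low diversity, high factuality); with p close to 1 nearly the whole vocabulary stays eligible, which is the tradeoff the abstract describes.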