Search results - "Badrinath, A"

Report

AttackQA: Development and Adoption of a Dataset for Assisting Cybersecurity Operations using Fine-tuned and Open-Source LLMs

Author: Krishna, Varun Badrinath

Retrieval-augmented generation (RAG) on specialized domain datasets has shown improved performance when large language models (LLMs) are fine-tuned for generating responses to user queries. In this study, we develop a cybersecurity question-answering

External link: http://arxiv.org/abs/2411.01073

Report

Balancing Continuous Pre-Training and Instruction Fine-Tuning: Optimizing Instruction-Following in LLMs

Authors: Jindal, Ishan, Badrinath, Chandana, Bharti, Pranjal, Vinay, Lakkidi, Sharma, Sachin Dev

Large Language Models (LLMs) for public use require continuous pre-training to remain up-to-date with the latest data. The models also need to be fine-tuned with specific instructions to maintain their ability to follow instructions accurately. Typic

External link: http://arxiv.org/abs/2410.10739

Report

All Roads Lead to Rome? Exploring Representational Similarities Between Latent Spaces of Generative Image Models

Authors: Badrinath, Charumathi, Bhalla, Usha, Oesterling, Alex, Srinivas, Suraj, Lakkaraju, Himabindu

Do different generative image models secretly learn similar underlying representations? We investigate this by measuring the latent space similarity of four different models: VAEs, GANs, Normalizing Flows (NFs), and Diffusion Models (DMs). Our method

External link: http://arxiv.org/abs/2407.13449

Report

Hybrid Preference Optimization: Augmenting Direct Preference Optimization with Auxiliary Objectives

Authors: Badrinath, Anirudhan, Agarwal, Prabhat, Xu, Jiajing

For aligning large language models (LLMs), prior work has leveraged reinforcement learning via human feedback (RLHF) or variations of direct preference optimization (DPO). While DPO offers a simpler framework based on maximum likelihood estimation, i

External link: http://arxiv.org/abs/2405.17956

Report

OPERA: Automatic Offline Policy Evaluation with Re-weighted Aggregates of Multiple Estimators

Authors: Nie, Allen, Chandak, Yash, Yuan, Christina J., Badrinath, Anirudhan, Flet-Berliac, Yannis, Brunskill, Emma

Offline policy evaluation (OPE) allows us to evaluate and estimate a new sequential decision-making policy's performance by leveraging historical interaction data collected from other policies. Evaluating a new policy online without a confident estim

External link: http://arxiv.org/abs/2405.17708

Academic article

Managing COVID-19 outbreaks in prisons – a brief review of literature and key lessons learnt

Authors: Guo, Lin, Badrinath, Padmanabhan, Mookherjee, Jessica, Ghosh, Anjan, McCallum, Edyta, Dissanayake, Nirosha, George, Abraham

Published in: International Journal of Prison Health, 2024, Vol. 20, Issue 4, pp. 410-421.

External link: http://www.emeraldinsight.com/doi/10.1108/IJOPH-08-2023-0049

Report

SAP-sLDA: An Interpretable Interface for Exploring Unstructured Text

Authors: Badrinath, Charumathi, Pan, Weiwei, Doshi-Velez, Finale

A common way to explore text corpora is through low-dimensional projections of the documents, where one hopes that thematically similar documents will be clustered together in the projected space. However, popular algorithms for dimensionality reduct

External link: http://arxiv.org/abs/2308.01420

Report

Analyzing Chain-of-Thought Prompting in Large Language Models via Gradient-based Feature Attributions

Authors: Wu, Skyler, Shen, Eric Meng, Badrinath, Charumathi, Ma, Jiaqi, Lakkaraju, Himabindu

Chain-of-thought (CoT) prompting has been shown to empirically improve the accuracy of large language models (LLMs) on various question answering tasks. While understanding why CoT prompting is effective is crucial to ensuring that this phenomenon is

External link: http://arxiv.org/abs/2307.13339

Report

OCTraN: 3D Occupancy Convolutional Transformer Network in Unstructured Traffic Scenarios

Authors: Ganesh, Aditya Nalgunda, Badrinath, Dhruval Pobbathi, Kumar, Harshith Mohan, SS, Priya, Narayan, Surabhi

Modern approaches for vision-centric environment perception for autonomous navigation make extensive use of self-supervised monocular depth estimation algorithms that output disparity maps. However, when this disparity map is projected onto 3D space,

External link: http://arxiv.org/abs/2307.10934

Report

Waypoint Transformer: Reinforcement Learning via Supervised Learning with Intermediate Targets

Authors: Badrinath, Anirudhan, Flet-Berliac, Yannis, Nie, Allen, Brunskill, Emma

Despite the recent advancements in offline reinforcement learning via supervised learning (RvS) and the success of the decision transformer (DT) architecture in various domains, DTs have fallen short in several challenging benchmarks. The root cause

External link: http://arxiv.org/abs/2306.14069
