Výsledky vyhledávání

Report

Diffusing States and Matching Scores: A New Framework for Imitation Learning

Autor: Wu, Runzhe, Chen, Yiding, Swamy, Gokul, Brantley, Kianté, Sun, Wen

Adversarial Imitation Learning is traditionally framed as a two-player zero-sum game between a learner and an adversarially chosen cost function, and can therefore be thought of as the sequential generalization of a Generative Adversarial Network (GA

Externí odkaz: http://arxiv.org/abs/2410.13855

Zobrazit plný text záznamu

Report

Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF

Autor: Gao, Zhaolin, Zhan, Wenhao, Chang, Jonathan D., Swamy, Gokul, Brantley, Kianté, Lee, Jason D., Sun, Wen

Large Language Models (LLMs) have achieved remarkable success at tasks like summarization that involve a single turn of interaction. However, they can still struggle with multi-turn tasks like dialogue that require long-term planning. Previous works

Externí odkaz: http://arxiv.org/abs/2410.04612

Zobrazit plný text záznamu

Report

DiffSpec: Differential Testing with LLMs using Natural Language Specifications and Code Artifacts

Autor: Rao, Nikitha, Gilbert, Elizabeth, Ramananandro, Tahina, Swamy, Nikhil, Goues, Claire Le, Fakhoury, Sarah

Differential testing can be an effective way to find bugs in software systems with multiple implementations that conform to the same specification, like compilers, network protocol parsers, and language runtimes. Specifications for such systems are o

Externí odkaz: http://arxiv.org/abs/2410.04249

Zobrazit plný text záznamu

Report

From Explanations to Action: A Zero-Shot, Theory-Driven LLM Framework for Student Performance Feedback

Autor: Swamy, Vinitra, Romano, Davide, Desikan, Bhargav Srinivasa, Camburu, Oana-Maria, Käser, Tanja

Recent advances in eXplainable AI (XAI) for education have highlighted a critical challenge: ensuring that explanations for state-of-the-art AI models are understandable for non-technical users such as educators and students. In response, we introduc

Externí odkaz: http://arxiv.org/abs/2409.08027

Zobrazit plný text záznamu

Report

Approximation Algorithms for Correlated Knapsack Orienteering

Autor: Espinosa, David Aleman, Swamy, Chaitanya

We consider the {\em correlated knapsack orienteering} (CSKO) problem: we are given a travel budget $B$, processing-time budget $W$, finite metric space $(V,d)$ with root $\rho\in V$, where each vertex is associated with a job with possibly correlate

Externí odkaz: http://arxiv.org/abs/2408.16566

Zobrazit plný text záznamu

Report

IDNet: A Novel Dataset for Identity Document Analysis and Fraud Detection

Autor: Guan, Hong, Wang, Yancheng, Xie, Lulu, Nag, Soham, Goel, Rajeev, Swamy, Niranjan Erappa Narayana, Yang, Yingzhen, Xiao, Chaowei, Prisby, Jonathan, Maciejewski, Ross, Zou, Jia

Effective fraud detection and analysis of government-issued identity documents, such as passports, driver's licenses, and identity cards, are essential in thwarting identity theft and bolstering security on online platforms. The training of accurate

Externí odkaz: http://arxiv.org/abs/2408.01690

Zobrazit plný text záznamu

Report

PathoWAve: A Deep Learning-based Weight Averaging Method for Improving Domain Generalization in Histopathology Images

Autor: Sharifi, Parastoo Sotoudeh, Ahmad, M. Omair, Swamy, M. N. S.

Recent advancements in deep learning (DL) have significantly advanced medical image analysis. In the field of medical image processing, particularly in histopathology image analysis, the variation in staining protocols and differences in scanners pre

Externí odkaz: http://arxiv.org/abs/2406.15685

Zobrazit plný text záznamu

Report

EvIL: Evolution Strategies for Generalisable Imitation Learning

Autor: Sapora, Silvia, Swamy, Gokul, Lu, Chris, Teh, Yee Whye, Foerster, Jakob Nicolaus

Often times in imitation learning (IL), the environment we collect expert demonstrations in and the environment we want to deploy our learned policy in aren't exactly the same (e.g. demonstrations collected in simulation but deployment in the real wo

Externí odkaz: http://arxiv.org/abs/2406.11905

Zobrazit plný text záznamu

Report

Highly Connected Graph Partitioning: Exact Formulation and Solution Methods

Autor: Swamy, Rahul, King, Douglas M., Jacobson, Sheldon H.

Graph partitioning (GP) and vertex connectivity have traditionally been two distinct fields of study. This paper introduces the highly connected graph partitioning (HCGP) problem, which partitions a graph into compact, size balanced, and $Q$-(vertex)

Externí odkaz: http://arxiv.org/abs/2406.08329

Zobrazit plný text záznamu

Report

Multi-Agent Imitation Learning: Value is Easy, Regret is Hard

Autor: Tang, Jingwu, Swamy, Gokul, Fang, Fei, Wu, Zhiwei Steven

We study a multi-agent imitation learning (MAIL) problem where we take the perspective of a learner attempting to coordinate a group of agents based on demonstrations of an expert doing so. Most prior work in MAIL essentially reduces the problem to m

Externí odkaz: http://arxiv.org/abs/2406.04219

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání