Search results

Report

Compositional Planning for Logically Constrained Multi-Agent Markov Decision Processes

Authors: Kalagarla, Krishna C.; Low, Matthew; Jain, Rahul; Nayyar, Ashutosh; Nuzzo, Pierluigi

Designing control policies for large, distributed systems is challenging, especially in the context of critical, temporal logic based specifications (e.g., safety) that must be met with high probability. Compositional methods for such problems are ne

External link: http://arxiv.org/abs/2410.04004

Report

Lossy Catalytic Computation

Authors: Gupta, Chetan; Jain, Rahul; Sharma, Vimal Raj; Tewari, Raghunath

A catalytic Turing machine is a variant of a Turing machine in which there exists an auxiliary tape in addition to the input tape and the work tape. This auxiliary tape is initially filled with arbitrary content. The machine can read and write on the

External link: http://arxiv.org/abs/2408.14670

Report

Markov Balance Satisfaction Improves Performance in Strictly Batch Offline Imitation Learning

Authors: Agrawal, Rishabh; Dahlin, Nathan; Jain, Rahul; Nayyar, Ashutosh

Imitation learning (IL) is notably effective for robotic tasks where directly programming behaviors or defining optimal control costs is challenging. In this work, we address a scenario where the imitator relies solely on observed behavior and cannot

External link: http://arxiv.org/abs/2408.09125

Report

avaTTAR: Table Tennis Stroke Training with On-body and Detached Visualization in Augmented Reality

Authors: Ma, Dizhi; Hu, Xiyun; Shi, Jingyu; Patel, Mayank; Jain, Rahul; Liu, Ziyi; Zhu, Zhengzhe; Ramani, Karthik

Published in: UIST '24: Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology

Table tennis stroke training is a critical aspect of player development. We designed a new augmented reality (AR) system, avaTTAR, for table tennis stroke training. The system provides both "on-body" (first-person view) and "detached" (third-person v

External link: http://arxiv.org/abs/2407.15373

Report

Online Bandit Learning with Offline Preference Data

Authors: Agnihotri, Akhil; Jain, Rahul; Ramachandran, Deepak; Wen, Zheng

Reinforcement Learning with Human Feedback (RLHF) is at the core of fine-tuning methods for generative AI models for language and images. Such feedback is often sought as rank or preference feedback from human raters, as opposed to eliciting scores s

External link: http://arxiv.org/abs/2406.09574

Report

e-COP : Episodic Constrained Optimization of Policies

Authors: Agnihotri, Akhil; Jain, Rahul; Ramachandran, Deepak; Singla, Sahil

In this paper, we present the $\texttt{e-COP}$ algorithm, the first policy optimization algorithm for constrained Reinforcement Learning (RL) in episodic (finite horizon) settings. Such formulations are applicable when there are separate sets of opti

External link: http://arxiv.org/abs/2406.09563

Report

Nuclear Data to Quantify Urca Cooling in Accreting Neutron Stars

Author: Jain, Rahul

Neutron stars in Low Mass X-ray Binaries (LMXBs) can accrete matter onto their surface from the companion star. Transiently accreting neutron stars go through alternating phases of active accretion outbursts and quiescence. X-ray observations during

External link: http://arxiv.org/abs/2406.02634

Report

Pure Exploration for Constrained Best Mixed Arm Identification with a Fixed Budget

Authors: Tang, Dengwang; Jain, Rahul; Nayyar, Ashutosh; Nuzzo, Pierluigi

In this paper, we introduce the constrained best mixed arm identification (CBMAI) problem with a fixed budget. This is a pure exploration problem in a stochastic finite armed bandit model. Each arm is associated with a reward and multiple types of co

External link: http://arxiv.org/abs/2405.15090

Report

Robust and composable device-independent quantum protocols for oblivious transfer and bit commitment

Authors: Batra, Rishabh; Chakraborty, Sayantan; Jain, Rahul; Kapshikar, Upendra

We present robust and composable device-independent quantum protocols for oblivious transfer (OT) and bit commitment (BC) using Magic Square devices. We assume there is no long-term quantum memory, that is, after a finite time interval, referred to a

External link: http://arxiv.org/abs/2404.11283

Report

Commitments are equivalent to statistically-verifiable one-way state generators

Authors: Batra, Rishabh; Jain, Rahul

One-way state generators (OWSG) are natural quantum analogs to classical one-way functions. We consider statistically-verifiable OWSGs (sv-OWSG), which are potentially weaker objects than OWSGs. We show that O(n/log(n))-copy sv-OWSGs (n represents th

External link: http://arxiv.org/abs/2404.03220
