Zobrazeno 1 - 10
of 2 340
pro vyhledávání: '"JAIN, Rahul"'
Designing control policies for large, distributed systems is challenging, especially in the context of critical, temporal logic based specifications (e.g., safety) that must be met with high probability. Compositional methods for such problems are ne
Externí odkaz:
http://arxiv.org/abs/2410.04004
A catalytic Turing machine is a variant of a Turing machine in which there exists an auxiliary tape in addition to the input tape and the work tape. This auxiliary tape is initially filled with arbitrary content. The machine can read and write on the
Externí odkaz:
http://arxiv.org/abs/2408.14670
Imitation learning (IL) is notably effective for robotic tasks where directly programming behaviors or defining optimal control costs is challenging. In this work, we address a scenario where the imitator relies solely on observed behavior and cannot
Externí odkaz:
http://arxiv.org/abs/2408.09125
Autor:
Ma, Dizhi, Hu, Xiyun, Shi, Jingyu, Patel, Mayank, Jain, Rahul, Liu, Ziyi, Zhu, Zhengzhe, Ramani, Karthik
Publikováno v:
UIST '2024: Proceedings of the 37th Annual ACM Symposium on User Interface Software and Technology
Table tennis stroke training is a critical aspect of player development. We designed a new augmented reality (AR) system, avaTTAR, for table tennis stroke training. The system provides both "on-body" (first-person view) and "detached" (third-person v
Externí odkaz:
http://arxiv.org/abs/2407.15373
Reinforcement Learning with Human Feedback (RLHF) is at the core of fine-tuning methods for generative AI models for language and images. Such feedback is often sought as rank or preference feedback from human raters, as opposed to eliciting scores s
Externí odkaz:
http://arxiv.org/abs/2406.09574
In this paper, we present the $\texttt{e-COP}$ algorithm, the first policy optimization algorithm for constrained Reinforcement Learning (RL) in episodic (finite horizon) settings. Such formulations are applicable when there are separate sets of opti
Externí odkaz:
http://arxiv.org/abs/2406.09563
Autor:
Jain, Rahul
Neutron stars in Low Mass X-ray Binaries (LMXBs) can accrete matter onto their surface from the companion star. Transiently accreting neutron stars go through alternating phases of active accretion outbursts and quiescence. X-ray observations during
Externí odkaz:
http://arxiv.org/abs/2406.02634
In this paper, we introduce the constrained best mixed arm identification (CBMAI) problem with a fixed budget. This is a pure exploration problem in a stochastic finite armed bandit model. Each arm is associated with a reward and multiple types of co
Externí odkaz:
http://arxiv.org/abs/2405.15090
Robust and composable device-independent quantum protocols for oblivious transfer and bit commitment
We present robust and composable device-independent quantum protocols for oblivious transfer (OT) and bit commitment (BC) using Magic Square devices. We assume there is no long-term quantum memory, that is, after a finite time interval, referred to a
Externí odkaz:
http://arxiv.org/abs/2404.11283
Autor:
Batra, Rishabh, Jain, Rahul
One-way state generators (OWSG) are natural quantum analogs to classical one-way functions. We consider statistically-verifiable OWSGs (sv-OWSG), which are potentially weaker objects than OWSGs. We show that O(n/log(n))-copy sv-OWSGs (n represents th
Externí odkaz:
http://arxiv.org/abs/2404.03220