Zobrazeno 1 - 10
of 6 450
pro vyhledávání: '"Thomas, Philip A."'
Autor:
Guillery, R. W.
Publikováno v:
Biographical Memoirs of Fellows of the Royal Society, 1997 Nov 01. 43, 413-427.
Externí odkaz:
https://www.jstor.org/stable/770343
Evaluating policies using off-policy data is crucial for applying reinforcement learning to real-world problems such as healthcare and autonomous driving. Previous methods for off-policy evaluation (OPE) generally suffer from high variance or irreduc
Externí odkaz:
http://arxiv.org/abs/2410.02172
Autor:
Welte, Stephan, Thomas, Philip, Hartung, Lukas, Daiss, Severin, Langenfeld, Stefan, Morin, Olivier, Rempe, Gerhard, Distante, Emanuele
Publikováno v:
Nat. Photon. 15, 504-509 (2021)
One of the most fascinating aspects of quantum networks is their capability to distribute entanglement as a nonlocal communication resource. In a first step, this requires network-ready devices that can generate and store entangled states. Another cr
Externí odkaz:
http://arxiv.org/abs/2409.00871
Novel reinforcement learning algorithms, or improvements on existing ones, are commonly justified by evaluating their performance on benchmark environments and are compared to an ever-changing set of standard algorithms. However, despite numerous cal
Externí odkaz:
http://arxiv.org/abs/2406.16241
We present ICU-Sepsis, an environment that can be used in benchmarks for evaluating reinforcement learning (RL) algorithms. Sepsis management is a complex task that has been an important topic in applied RL research in recent years. Therefore, MDPs t
Externí odkaz:
http://arxiv.org/abs/2406.05646
Publikováno v:
Nature 629, 567-572 (2024)
Entanglement has evolved from an enigmatic concept of quantum physics to a key ingredient of quantum technology. It explains correlations between measurement outcomes that contradict classical physics, and has been widely explored with small sets of
Externí odkaz:
http://arxiv.org/abs/2403.11950
Publikováno v:
Princeton University Library Chronicle, 2012 Jan 01. 73(2), 247-278.
Autor:
Gupta, Dhawal, Jordan, Scott M., Chaudhari, Shreyas, Liu, Bo, Thomas, Philip S., da Silva, Bruno Castro
In this paper, we introduce a fresh perspective on the challenges of credit assignment and policy evaluation. First, we delve into the nuances of eligibility traces and explore instances where their updates may result in unexpected credit assignment
Externí odkaz:
http://arxiv.org/abs/2312.12972
Autor:
Hanania, Marwan D.
Publikováno v:
The Arab Studies Journal, 2018 Apr 01. 26(1), 170-173.
Externí odkaz:
https://www.jstor.org/stable/26529000
Designing reward functions for efficiently guiding reinforcement learning (RL) agents toward specific behaviors is a complex task. This is challenging since it requires the identification of reward structures that are not sparse and that avoid inadve
Externí odkaz:
http://arxiv.org/abs/2310.19007