Showing 1 - 10 of 3,329 for search: '"Thomas,Philip"'
Author:
Cesarz, Patrick, Fiorini, Eugene, Gong, Charles, Kelley, Kyle, Thomas, Philip, Woldar, Andrew
We introduce the notion of a move graph, that is, a directed graph whose vertex set is a $\mathbb Z$-module $\mathbb Z_n^m$, and whose arc set is uniquely determined by the action $M\!:\!\mathbb Z_n^m\to \mathbb Z_n^m$ where $M$ is an $m\times m$ mat
External link:
http://arxiv.org/abs/2411.01047
Evaluating policies using off-policy data is crucial for applying reinforcement learning to real-world problems such as healthcare and autonomous driving. Previous methods for off-policy evaluation (OPE) generally suffer from high variance or irreduc
External link:
http://arxiv.org/abs/2410.02172
Author:
Welte, Stephan, Thomas, Philip, Hartung, Lukas, Daiss, Severin, Langenfeld, Stefan, Morin, Olivier, Rempe, Gerhard, Distante, Emanuele
Published in:
Nat. Photon. 15, 504-509 (2021)
One of the most fascinating aspects of quantum networks is their capability to distribute entanglement as a nonlocal communication resource. In a first step, this requires network-ready devices that can generate and store entangled states. Another cr
External link:
http://arxiv.org/abs/2409.00871
Novel reinforcement learning algorithms, or improvements on existing ones, are commonly justified by evaluating their performance on benchmark environments and are compared to an ever-changing set of standard algorithms. However, despite numerous cal
External link:
http://arxiv.org/abs/2406.16241
We present ICU-Sepsis, an environment that can be used in benchmarks for evaluating reinforcement learning (RL) algorithms. Sepsis management is a complex task that has been an important topic in applied RL research in recent years. Therefore, MDPs t
External link:
http://arxiv.org/abs/2406.05646
Published in:
Nature 629, 567-572 (2024)
Entanglement has evolved from an enigmatic concept of quantum physics to a key ingredient of quantum technology. It explains correlations between measurement outcomes that contradict classical physics, and has been widely explored with small sets of
External link:
http://arxiv.org/abs/2403.11950
Author:
Gupta, Dhawal, Jordan, Scott M., Chaudhari, Shreyas, Liu, Bo, Thomas, Philip S., da Silva, Bruno Castro
In this paper, we introduce a fresh perspective on the challenges of credit assignment and policy evaluation. First, we delve into the nuances of eligibility traces and explore instances where their updates may result in unexpected credit assignment
External link:
http://arxiv.org/abs/2312.12972