Výsledky vyhledávání - "Thomas, Philip A."

Akademický článek

Thomas Philip Stroud Powell. 13 July 1923-8 February 1996

Autor: Guillery, R. W.

Publikováno v: Biographical Memoirs of Fellows of the Royal Society, 1997 Nov 01. 43, 413-427.

Externí odkaz: https://www.jstor.org/stable/770343

Zobrazit plný text záznamu

Report

Abstract Reward Processes: Leveraging State Abstraction for Consistent Off-Policy Evaluation

Autor: Chaudhari, Shreyas, Deshpande, Ameet, da Silva, Bruno Castro, Thomas, Philip S.

Evaluating policies using off-policy data is crucial for applying reinforcement learning to real-world problems such as healthcare and autonomous driving. Previous methods for off-policy evaluation (OPE) generally suffer from high variance or irreduc

Externí odkaz: http://arxiv.org/abs/2410.02172

Zobrazit plný text záznamu

Report

A nondestructive Bell-state measurement on two distant atomic qubits

Autor: Welte, Stephan, Thomas, Philip, Hartung, Lukas, Daiss, Severin, Langenfeld, Stefan, Morin, Olivier, Rempe, Gerhard, Distante, Emanuele

Publikováno v: Nat. Photon. 15, 504-509 (2021)

One of the most fascinating aspects of quantum networks is their capability to distribute entanglement as a nonlocal communication resource. In a first step, this requires network-ready devices that can generate and store entangled states. Another cr

Externí odkaz: http://arxiv.org/abs/2409.00871

Zobrazit plný text záznamu

Report

Position: Benchmarking is Limited in Reinforcement Learning Research

Autor: Jordan, Scott M., White, Adam, da Silva, Bruno Castro, White, Martha, Thomas, Philip S.

Novel reinforcement learning algorithms, or improvements on existing ones, are commonly justified by evaluating their performance on benchmark environments and are compared to an ever-changing set of standard algorithms. However, despite numerous cal

Externí odkaz: http://arxiv.org/abs/2406.16241

Zobrazit plný text záznamu

Report

ICU-Sepsis: A Benchmark MDP Built from Real Medical Data

Autor: Choudhary, Kartik, Gupta, Dhawal, Thomas, Philip S.

We present ICU-Sepsis, an environment that can be used in benchmarks for evaluating reinforcement learning (RL) algorithms. Sepsis management is a complex task that has been an important topic in applied RL research in recent years. Therefore, MDPs t

Externí odkaz: http://arxiv.org/abs/2406.05646

Zobrazit plný text záznamu

Report

Fusion of deterministically generated photonic graph states

Autor: Thomas, Philip, Ruscio, Leonardo, Morin, Olivier, Rempe, Gerhard

Publikováno v: Nature 629, 567-572 (2024)

Entanglement has evolved from an enigmatic concept of quantum physics to a key ingredient of quantum technology. It explains correlations between measurement outcomes that contradict classical physics, and has been widely explored with small sets of

Externí odkaz: http://arxiv.org/abs/2403.11950

Zobrazit plný text záznamu

Akademický článek

The Priest and the Prophetess: Thomas Philip Foley, Joanna Southcott, and Millenarian Activity in the Late Georgian Church of England

Publikováno v: Princeton University Library Chronicle, 2012 Jan 01. 73(2), 247-278.

Externí odkaz: https://www.jstor.org/stable/10.25290/prinunivlibrchro.73.2.0247

Zobrazit plný text záznamu

Report

From Past to Future: Rethinking Eligibility Traces

Autor: Gupta, Dhawal, Jordan, Scott M., Chaudhari, Shreyas, Liu, Bo, Thomas, Philip S., da Silva, Bruno Castro

In this paper, we introduce a fresh perspective on the challenges of credit assignment and policy evaluation. First, we delve into the nuances of eligibility traces and explore instances where their updates may result in unexpected credit assignment

Externí odkaz: http://arxiv.org/abs/2312.12972

Zobrazit plný text záznamu

Review

COLONIAL JERUSALEM: THE SPATIAL CONSTRUCTION OF IDENTITY AND DIFFERENCE IN A CITY OF MYTH, 1948–2012 Abowd Thomas Philip

Autor: Hanania, Marwan D.

Publikováno v: The Arab Studies Journal, 2018 Apr 01. 26(1), 170-173.

Externí odkaz: https://www.jstor.org/stable/26529000

Zobrazit plný text záznamu

Report

Behavior Alignment via Reward Function Optimization

Autor: Gupta, Dhawal, Chandak, Yash, Jordan, Scott M., Thomas, Philip S., da Silva, Bruno Castro

Designing reward functions for efficiently guiding reinforcement learning (RL) agents toward specific behaviors is a complex task. This is challenging since it requires the identification of reward structures that are not sparse and that avoid inadve

Externí odkaz: http://arxiv.org/abs/2310.19007

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání