Search results - "Wiltzer, Harley"

Report

Non-Adversarial Inverse Reinforcement Learning via Successor Feature Matching

Authors: Jain, Arnav Kumar; Wiltzer, Harley; Farebrother, Jesse; Rish, Irina; Berseth, Glen; Choudhury, Sanjiban

In inverse reinforcement learning (IRL), an agent seeks to replicate expert demonstrations through interactions with the environment. Traditionally, IRL is treated as an adversarial game, where an adversary searches over reward models, and a learner

External link: http://arxiv.org/abs/2411.07007

Report

Action Gaps and Advantages in Continuous-Time Distributional Reinforcement Learning

Authors: Wiltzer, Harley; Bellemare, Marc G.; Meger, David; Shafto, Patrick; Jhaveri, Yash

When decisions are made at high frequency, traditional reinforcement learning (RL) methods struggle to accurately estimate action values. In turn, their performance is inconsistent and often poor. Whether the performance of distributional RL (DRL) ag

External link: http://arxiv.org/abs/2410.11022

Report

Foundations of Multivariate Distributional Reinforcement Learning

Authors: Wiltzer, Harley; Farebrother, Jesse; Gretton, Arthur; Rowland, Mark

In reinforcement learning (RL), the consideration of multivariate reward signals has led to fundamental advancements in multi-objective decision-making, transfer learning, and representation learning. This work introduces the first oracle-free and co

External link: http://arxiv.org/abs/2409.00328

Report

A Distributional Analogue to the Successor Representation

Authors: Wiltzer, Harley; Farebrother, Jesse; Gretton, Arthur; Tang, Yunhao; Barreto, André; Dabney, Will; Bellemare, Marc G.; Rowland, Mark

This paper contributes a new approach for distributional reinforcement learning which elucidates a clean separation of transition structure and reward in the learning process. Analogous to how the successor representation (SR) describes the expected

External link: http://arxiv.org/abs/2402.08530

Report

Policy Optimization in a Noisy Neighborhood: On Return Landscapes in Continuous Control

Authors: Rahn, Nate; D'Oro, Pierluca; Wiltzer, Harley; Bacon, Pierre-Luc; Bellemare, Marc G.

Deep reinforcement learning agents for continuous control are known to exhibit significant instability in their performance over time. In this work, we provide a fresh perspective on these behaviors by studying the return landscape: the mapping betwe

External link: http://arxiv.org/abs/2309.14597

Dissertation/Thesis

On the evolution of return distributions in continuous-time reinforcement learning

Author: Wiltzer, Harley

External link: https://escholarship.mcgill.ca/concern/theses/v118rk640

Report

Distributional Hamilton-Jacobi-Bellman Equations for Continuous-Time Reinforcement Learning

Authors: Wiltzer, Harley; Meger, David; Bellemare, Marc G.

Continuous-time reinforcement learning offers an appealing formalism for describing control problems in which the passage of time is not naturally divided into discrete increments. Here we consider the problem of predicting the distribution of return

External link: http://arxiv.org/abs/2205.12184
