Zobrazeno 1 - 6
of 6
pro vyhledávání: '"Trappett, Matthew"'
Hindsight experience replay (HER) is well-known to accelerate goal-based reinforcement learning (RL). While HER is generally applied to off-policy RL algorithms, we previously showed that HER can also accelerate on-policy algorithms, such as proximal
Externí odkaz:
http://arxiv.org/abs/2410.24016
Hindsight experience replay (HER) accelerates off-policy reinforcement learning algorithms for environments that emit sparse rewards by modifying the goal of the episode post-hoc to be some state achieved during the episode. Because post-hoc modifica
Externí odkaz:
http://arxiv.org/abs/2410.22524
Publikováno v:
In Current Opinion in Chemical Biology August 2019 51:138-145
Autor:
Bellot, Patrice, Doucet, Antoine, Geva, Shlomo, Gurajada, Sairam, Kamps, Jaap, Kazai, Gabriella, Koolen, Marijn, Mishra, Arunav, Moriceau, Véronique, Mothe, Josiane, Preminger, Michael, SanJuan, Eric, Schenkel, Ralf, Tannier, Xavier, Theobald, Martin, Trappett, Matthew, Wang, Qiuyue
Publikováno v:
Information Access Evaluation. Multilinguality, Multimodality & Visualization; 2013, p269-281, 13p
Publikováno v:
Focused Retrieval of Content & Structure; 2012, p283-294, 12p