Search results - "Ângelo Gregório Lovatto"

Model-based policy gradients: an empirical study on linear quadratic environments

Published in: Biblioteca Digital de Teses e Dissertações da USP
Universidade de São Paulo (USP)
instacron:USP

Stochastic Value Gradient (SVG) methods underlie many recent achievements of model-based Reinforcement Learning (RL) agents in continuous state-action spaces. Such methods use data collected by exploration in the environment to produce a model of its

External link: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::16751d06e66c6b7a3fdd1c73aef0eeb3
https://doi.org/10.11606/d.45.2022.tde-28062022-123656

Exploration Versus Exploitation in Model-Based Reinforcement Learning: An Empirical Study

Authors: Ângelo Gregório Lovatto, Leliane Nunes de Barros, Denis D. Mauá

Published in: Intelligent Systems ISBN: 9783031216886

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::b87cc6d7b6b1c579e6d0a65f1e4c61e7
https://doi.org/10.1007/978-3-031-21689-3_3

Gradient Estimation in Model-Based Reinforcement Learning: A Study on Linear Quadratic Environments

Authors: Ângelo Gregório Lovatto, Leliane Nunes de Barros, Thiago Pereira Bueno

Published in: Intelligent Systems ISBN: 9783030917012

Stochastic Value Gradient (SVG) methods underlie many recent achievements of model-based Reinforcement Learning agents in continuous state-action spaces. Despite their practical significance, many algorithm design choices still lack rigorous theoreti

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::98e09e25b477df85b9aa7a93bfca5776
https://doi.org/10.1007/978-3-030-91702-9_3
