Výsledky vyhledávání - "Coninx, Miranda"

Report

Discovering and Exploiting Sparse Rewards in a Learned Behavior Space

Autor: Paolo, Giuseppe, Coninx, Miranda, Laflaquière, Alban, Doncieux, Stephane

Learning optimal policies in sparse rewards settings is difficult as the learning agent has little to no feedback on the quality of its actions. In these situations, a good strategy is to focus on exploration, hopefully leading to the discovery of a

Externí odkaz: http://arxiv.org/abs/2111.01919

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání