Výsledky vyhledávání - "Nil Stolt Anso"

Sampled Policy Gradient for Learning to Play the Game Agar.io

Autor: Anton Wiehe, Nil Stolt Anso, Drugan, Madalina M., Marco Wiering

Publikováno v: ArXiv. Cornell University Press
University of Groningen

In this paper, a new offline actor-critic learning algorithm is introduced: Sampled Policy Gradient (SPG). SPG samples in the action space to calculate an approximated policy gradient by using the critic to evaluate the samples. This sampling allows

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::48e762ba8cf82344d42e62932113380d
https://research.rug.nl/en/publications/769cda31-8ddf-443c-b13e-89006b8d3f91

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání