Search results - "Cicek, Dogan Can"

Report

Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients

Authors: Saglam, Baturay; Mutlu, Furkan Burak; Cicek, Dogan Can; Kozat, Suleyman Serdar

Approximation of the value functions in value-based deep reinforcement learning induces overestimation bias, resulting in suboptimal policies. We show that when the reinforcement signals received by the agents have a high variance, deep actor-critic

External link: http://arxiv.org/abs/2109.11788
