Výsledky vyhledávání

Report

On Practical Robust Reinforcement Learning: Practical Uncertainty Set and Double-Agent Algorithm

Autor: Hwang, Ukjo, Hong, Songnam

Robust reinforcement learning (RRL) aims at seeking a robust policy to optimize the worst case performance over an uncertainty set of Markov decision processes (MDPs). This set contains some perturbed MDPs from a nominal MDP (N-MDP) that generate sam

Externí odkaz: http://arxiv.org/abs/2305.06657

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání