Výsledky vyhledávání - "Nkhumise, Reabetswe M."

Report

How does Your RL Agent Explore? An Optimal Transport Analysis of Occupancy Measure Trajectories

Autor: Nkhumise, Reabetswe M., Basu, Debabrota, Prescott, Tony J., Gilra, Aditya

The rising successes of RL are propelled by combining smart algorithmic strategies and deep architectures to optimize the distribution of returns and visitations over the state-action space. A quantitative framework to compare the learning processes

Externí odkaz: http://arxiv.org/abs/2402.09113

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání