Search results - "Renard, Titouan"

Report

Convergence of a model-free entropy-regularized inverse reinforcement learning algorithm

Authors: Renard, Titouan; Schlaginhaufen, Andreas; Ni, Tingting; Kamgarpour, Maryam

Given a dataset of expert demonstrations, inverse reinforcement learning (IRL) aims to recover a reward for which the expert is optimal. This work proposes a model-free algorithm to solve the entropy-regularized IRL problem. In particular, we employ a st

External link: http://arxiv.org/abs/2403.16829
