Výsledky vyhledávání - "Gaebler, Johann Demetrio"

Report

An Adaptive State Aggregation Algorithm for Markov Decision Processes

Autor: Chen, Guanting, Gaebler, Johann Demetrio, Peng, Matt, Sun, Chunlin, Ye, Yinyu

Value iteration is a well-known method of solving Markov Decision Processes (MDPs) that is simple to implement and boasts strong theoretical convergence guarantees. However, the computational cost of value iteration quickly becomes infeasible as the

Externí odkaz: http://arxiv.org/abs/2107.11053

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání