Výsledky vyhledávání - "Gautam, Simanta"

Report

Autor: Kulkarni, Tejas D., Saeedi, Ardavan, Gautam, Simanta, Gershman, Samuel J.

Learning robust value functions given raw observations and rewards is now possible with model-free and model-based deep reinforcement learning algorithms. There is a third alternative, called Successor Representations (SR), which decomposes the value

Externí odkaz: http://arxiv.org/abs/1606.02396

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání