Learning with Value-Ramp

Authors: Ameloot, Tom J., Bussche, Jan Van den
Year of publication: 2016
Subject:
Document type: Working Paper
Description: We study a learning principle based on the intuition of forming ramps. The agent tries to follow an increasing sequence of values until the agent meets a peak of reward. The resulting Value-Ramp algorithm is natural, easy to configure, and has a robust implementation with natural numbers.
Comment: Version 2: fixed notation in definition of transition + clarified a sentence in the Introduction
Database: arXiv