Learning with Value-Ramp
Autor: | Ameloot, Tom J., Bussche, Jan Van den |
---|---|
Rok vydání: | 2016 |
Předmět: | |
Druh dokumentu: | Working Paper |
Popis: | We study a learning principle based on the intuition of forming ramps. The agent tries to follow an increasing sequence of values until the agent meets a peak of reward. The resulting Value-Ramp algorithm is natural, easy to configure, and has a robust implementation with natural numbers. Comment: Version 2: fixed notation in definition of transition + clarified a sentence in the Introduction |
Databáze: | arXiv |
Externí odkaz: |