Finite State Multi-Armed Bandit Problems: Sensitive-Discount, Average-Reward and Average-Overtaking Optimality
Autor: | Katehakis, Michael N., Rothblum, Uriel G. |
---|---|
Zdroj: | The Annals of Applied Probability, 1996 Aug 01. 6(3), 1024-1034. |
Databáze: | JSTOR Journals |
Externí odkaz: |