Finite State Multi-Armed Bandit Problems: Sensitive-Discount, Average-Reward and Average-Overtaking Optimality

Authors: Katehakis, Michael N.; Rothblum, Uriel G.
Source: The Annals of Applied Probability, 1996 Aug 01. 6(3), 1024-1034.
Database: JSTOR Journals