Learning Partial Policies to Speedup MDP Tree Search via Reduction to I.I.D. Learning.

Autor: Pinto, Jervis1 JERVIS.PINTO@GMAIL.COM, Fern, Alan1 AFERN@EECS.OREGONSTATE.EDU
Zdroj: Journal of Machine Learning Research. 2017, Vol. 18 Issue 64-98, p1-35. 35p.
Databáze: Business Source Ultimate