Learning Partial Policies to Speedup MDP Tree Search via Reduction to I.I.D. Learning.
Authors: | Pinto, Jervis (JERVIS.PINTO@GMAIL.COM); Fern, Alan (AFERN@EECS.OREGONSTATE.EDU) |
---|---|
Source: | Journal of Machine Learning Research. 2017, Vol. 18 Issue 64-98, p1-35. 35p. |
Database: | Business Source Ultimate |