Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Zhang, Paul Yuming"'
We study the exploratory Hamilton--Jacobi--Bellman (HJB) equation arising from the entropy-regularized exploratory control problem, which was formulated by Wang, Zariphopoulou and Zhou (J. Mach. Learn. Res., 21, 2020) in the context of reinforcement
Externí odkaz:
http://arxiv.org/abs/2109.10269