Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Del Col, Stefano"'
In reinforcement learning, we encode the potential behaviors of an agent interacting with an environment into an infinite set of policies, the policy space, typically represented by a family of parametric functions. Dealing with such a policy space i
Externí odkaz:
http://arxiv.org/abs/2202.11079