Showing 1 - 1 of 1 for search: '"Bester, Craig J."'
Parameterised actions in reinforcement learning are composed of discrete actions with continuous action-parameters. This provides a framework for solving complex domains that require combining high-level actions with flexible control. The recent P-DQ
External link:
http://arxiv.org/abs/1905.04388