Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Kharkwal, Ayush"'
Human-in-the-loop (HiL) reinforcement learning is gaining traction in domains with large action and state spaces, and sparse rewards by allowing the agent to take advice from HiL. Beyond advice accommodation, a sequential decision-making agent must b
Externí odkaz:
http://arxiv.org/abs/2210.03455