Výsledky vyhledávání - "Kharkwal, Ayush"

Report

Advice Conformance Verification by Reinforcement Learning agents for Human-in-the-Loop

Autor: Verma, Mudit, Kharkwal, Ayush, Kambhampati, Subbarao

Human-in-the-loop (HiL) reinforcement learning is gaining traction in domains with large action and state spaces, and sparse rewards by allowing the agent to take advice from HiL. Beyond advice accommodation, a sequential decision-making agent must b

Externí odkaz: http://arxiv.org/abs/2210.03455

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání