Interactive Reinforcement Learning from Imperfect Teachers

Autor: Andrea L. Thomaz, Taylor Kessler Faulkner
Rok vydání: 2021
Předmět:
Zdroj: HRI (Companion)
DOI: 10.1145/3434074.3446361
Popis: Robots can use information from people to improve learning speed or quality. However, people can have short attention spans and misunderstand tasks. Our work addresses these issues with algorithms for learning from inattentive teachers that take advantage of feedback when people are present, and an algorithm for learning from inaccurate teachers that estimates which state-action pairs receive incorrect feedback. These advances will enhance robots' ability to take advantage of imperfect feedback from human teachers.
Databáze: OpenAIRE