Using options to improve robustness of imitation learning against adversarial attacks

Author:	Prithviraj Dasgupta
Publication year:	2021
Subject:	Adversarial system; Game playing; Robustness (computer science); Computer science; Autonomous agent; Learning agent; Reinforcement learning; Artificial intelligence; Imitation learning; Task (project management)
Source:	Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications III.
Description:	Imitation learning has been shown to be a successful learning technique in scenarios where autonomous agents have to adapt their operation across diverse environments or domains. The main principle underlying imitation learning is to determine a state-to-action mapping, called a policy, from trajectories demonstrated by an expert. We consider the problem of imitation learning in adversarial settings, where the expert could be malicious and intermittently give incorrect demonstrations to misguide the learning agent. We propose a technique that uses temporally extended policies, called options, to make a learning agent robust against adversarial expert demonstrations. Experimental evaluation of the proposed technique on a game-playing AI shows that, when an expert adversarially modifies the demonstrations either randomly or strategically, a learning agent using our options-based technique can successfully resist deterioration in its task performance, compared with an agent using conventional reinforcement learning.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::a44999c9f225b03b0e9ecd3532eae628 https://doi.org/10.1117/12.2585849