Using options to improve robustness of imitation learning against adversarial attacks

Autor: Prithviraj Dasgupta
Rok vydání: 2021
Předmět:
Zdroj: Artificial Intelligence and Machine Learning for Multi-Domain Operations Applications III.
Popis: Imitation learning has been shown to be a successful learning technique in scenarios where autonomous agents have to adapt their operation across diverse environments or domains. The main principle underlying imitation learning is to determine a state-to-action mapping, called a policy, from trajectories demonstrated by an expert. We consider the problem of imitation learning under adversarial settings where the expert could be malicious and intermittently give incorrect demonstrations to misguide the learning agent. We propose a technique using temporally extended policies called options to make a learning agent robust against adversarial expert demonstrations. Experimental evaluation of our proposed technique for a game playing AI shows that a learning agent using our options based technique can successfully resist deterioration in its task performance as compared to using conventional reinforcement learning, when an expert adversarially modifies the demonstrations either randomly or strategically.
Databáze: OpenAIRE