Non-divergent Imitation for Verification of Complex Learned Controllers

Autor:	Jae Young Lee, Vahdat Abdelzad, Sean Sedwards, Soheil Soltani, Krzysztof Czarnecki
Rok vydání:	2021
Předmět:	Cloning (programming) Artificial neural network Computer science media_common.quotation_subject Fidelity Context (language use) Quantum Physics 02 engineering and technology Maximization Oracle Data modeling Metric (mathematics) 0202 electrical engineering electronic engineering information engineering 020201 artificial intelligence & image processing Algorithm media_common
Zdroj:	IJCNN
DOI:	10.1109/ijcnn52387.2021.9533410
Popis:	We consider the problem of verifying complex learned controllers using distillation. In contrast to previous work, we require that the distilled model maintains behavioural fidelity with an oracle, defining the notion of non-divergent path length (NPL) as a metric. We demonstrate that current distillation approaches with proven accuracy bounds do not have high expected NPL and can be out-performed by naive behavioural cloning. We thus propose a distillation algorithm that typically gives greater expected NPL, improved sample efficiency, and more compact models. We prove properties of NPL maximization and demonstrate the performance of our algorithm on deep Q-network controllers for three standard learning environments that have been used in this context: Pong, CartPole and MountainCar.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::cac730fc0f44be626828814a43c9402a https://doi.org/10.1109/ijcnn52387.2021.9533410 Zobrazit plný text záznamu