Columba: A New Approach to Train an Agent for Autonomous Driving

Authors:	Beihong Jin, Kunchi Liu, Ruiyang Yang, Hongyin Tang
Publication year:	2021
Subject:	Discriminator; Artificial neural network; Computer science; Imitation learning; Machine learning; Complete information; Trajectory; Reinforcement learning; Train; Quality (business); Artificial intelligence
Source:	IJCNN
DOI:	10.1109/ijcnn52387.2021.9533604
Description:	For autonomous driving in extremely complex scenarios, existing research uses deep reinforcement learning or imitation learning to give agents decision-making capability. However, because such driving scenarios provide only incomplete information, existing techniques usually suffer from issues such as incorrect rewards or unstable training, which seriously degrade learning quality. In this paper, we propose a new approach named Columba, which trains the agent to learn from expert trajectory data and abnormal trajectory data instead of relying on any manually set reward functions. In particular, Columba designs a positive and negative feedback regulator to reduce the dangerous or bad states of the car agent at the beginning of training. Further, Columba generates rewards by coordinating the discriminator, the random distillation network, and the regulator, which improves reward accuracy. We conduct extensive experiments on the TORCS simulation platform. Experimental results show that the agent trained by Columba outperforms agents trained by DDPG and GAIL, which are strong baselines in deep reinforcement learning and imitation learning, respectively.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::160b246c6a0829657a1c97c4d2effe13 https://doi.org/10.1109/ijcnn52387.2021.9533604