Vision-based robust control framework based on deep reinforcement learning applied to autonomous ground vehicles

Autor:	Marco H. Terra, Gustavo P. Morais, Valdir Grassi, Jose Nuno A. D. Bueno, Lucas Barbosa Marcos, Nilo F. de Resende
Rok vydání:	2020
Předmět:	0209 industrial biotechnology Generalization Computer science Applied Mathematics 020208 electrical & electronic engineering Control engineering Image processing 02 engineering and technology Linear-quadratic regulator Computer Science Applications 020901 industrial engineering & automation Control and Systems Engineering Control theory Control system 0202 electrical engineering electronic engineering information engineering Reinforcement learning Electrical and Electronic Engineering Robust control Collision avoidance
Zdroj:	Control Engineering Practice. 104:104630
ISSN:	0967-0661
DOI:	10.1016/j.conengprac.2020.104630
Popis:	Given the recent advances in computer vision, image processing and control systems, self-driving vehicles has been one of the most promising and challenging research topics nowadays. The design of vision-based robust controllers to keep an autonomous car in the center of the lane, despite uncertainties and disturbances, is still an ongoing challenge. This paper presents a hybrid control architecture that combines Deep Reinforcement Learning (DRL) and Robust Linear Quadratic Regulator (RLQR) for vision-based lateral control of an autonomous vehicle. Evolutionary estimation is used to model the vehicle uncertainties. For performance comparison, a DRL method and three other hybrid controllers are also evaluated. The inputs for each controller are real-time semantically segmented RGB camera images which serve as the basis to calculate continuous steering actions to keep the vehicle on the center of the lane with a constant velocity. Simulation results show that the proposed hybrid RLQR with evolutionary estimation of uncertainties architecture outperforms the other algorithms implemented. It presents lower tracking errors, smoother steering inputs, total collision avoidance and better generalization in new urban environments. Furthermore, it significantly decreases the required training time.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::f386fcd99956130299ef54113e3cd76a https://doi.org/10.1016/j.conengprac.2020.104630 Zobrazit plný text záznamu Full Text from ScienceDirect