Vision-based robust control framework based on deep reinforcement learning applied to autonomous ground vehicles
Autor: | Marco H. Terra, Gustavo P. Morais, Valdir Grassi, Jose Nuno A. D. Bueno, Lucas Barbosa Marcos, Nilo F. de Resende |
---|---|
Rok vydání: | 2020 |
Předmět: |
0209 industrial biotechnology
Generalization Computer science Applied Mathematics 020208 electrical & electronic engineering Control engineering Image processing 02 engineering and technology Linear-quadratic regulator Computer Science Applications 020901 industrial engineering & automation Control and Systems Engineering Control theory Control system 0202 electrical engineering electronic engineering information engineering Reinforcement learning Electrical and Electronic Engineering Robust control Collision avoidance |
Zdroj: | Control Engineering Practice. 104:104630 |
ISSN: | 0967-0661 |
DOI: | 10.1016/j.conengprac.2020.104630 |
Popis: | Given the recent advances in computer vision, image processing and control systems, self-driving vehicles has been one of the most promising and challenging research topics nowadays. The design of vision-based robust controllers to keep an autonomous car in the center of the lane, despite uncertainties and disturbances, is still an ongoing challenge. This paper presents a hybrid control architecture that combines Deep Reinforcement Learning (DRL) and Robust Linear Quadratic Regulator (RLQR) for vision-based lateral control of an autonomous vehicle. Evolutionary estimation is used to model the vehicle uncertainties. For performance comparison, a DRL method and three other hybrid controllers are also evaluated. The inputs for each controller are real-time semantically segmented RGB camera images which serve as the basis to calculate continuous steering actions to keep the vehicle on the center of the lane with a constant velocity. Simulation results show that the proposed hybrid RLQR with evolutionary estimation of uncertainties architecture outperforms the other algorithms implemented. It presents lower tracking errors, smoother steering inputs, total collision avoidance and better generalization in new urban environments. Furthermore, it significantly decreases the required training time. |
Databáze: | OpenAIRE |
Externí odkaz: |