Comparing deep reinforcement learning architectures for autonomous racing

Autor:	Benjamin David Evans, Hendrik Willem Jordaan, Herman Arnold Engelbrecht
Jazyk:	angličtina
Rok vydání:	2023
Předmět:	Deep reinforcement learning End-to-end driving Autonomous racing Trajectory planning Cybernetics Q300-390 Electronic computers. Computer science QA75.5-76.95
Zdroj:	Machine Learning with Applications, Vol 14, Iss , Pp 100496- (2023)
Druh dokumentu:	article
ISSN:	2666-8270
DOI:	10.1016/j.mlwa.2023.100496
Popis:	In classical autonomous racing, a perception, planning, and control pipeline is employed to navigate vehicles around a track as quickly as possible. In contrast, neural network controllers have been used to replace either part of or the entire pipeline. This paper compares three deep learning architectures for F1Tenth autonomous racing: full planning, which replaces the global and local planner, trajectory tracking, which replaces the local planner and end-to-end, which replaces the entire pipeline. The evaluation contrasts two reward signals, compares the DDPG, TD3 and SAC algorithms and investigates the generality of the learned policies to different test maps. Training the agents in simulation shows that the full planning agent has the most robust training and testing performance. The trajectory tracking agents achieve fast lap times on the training map but low completion rates on different test maps. Transferring the trained agents to a physical F1Tenth car reveals that the trajectory tracking and full planning agents transfer poorly, displaying rapid side-to-side swerving (slaloming). In contrast, the end-to-end agent, the worst performer in simulation, transfers the best to the physical vehicle and can complete the test track with a maximum speed of 5 m/s. These results show that planning methods outperform end-to-end approaches in simulation performance, but end-to-end approaches transfer better to physical robots.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/652589d4a3b14b9d85c89692c8d7253e Zobrazit plný text záznamu View record in DOAJ