Showing 1 - 2 of 2 for search: '"Isenbaev, Vladislav"'
Authors:
Booher, Jonathan; Rohanimanesh, Khashayar; Xu, Junhong; Isenbaev, Vladislav; Balakrishna, Ashwin; Gupta, Ishan; Liu, Wei; Petiushko, Aleksandr
Modern approaches to autonomous driving rely heavily on learned components trained with large amounts of human driving data via imitation learning. However, these methods require large amounts of expensive data collection and even then face challenge
External link:
http://arxiv.org/abs/2406.08878
Authors:
Liu, Zuxin; Cen, Zhepeng; Isenbaev, Vladislav; Liu, Wei; Wu, Zhiwei Steven; Li, Bo; Zhao, Ding
Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before deploying them to safety-critical applications. Previous primal-dual style approaches suffer from instability issues and lack optimality guarantees. This
External link:
http://arxiv.org/abs/2201.11927