Search results - "Isenbaev, Vladislav"

Report

CIMRL: Combining IMitation and Reinforcement Learning for Safe Autonomous Driving

Authors: Booher, Jonathan; Rohanimanesh, Khashayar; Xu, Junhong; Isenbaev, Vladislav; Balakrishna, Ashwin; Gupta, Ishan; Liu, Wei; Petiushko, Aleksandr

Modern approaches to autonomous driving rely heavily on learned components trained with large amounts of human driving data via imitation learning. However, these methods require large amounts of expensive data collection and even then face challenge

External link: http://arxiv.org/abs/2406.08878


Report

Constrained Variational Policy Optimization for Safe Reinforcement Learning

Authors: Liu, Zuxin; Cen, Zhepeng; Isenbaev, Vladislav; Liu, Wei; Wu, Zhiwei Steven; Li, Bo; Zhao, Ding

Safe reinforcement learning (RL) aims to learn policies that satisfy certain constraints before deploying them to safety-critical applications. Previous primal-dual style approaches suffer from instability issues and lack optimality guarantees. This

External link: http://arxiv.org/abs/2201.11927

