Showing 1 - 4 of 4 results for the search: '"Khorasani, Sadegh"'
Policy gradient (PG) is widely used in reinforcement learning due to its scalability and good performance. In recent years, several variance-reduced PG methods have been proposed with a theoretical guarantee of converging to an approximate first-order…
External link:
http://arxiv.org/abs/2311.08914
Variance-reduced gradient estimators for policy gradient methods have been one of the main focuses of research in reinforcement learning in recent years, as they accelerate the estimation process. We propose a variance-reduced policy-gradient…
External link:
http://arxiv.org/abs/2205.08253
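The abstracts above refer to variance-reduced policy-gradient estimators. As background only (this is the generic SVRG-style control-variate idea, not the specific estimator proposed in either paper), a minimal sketch on a toy least-squares objective shows how subtracting the gradient at a reference point and adding back the full reference gradient keeps the estimator unbiased while shrinking its variance:

```python
import numpy as np

# Generic variance-reduced (SVRG-style) gradient estimator, illustrated on a
# toy objective f(w) = mean_i (a_i . w)^2 / 2. This is background for the
# general technique only, not the estimator from the papers above.

rng = np.random.default_rng(0)
A = rng.normal(size=(100, 5))            # per-sample data a_i
w = rng.normal(size=5)                   # current iterate
w_ref = w + 0.1 * rng.normal(size=5)     # reference ("snapshot") point near w

def grad_i(w, i):
    """Gradient of the i-th component (a_i . w)^2 / 2."""
    return A[i] * (A[i] @ w)

# Full gradient at the reference point (computed once per snapshot).
full_ref = np.mean([grad_i(w_ref, i) for i in range(100)], axis=0)

i = rng.integers(100)
plain = grad_i(w, i)                              # plain stochastic gradient
vr = grad_i(w, i) - grad_i(w_ref, i) + full_ref   # variance-reduced estimator

# Both estimators are unbiased for the full gradient at w; the second has
# lower variance whenever w is close to w_ref, because the two per-sample
# gradients nearly cancel.
```

The same control-variate structure underlies variance-reduced PG methods, with the per-sample gradients replaced by per-trajectory policy gradients and an importance-weighting correction for the change of policy between w and w_ref.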
Authors:
Bahari, Mohammadhossein, Zehtab, Vahid, Khorasani, Sadegh, Ayromlou, Sana, Saadatnejad, Saeed, Alahi, Alexandre
Anticipating the motions of vehicles in a scene is an essential problem for safe autonomous driving systems. To this end, comprehension of the scene's infrastructure is often the main clue for predicting future trajectories. Most of the proposed approaches…
External link:
http://arxiv.org/abs/2110.03706
Academic article
This result cannot be displayed to unauthenticated users. You must sign in to view it.