Výsledky vyhledávání - "Puri, Ujjwal"

Report

Guaranteed Trust Region Optimization via Two-Phase KL Penalization

Autor: Zentner, K. R., Puri, Ujjwal, Huang, Zhehui, Sukhatme, Gaurav S.

On-policy reinforcement learning (RL) has become a popular framework for solving sequential decision problems due to its computational efficiency and theoretical simplicity. Some on-policy methods guarantee every policy update is constrained to a tru

Externí odkaz: http://arxiv.org/abs/2312.05405

Zobrazit plný text záznamu

Report

A Simple Approach to Continual Learning by Transferring Skill Parameters

Autor: Zentner, K. R., Julian, Ryan, Puri, Ujjwal, Zhang, Yulun, Sukhatme, Gaurav S.

In order to be effective general purpose machines in real world environments, robots not only will need to adapt their existing manipulation skills to new circumstances, they will need to acquire entirely new skills on-the-fly. A great promise of con

Externí odkaz: http://arxiv.org/abs/2110.10255

Zobrazit plný text záznamu

Report

Towards Exploiting Geometry and Time for Fast Off-Distribution Adaptation in Multi-Task Robot Learning

Autor: Zentner, K. R., Julian, Ryan, Puri, Ujjwal, Zhang, Yulun, Sukhatme, Gaurav

We explore possible methods for multi-task transfer learning which seek to exploit the shared physical structure of robotics tasks. Specifically, we train policies for a base set of pre-training tasks, then experiment with adapting to new off-distrib

Externí odkaz: http://arxiv.org/abs/2106.13237

Zobrazit plný text záznamu

Conference

Predicting the popularity of books before publication using machine learning.

Autor: Sachdeva, Hansika, Puri, Ujjwal, Poornima, S.

Publikováno v: AIP Conference Proceedings; 2024, Vol. 3075 Issue 1, p1-19, 19p

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání