Výsledky vyhledávání

Report

Adaptively Learning to Select-Rank in Online Platforms

Autor: Wang, Jingyuan, Dong, Perry, Jin, Ying, Zhan, Ruohan, Zhou, Zhengyuan

Ranking algorithms are fundamental to various online platforms across e-commerce sites to content streaming services. Our research addresses the challenge of adaptively ranking items from a candidate pool for heterogeneous users, a key component in p

Externí odkaz: http://arxiv.org/abs/2406.05017

Zobrazit plný text záznamu

Report

RLIF: Interactive Imitation Learning as Reinforcement Learning

Autor: Luo, Jianlan, Dong, Perry, Zhai, Yuexiang, Ma, Yi, Levine, Sergey

Although reinforcement learning methods offer a powerful framework for automatic skill acquisition, for practical learning-based control problems in domains such as robotics, imitation learning often provides a more convenient and accessible alternat

Externí odkaz: http://arxiv.org/abs/2311.12996

Zobrazit plný text záznamu

Report

Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning

Autor: Luo, Jianlan, Dong, Perry, Wu, Jeffrey, Kumar, Aviral, Geng, Xinyang, Levine, Sergey

The offline reinforcement learning (RL) paradigm provides a general recipe to convert static behavior datasets into policies that can perform better than the policy that collected the data. While policy constraints, conservatism, and other methods fo

Externí odkaz: http://arxiv.org/abs/2310.11731

Zobrazit plný text záznamu

Near-Optimal High-Probability Convergence for Non-Convex Stochastic Optimization with Variance Reduction

Autor: Liu, Zijian, Dong, Perry, Jagabathula, Srikanth, Zhou, Zhengyuan

Traditional analyses for non-convex stochastic optimization problems characterize convergence bounds in expectation, which is inadequate as it does not supply a useful performance guarantee on a single run. Motivated by its importance, an emerging li

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a17dd7894fa9d78eee975fb421983982

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání