Výsledky vyhledávání - "Zhang, Junshan"

Report

OLLIE: Imitation Learning from Offline Pretraining to Online Finetuning

Autor: Yue, Sheng, Hua, Xingyuan, Ren, Ju, Lin, Sen, Zhang, Junshan, Zhang, Yaoxue

In this paper, we study offline-to-online Imitation Learning (IL) that pretrains an imitation policy from static demonstration data, followed by fast finetuning with minimal environmental interaction. We find the na\"ive combination of existing offli

Externí odkaz: http://arxiv.org/abs/2405.17477

Zobrazit plný text záznamu

Report

How to Leverage Diverse Demonstrations in Offline Imitation Learning

Autor: Yue, Sheng, Liu, Jiani, Hua, Xingyuan, Ren, Ju, Lin, Sen, Zhang, Junshan, Zhang, Yaoxue

Offline Imitation Learning (IL) with imperfect demonstrations has garnered increasing attention owing to the scarcity of expert data in many real-world domains. A fundamental problem in this scenario is how to extract positive behaviors from noisy da

Externí odkaz: http://arxiv.org/abs/2405.17476

Zobrazit plný text záznamu

Report

CarDreamer: Open-Source Learning Platform for World Model based Autonomous Driving

Autor: Gao, Dechen, Cai, Shuangyu, Zhou, Hanchu, Wang, Hang, Soltani, Iman, Zhang, Junshan

To safely navigate intricate real-world scenarios, autonomous vehicles must be able to adapt to diverse road conditions and anticipate future events. World model (WM) based reinforcement learning (RL) has emerged as a promising approach by learning a

Externí odkaz: http://arxiv.org/abs/2405.09111

Zobrazit plný text záznamu

Report

Communication-Efficient Training Workload Balancing for Decentralized Multi-Agent Learning

Autor: Mohammadabadi, Seyed Mahmoud Sajjadi, Yang, Lei, Yan, Feng, Zhang, Junshan

Decentralized Multi-agent Learning (DML) enables collaborative model training while preserving data privacy. However, inherent heterogeneity in agents' resources (computation, communication, and task size) may lead to substantial variations in traini

Externí odkaz: http://arxiv.org/abs/2405.00839

Zobrazit plný text záznamu

Report

Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback

Autor: Wang, Hang, Lin, Sen, Zhang, Junshan

The ensemble method is a promising way to mitigate the overestimation issue in Q-learning, where multiple function approximators are used to estimate the action values. It is known that the estimation bias hinges heavily on the ensemble size (i.e., t

Externí odkaz: http://arxiv.org/abs/2306.11918

Zobrazit plný text záznamu

Report

Warm-Start Actor-Critic: From Approximation Error to Sub-optimality Gap

Autor: Wang, Hang, Lin, Sen, Zhang, Junshan

Warm-Start reinforcement learning (RL), aided by a prior policy obtained from offline training, is emerging as a promising RL approach for practical applications. Recent empirical studies have demonstrated that the performance of Warm-Start RL can be

Externí odkaz: http://arxiv.org/abs/2306.11271

Zobrazit plný text záznamu

Report

Efficient Self-supervised Continual Learning with Progressive Task-correlated Layer Freezing

Autor: Yang, Li, Lin, Sen, Zhang, Fan, Zhang, Junshan, Fan, Deliang

Inspired by the success of Self-supervised learning (SSL) in learning visual representations from unlabeled data, a few recent works have studied SSL in the context of continual learning (CL), where multiple tasks are learned sequentially, giving ris

Externí odkaz: http://arxiv.org/abs/2303.07477

Zobrazit plný text záznamu

Report

CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning

Autor: Yue, Sheng, Wang, Guanbo, Shao, Wei, Zhang, Zhaofeng, Lin, Sen, Ren, Ju, Zhang, Junshan

This work aims to tackle a major challenge in offline Inverse Reinforcement Learning (IRL), namely the reward extrapolation error, where the learned reward function may fail to explain the task correctly and misguide the agent in unseen environments

Externí odkaz: http://arxiv.org/abs/2302.04782

Zobrazit plný text záznamu

Report

Algorithm Design for Online Meta-Learning with Task Boundary Detection

Autor: Sow, Daouda, Lin, Sen, Liang, Yingbin, Zhang, Junshan

Online meta-learning has recently emerged as a marriage between batch meta-learning and online learning, for achieving the capability of quick adaptation on new tasks in a lifelong manner. However, most existing approaches focus on the restrictive se

Externí odkaz: http://arxiv.org/abs/2302.00857

Zobrazit plný text záznamu

Report

HiFlash: Communication-Efficient Hierarchical Federated Learning with Adaptive Staleness Control and Heterogeneity-aware Client-Edge Association

Autor: Wu, Qiong, Chen, Xu, Ouyang, Tao, Zhou, Zhi, Zhang, Xiaoxi, Yang, Shusen, Zhang, Junshan

Federated learning (FL) is a promising paradigm that enables collaboratively learning a shared model across massive clients while keeping the training data locally. However, for many existing FL systems, clients need to frequently exchange model para

Externí odkaz: http://arxiv.org/abs/2301.06447

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání