Výsledky vyhledávání

Report

C-MORL: Multi-Objective Reinforcement Learning through Efficient Discovery of Pareto Front

Autor: Liu, Ruohong, Pan, Yuxin, Xu, Linjie, Song, Lei, You, Pengcheng, Chen, Yize, Bian, Jiang

Multi-objective reinforcement learning (MORL) excels at handling rapidly changing preferences in tasks that involve multiple criteria, even for unseen preferences. However, previous dominating MORL methods typically generate a fixed policy set or pre

Externí odkaz: http://arxiv.org/abs/2410.02236

Zobrazit plný text záznamu

Report

SEAL: SEmantic-Augmented Imitation Learning via Language Model

Autor: Gu, Chengyang, Pan, Yuxin, Bai, Haotian, Xiong, Hui, Chen, Yize

Hierarchical Imitation Learning (HIL) is a promising approach for tackling long-horizon decision-making tasks. While it is a challenging task due to the lack of detailed supervisory labels for sub-goal learning, and reliance on hundreds to thousands

Externí odkaz: http://arxiv.org/abs/2410.02231

Zobrazit plný text záznamu

Report

Learning and Optimization for Price-based Demand Response of Electric Vehicle Charging

Autor: Gu, Chengyang, Pan, Yuxin, Liu, Ruohong, Chen, Yize

In the context of charging electric vehicles (EVs), the price-based demand response (PBDR) is becoming increasingly significant for charging load management. Such response usually encourages cost-sensitive customers to adjust their energy demand in r

Externí odkaz: http://arxiv.org/abs/2404.10311

Zobrazit plný text záznamu

Report

Distance-rank Aware Sequential Reward Learning for Inverse Reinforcement Learning with Sub-optimal Demonstrations

Autor: Li, Lu, Pan, Yuxin, Chen, Ruobing, Liu, Jie, Wang, Zilin, Liu, Yu, Li, Zhiheng

Inverse reinforcement learning (IRL) aims to explicitly infer an underlying reward function based on collected expert demonstrations. Considering that obtaining expert demonstrations can be costly, the focus of current IRL techniques is on learning a

Externí odkaz: http://arxiv.org/abs/2310.08823

Zobrazit plný text záznamu

Report

Adjustable Robust Reinforcement Learning for Online 3D Bin Packing

Autor: Pan, Yuxin, Chen, Yize, Lin, Fangzhen

Designing effective policies for the online 3D bin packing problem (3D-BPP) has been a long-standing challenge, primarily due to the unpredictable nature of incoming box sequences and stringent physical constraints. While current deep reinforcement l

Externí odkaz: http://arxiv.org/abs/2310.04323

Zobrazit plný text záznamu

Report

Laxity-Aware Scalable Reinforcement Learning for HVAC Control

Autor: Liu, Ruohong, Pan, Yuxin, Chen, Yize

Demand flexibility plays a vital role in maintaining grid balance, reducing peak demand, and saving customers' energy bills. Given their highly shiftable load and significant contribution to a building's energy consumption, Heating, Ventilation, and

Externí odkaz: http://arxiv.org/abs/2306.16619

Zobrazit plný text záznamu

Report

The global solution of the minimal surface flow and translating surfaces

Autor: Ma, Li, Pan, Yuxin

In this paper, we study evolved surfaces over convex planar domains which are evolving by the minimal surface flow $$u_{t}= div\left(\frac{Du}{\sqrt{1+|Du|^2}}\right)-H(x,Du).$$ Here, we specify the angle of contact of the evolved surface to the boun

Externí odkaz: http://arxiv.org/abs/2304.06542

Zobrazit plný text záznamu

Report

Using Detection, Tracking and Prediction in Visual SLAM to Achieve Real-time Semantic Mapping of Dynamic Scenarios

Autor: Chen, Xingyu, Xue, Jianru, Fang, Jianwu, Pan, Yuxin, Zheng, Nanning

Publikováno v: 2020 IEEE Intelligent Vehicles Symposium (IV), 2020, pp. 666-671

In this paper, we propose a lightweight system, RDS-SLAM, based on ORB-SLAM2, which can accurately estimate poses and build semantic maps at object level for dynamic scenarios in real time using only one commonly used Intel Core i7 CPU. In RDS-SLAM,

Externí odkaz: http://arxiv.org/abs/2210.04562

Zobrazit plný text záznamu

Report

Backward Imitation and Forward Reinforcement Learning via Bi-directional Model Rollouts

Autor: Pan, Yuxin, Lin, Fangzhen

Traditional model-based reinforcement learning (RL) methods generate forward rollout traces using the learnt dynamics model to reduce interactions with the real environment. The recent model-based RL method considers the way to learn a backward model

Externí odkaz: http://arxiv.org/abs/2208.02434

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání