Showing 1 - 10 of 328 for search: '"Pan, Yuxin"'
Multi-objective reinforcement learning (MORL) excels at handling rapidly changing preferences in tasks that involve multiple criteria, even for unseen preferences. However, previous dominating MORL methods typically generate a fixed policy set or pre
External link:
http://arxiv.org/abs/2410.02236
Hierarchical Imitation Learning (HIL) is a promising approach for tackling long-horizon decision-making tasks. While it is a challenging task due to the lack of detailed supervisory labels for sub-goal learning, and reliance on hundreds to thousands
External link:
http://arxiv.org/abs/2410.02231
In the context of charging electric vehicles (EVs), the price-based demand response (PBDR) is becoming increasingly significant for charging load management. Such response usually encourages cost-sensitive customers to adjust their energy demand in r
External link:
http://arxiv.org/abs/2404.10311
Inverse reinforcement learning (IRL) aims to explicitly infer an underlying reward function based on collected expert demonstrations. Considering that obtaining expert demonstrations can be costly, the focus of current IRL techniques is on learning a
External link:
http://arxiv.org/abs/2310.08823
Designing effective policies for the online 3D bin packing problem (3D-BPP) has been a long-standing challenge, primarily due to the unpredictable nature of incoming box sequences and stringent physical constraints. While current deep reinforcement l
External link:
http://arxiv.org/abs/2310.04323
Demand flexibility plays a vital role in maintaining grid balance, reducing peak demand, and saving customers' energy bills. Given their highly shiftable load and significant contribution to a building's energy consumption, Heating, Ventilation, and
External link:
http://arxiv.org/abs/2306.16619
Author:
Ma, Li, Pan, Yuxin
In this paper, we study evolved surfaces over convex planar domains which are evolving by the minimal surface flow $$u_{t}= \operatorname{div}\left(\frac{Du}{\sqrt{1+|Du|^2}}\right)-H(x,Du).$$ Here, we specify the angle of contact of the evolved surface to the boun
External link:
http://arxiv.org/abs/2304.06542
Published in:
2020 IEEE Intelligent Vehicles Symposium (IV), 2020, pp. 666-671
In this paper, we propose a lightweight system, RDS-SLAM, based on ORB-SLAM2, which can accurately estimate poses and build semantic maps at object level for dynamic scenarios in real time using only one commonly used Intel Core i7 CPU. In RDS-SLAM,
External link:
http://arxiv.org/abs/2210.04562
Author:
Pan, Yuxin, Lin, Fangzhen
Traditional model-based reinforcement learning (RL) methods generate forward rollout traces using the learnt dynamics model to reduce interactions with the real environment. The recent model-based RL method considers the way to learn a backward model
External link:
http://arxiv.org/abs/2208.02434
Author:
Pan, Yuxin, You, Qi, Tang, Zhixian, Wei, Qiuyun, Lin, Hui, Zhang, Dawei, Xu, Xiaodong, Yao, Xufeng
Published in:
In Ceramics International 15 November 2024 50(22) Part C:48655-48661