Showing 1 - 10 of 428 for search: '"Wang, Xinqi"'
Autonomous underwater vehicles (AUVs) are valuable for ocean exploration due to their flexibility and ability to carry communication and detection units. Nevertheless, AUVs alone often face challenges in harsh and extreme sea conditions. This study i
External link:
http://arxiv.org/abs/2409.02444
We initiate the study of Multi-Agent Reinforcement Learning from Human Feedback (MARLHF), exploring both theoretical foundations and empirical validations. We define the task as identifying Nash equilibrium from a preference-only offline dataset in g
External link:
http://arxiv.org/abs/2409.00717
Our project proposes an end-to-end 3D face alignment and reconstruction network. The backbone of our model is built by Bottle-Neck structure via Depth-wise Separable Convolution. We integrate Coordinate Attention mechanism and Spatial Group-wise Enha
External link:
http://arxiv.org/abs/2405.19659
Intelligent agents must be generalists, capable of quickly adapting to various tasks. In reinforcement learning (RL), model-based RL learns a dynamics model of the world, in principle enabling transfer to arbitrary reward functions through planning.
External link:
http://arxiv.org/abs/2403.06328
There emerges a promising trend of using large language models (LLMs) to generate code-like plans for complex inference tasks such as visual reasoning. This paradigm, known as LLM-based planning, provides flexibility in problem solving and endows bet
External link:
http://arxiv.org/abs/2308.09658
This paper presents a systematic study on gap-dependent sample complexity in offline reinforcement learning. Prior work showed when the density ratio between an optimal policy and the behavior policy is upper bounded (the optimal policy coverage assu
External link:
http://arxiv.org/abs/2206.00177
Published in:
In Chemical Engineering Journal 15 October 2024 498
Published in:
In Journal of Hydrology August 2024 639
Published in:
In Engineering Failure Analysis August 2024 162
Published in:
In European Journal of Pharmacology 5 November 2024 982