Search results

Report

Infinigen Indoors: Photorealistic Indoor Scenes using Procedural Generation

Authors: Raistrick, Alexander; Mei, Lingjie; Kayan, Karhan; Yan, David; Zuo, Yiming; Han, Beining; Wen, Hongyu; Parakh, Meenal; Alexandropoulos, Stamatis; Lipson, Lahav; Ma, Zeyu; Deng, Jia

We introduce Infinigen Indoors, a Blender-based procedural generator of photorealistic indoor scenes. It builds upon the existing Infinigen system, which focuses on natural scenes, but expands its coverage to indoor scenes by introducing a diverse li

External link: http://arxiv.org/abs/2406.11824

Report

FetchBench: A Simulation Benchmark for Robot Fetching

Authors: Han, Beining; Parakh, Meenal; Geng, Derek; Defay, Jack A; Luyang, Gan; Deng, Jia

Fetching, which includes approaching, grasping, and retrieving, is a critical challenge for robot manipulation tasks. Existing methods primarily focus on table-top scenarios, which do not adequately capture the complexities of environments where both

External link: http://arxiv.org/abs/2406.11793

Report

Infinite Photorealistic Worlds using Procedural Generation

Authors: Raistrick, Alexander; Lipson, Lahav; Ma, Zeyu; Mei, Lingjie; Wang, Mingzhe; Zuo, Yiming; Kayan, Karhan; Wen, Hongyu; Han, Beining; Wang, Yihan; Newell, Alejandro; Law, Hei; Goyal, Ankit; Yang, Kaiyu; Deng, Jia

We introduce Infinigen, a procedural generator of photorealistic 3D scenes of the natural world. Infinigen is entirely procedural: every asset, from shape to texture, is generated from scratch via randomized mathematical rules, using no external sour

External link: http://arxiv.org/abs/2306.09310

Report

Learning Domain Invariant Representations in Goal-conditioned Block MDPs

Authors: Han, Beining; Zheng, Chongyi; Chan, Harris; Paster, Keiran; Zhang, Michael R.; Ba, Jimmy

Published in: NeurIPS 2021

Deep Reinforcement Learning (RL) is successful in solving many complex Markov Decision Processes (MDPs) problems. However, agents often face unanticipated environmental changes after deployment in the real world. These changes are often spurious and

External link: http://arxiv.org/abs/2110.14248

Report

On the Estimation Bias in Double Q-Learning

Authors: Ren, Zhizhou; Zhu, Guangxiang; Hu, Hao; Han, Beining; Chen, Jianglun; Zhang, Chongjie

Double Q-learning is a classical method for reducing overestimation bias, which is caused by taking maximum estimated values in the Bellman operation. Its variants in the deep Q-learning paradigm have shown great promise in producing reliable value p

External link: http://arxiv.org/abs/2109.14419

Report

Off-Policy Reinforcement Learning with Delayed Rewards

Authors: Han, Beining; Ren, Zhizhou; Wu, Zuofan; Zhou, Yuan; Peng, Jian

We study deep reinforcement learning (RL) algorithms with delayed rewards. In many real-world tasks, instant rewards are often not readily accessible or even defined immediately after the agent performs actions. In this work, we first formally define

External link: http://arxiv.org/abs/2106.11854

Report

Off-Policy Multi-Agent Decomposed Policy Gradients

Authors: Wang, Yihan; Han, Beining; Wang, Tonghan; Dong, Heng; Zhang, Chongjie

Multi-agent policy gradient (MAPG) methods recently witness vigorous progress. However, there is a significant performance discrepancy between MAPG methods and state-of-the-art multi-agent value-based approaches. In this paper, we investigate causes

External link: http://arxiv.org/abs/2007.12322

Report

Towards Understanding Cooperative Multi-Agent Q-Learning with Value Factorization

Authors: Wang, Jianhao; Ren, Zhizhou; Han, Beining; Ye, Jianing; Zhang, Chongjie

Value factorization is a popular and promising approach to scaling up multi-agent reinforcement learning in cooperative settings, which balances the learning scalability and the representational capacity of value functions. However, the theoretical u

External link: http://arxiv.org/abs/2006.00587
