Výsledky vyhledávání

Report

SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language Understanding

Autor: Luo, Junwei, Pang, Zhen, Zhang, Yongjun, Wang, Tingzhu, Wang, Linlin, Dang, Bo, Lao, Jiangwei, Wang, Jian, Chen, Jingdong, Tan, Yihua, Li, Yansheng

Remote Sensing Large Multi-Modal Models (RSLMMs) are developing rapidly and showcase significant capabilities in remote sensing imagery (RSI) comprehension. However, due to the limitations of existing datasets, RSLMMs have shortcomings in understandi

Externí odkaz: http://arxiv.org/abs/2406.10100

Zobrazit plný text záznamu

Report

On Efficient Reinforcement Learning for Full-length Game of StarCraft II

Autor: Liu, Ruo-Ze, Pang, Zhen-Jia, Meng, Zhou-Yu, Wang, Wenhai, Yu, Yang, Lu, Tong

Publikováno v: JAIR, 75 (2022), 213-260

StarCraft II (SC2) poses a grand challenge for reinforcement learning (RL), of which the main difficulties include huge state space, varying action space, and a long time horizon. In this work, we investigate a set of RL techniques for the full-lengt

Externí odkaz: http://arxiv.org/abs/2209.11553

Zobrazit plný text záznamu

Akademický článek

A novel kind of hollow SiO2@g-C3N4@TiO2 microspheres for rapid removal of dyes and antibiotics in water under sunlight

Autor: Zhou, Hao-Yu, Feng, Bai-Yu, Pang, Zhen-bin, Han, Guo-Zhi

Publikováno v: In Journal of Alloys and Compounds 5 December 2024 1007

Zobrazit plný text záznamu

Report

AliExpress Learning-To-Rank: Maximizing Online Model Performance without Going Online

Autor: Huzhang, Guangda, Pang, Zhen-Jia, Gao, Yongqing, Liu, Yawen, Shen, Weijie, Zhou, Wen-Ji, Da, Qing, Zeng, An-Xiang, Yu, Han, Yu, Yang, Zhou, Zhi-Hua

Learning-to-rank (LTR) has become a key technology in E-commerce applications. Most existing LTR approaches follow a supervised learning paradigm from offline labeled data collected from the online system. However, it has been noticed that previous L

Externí odkaz: http://arxiv.org/abs/2003.11941

Zobrazit plný text záznamu

Report

Efficient Reinforcement Learning for StarCraft by Abstract Forward Models and Transfer Learning

Autor: Liu, Ruo-Ze, Guo, Haifeng, Ji, Xiaozhong, Yu, Yang, Pang, Zhen-Jia, Xiao, Zitai, Wu, Yuzhou, Lu, Tong

Injecting human knowledge is an effective way to accelerate reinforcement learning (RL). However, these methods are underexplored. This paper presents our discovery that an abstract forward model (thought-game (TG)) combined with transfer learning (T

Externí odkaz: http://arxiv.org/abs/1903.00715

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Report

On Reinforcement Learning for Full-length Game of StarCraft

Autor: Pang, Zhen-Jia, Liu, Ruo-Ze, Meng, Zhou-Yu, Zhang, Yi, Yu, Yang, Lu, Tong

StarCraft II poses a grand challenge for reinforcement learning. The main difficulties of it include huge state and action space and a long-time horizon. In this paper, we investigate a hierarchical reinforcement learning approach for StarCraft II. T

Externí odkaz: http://arxiv.org/abs/1809.09095

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání