Search results - "Policy iteration"

Academic article

Integral reinforcement learning based dynamic stackelberg pursuit-evasion game for unmanned surface vehicles

Authors: Xiaoxiang Hu, Shuaizheng Liu, Jingwen Xu, Bing Xiao, Chenguang Guo

Published in: Alexandria Engineering Journal, Vol 108, Iss , Pp 428-435 (2024)

The dynamic Stackelberg pursuit-evasion (PE) game of unmanned surface vehicles (USVs) is discussed in this paper. An optimal solution method for the USVs’ PE game is proposed. The USVs’ PE game is first described by the pursuit motion on two-di

External link: https://doaj.org/article/7b8011bbe44a45e5925ffa55775fb8f0

Academic article

Bias-free policy evaluation in the discrete-time adaptive linear quadratic optimal control in the presence of stochastic disturbances

Authors: Vina Putri Virgiani, Natsuki Ishigaki, Shiro Masuda

Published in: SICE Journal of Control, Measurement, and System Integration, Vol 17, Iss 1 (2024)

The study proposes an adaptive Linear Quadratic (LQ) optimal regulator for discrete-time linear systems in the presence of stochastic disturbances, using policy iteration with an Actor/Critic structure. The existing deterministic policy iteration metho

External link: https://doaj.org/article/cd249dfa95574946bdcd1fdf19e6f89e

Academic article

An multi-UAV cooperative regional search method based on the track elimination strategy and policy iteration algorithm

Authors: CHEN Xing, CHEN Zhuo, YANG Bowen, LI Aoxiang

Published in: Zhihui kongzhi yu fangzhen, Vol 46, Iss 1, Pp 37-43 (2024)

Multi-UAV cooperative regional search is widely used in military and civilian applications such as search, rescue, patrol, monitoring, and environmental survey. Improving the efficiency of target search remains an open problem. The paper proposes a probability calcula

External link: https://doaj.org/article/4bd96bc33ba744eca197376d23fb88eb

Academic article

Design and Experimental Validation of a Model-Free Controller for Beam Stabilization in Adaptive Optics Systems

Authors: Sicheng Guo, Tao Cheng, Kangjian Yang, Lingxi Kong, Chunxuan Su, Shuai Wang, Ping Yang

Published in: IEEE Photonics Journal, Vol 16, Iss 6, Pp 1-12 (2024)

Stabilization of optical beams has always been a key factor affecting the performance of many optical systems. The adaptive optics (AO) beam stabilization system requires further development to cope with increasingly complex application scenarios and

External link: https://doaj.org/article/ee4eaea5847240b8848bd096badd8497

Academic article

Distributed randomized multiagent policy iteration in reinforcement learning

Author: Weipeng Zhang

Published in: Results in Control and Optimization, Vol 12, Iss , Pp 100255- (2023)

We propose a distributed randomized policy iteration algorithm for infinite horizon dynamic programming problems for which the control at each stage is m-dimensional. The traditional policy iteration algorithm involves performing a minimization over

External link: https://doaj.org/article/f83e627039e143d2b263144636ba6914
