Search results - "policy search"

Academic article

Policy Search Reinforcement Learning Method in Latent Space

Authors: ZHAO Tingting, WANG Ying, SUN Wei, CHEN Yarui, WANG Yuan, YANG Jucheng

Published in: Jisuanji kexue yu tansuo, Vol 18, Iss 4, Pp 1032-1046 (2024)

Policy search is an efficient learning method in the field of deep reinforcement learning (DRL), which is capable of solving large-scale problems with continuous state and action spaces and widely used in real-world problems. However, such method usu

External link: https://doaj.org/article/38cab6c7b1a642e09717377cb10e2a4f

Academic article

Multi-Agent Guided Deep Reinforcement Learning Approach Against State Perturbed Adversarial Attacks

Authors: Cagri Cerci, Hakan Temeltas

Published in: IEEE Access, Vol 12, Pp 156146-156159 (2024)

Deep reinforcement learning (DRL) algorithms interact with the environment and aim to learn without labeled data. In high-dimensional spaces, they evolve their policies to maximize the rewards they can collect. They have applications in various field

External link: https://doaj.org/article/7beb0371d4ab4e30bf39a0a80112c82d

Academic article

Geometric Reinforcement Learning for Robotic Manipulation

Authors: Naseem Alhousani, Matteo Saveriano, Ibrahim Sevinc, Talha Abdulkuddus, Hatice Kose, Fares J. Abu-Dakka

Published in: IEEE Access, Vol 11, Pp 111492-111505 (2023)

Reinforcement learning (RL) is a popular technique that allows an agent to learn by trial and error while interacting with a dynamic environment. The traditional Reinforcement Learning (RL) approach has been successful in learning and predicting Eucl

External link: https://doaj.org/article/f0c812f1435241d7aa037cd169b74efd

Academic article

Multi-objective optimal design of interbasin water transfers: The Tagus-Segura aqueduct (Spain)

Authors: Carlotta Valerio, Matteo Giuliani, Andrea Castelletti, Alberto Garrido, Lucia De Stefano

Published in: Journal of Hydrology: Regional Studies, Vol 46, Pp 101339 (2023)

Study region: The Tagus-Segura aqueduct (TSA) is a large and strategic water transfer scheme in Spain that connects Entrepeñas and Buendía reservoirs in the Tagus river headwaters to the Segura river basin, a highly stressed Mediterranean area. Stu

External link: https://doaj.org/article/a4635fc89f864767b807ccd07f5e7bab

Academic article

Designing Lookahead Policies for Sequential Decision Problems in Transportation and Logistics

Author: Warren B. Powell

Published in: IEEE Open Journal of Intelligent Transportation Systems, Vol 3, Pp 313-327 (2022)

There is a wide range of sequential decision problems in transportation and logistics that require dealing with uncertainty. There are four classes of policies that we can draw on for different types of decisions, but many problems in transportation

External link: https://doaj.org/article/d6d790db22d94705975507ba8bc1b774

Academic article

A Functional Clipping Approach for Policy Optimization Algorithms

Authors: Wangshu Zhu, Andre Rosendo

Published in: IEEE Access, Vol 9, Pp 96056-96063 (2021)

Proximal policy optimization (PPO) has yielded state-of-the-art results in policy search, a subfield of reinforcement learning, with one of its key points being the use of a surrogate objective function to restrict the step size at each policy update

External link: https://doaj.org/article/38f20ef12085440b9d67d25543fc3b14
