Výsledky vyhledávání - "value iteration"

Akademický článek

A tutorial introduction to reinforcement learning

Autor: Mathukumalli Vidyasagar

Publikováno v: SICE Journal of Control, Measurement, and System Integration, Vol 16, Iss 1, Pp 172-191 (2023)

In this paper, we present a brief survey of reinforcement learning, with particular emphasis on stochastic approximation (SA) as a unifying theme. The scope of the paper includes Markov reward processes, Markov decision processes, SA algorithms, and

Externí odkaz: https://doaj.org/article/8e4ee9a0515949188c327645dab0c284

Zobrazit plný text záznamu

Akademický článek

A Hybrid Handover Scheme for Vehicular VLC/RF Communication Networks

Autor: Linqiong Jia, Shicheng Feng, Yijin Zhang, Jin-Yuan Wang

Publikováno v: Sensors, Vol 24, Iss 13, p 4323 (2024)

Visible light communication (VLC) is a promising complementary technology to its radio frequency (RF) counterpart to satisfy the high quality-of-service (QoS) requirements of intelligent vehicular communications by reusing LED street lights. In this

Externí odkaz: https://doaj.org/article/83c77596445e45fbb2415cd66f1d80b2

Zobrazit plný text záznamu

Plný text ve formátu HTML

Akademický článek

Value Iteration Networks With Gated Summarization Module

Autor: Jinyu Cai, Jialong Li, Mingyue Zhang, Kenji Tei

Publikováno v: IEEE Access, Vol 11, Pp 60407-60420 (2023)

In this paper, we address the challenges faced by Value Iteration Networks (VIN) in handling larger input maps and mitigating the impact of accumulated errors caused by increased iterations. We propose a novel approach, Value Iteration Networks with

Externí odkaz: https://doaj.org/article/602aac8d46a84bee9a8c89bbd43ed410

Zobrazit plný text záznamu

Akademický článek

Optimal policy for under frequency load shedding based on heterogeneous Markovian opinion dynamics model

Autor: Muhammad Salman, Ali Nasir

Publikováno v: Alexandria Engineering Journal, Vol 63, Iss , Pp 599-611 (2023)

This paper proposes a Markov Decision Process model for calculation of an optimal policy, for under frequency load shedding problem. Major innovation in the modeling of the problem, is the incorporation of opinion dynamics model for calculation of th

Externí odkaz: https://doaj.org/article/9fa4b77d47bd4f83ba4f7dd99c3b396b

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Deriving the Optimal Strategy for the Two Dice Pig Game via Reinforcement Learning

Autor: Tian Zhu, Merry H. Ma

Publikováno v: Stats, Vol 5, Iss 3, Pp 805-818 (2022)

Games of chance have historically played a critical role in the development and teaching of probability theory and game theory, and, in the modern age, computer programming and reinforcement learning. In this paper, we derive the optimal strategy for

Externí odkaz: https://doaj.org/article/3b6413a538a94b9f9b49079ed1c1e594

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání