Showing 1 - 10 of 1,695 for search: '"LIM, HAN"'
Author:
Lim, Han-Dong, Lee, Donghwan
Multi-agent reinforcement learning (MARL) has witnessed a remarkable surge in interest, fueled by the empirical success achieved in applications of single-agent reinforcement learning (RL). In this study, we consider a distributed Q-learning scenario…
External link:
http://arxiv.org/abs/2405.14078
Reinforcement learning has witnessed significant advancements, particularly with the emergence of model-based approaches. Among these, $Q$-learning has proven to be a powerful algorithm in model-free settings. However, the extension of $Q$-learning t…
External link:
http://arxiv.org/abs/2402.11877
Author:
Lim, Han-Dong, Lee, Donghwan
The goal of this paper is to investigate distributed temporal difference (TD) learning for a networked multi-agent Markov decision process. The proposed approach is based on distributed optimization algorithms, which can be interpreted as primal-dual…
External link:
http://arxiv.org/abs/2310.00638
The main goal of this paper is to investigate continuous-time distributed dynamic programming (DP) algorithms for networked multi-agent Markov decision problems (MAMDPs). In our study, we adopt a distributed multi-agent framework where individual age…
External link:
http://arxiv.org/abs/2307.16706
Author:
Lim, Han-Dong, Lee, Donghwan
Temporal-difference (TD) learning is widely regarded as one of the most popular algorithms in reinforcement learning (RL). Despite its widespread use, it has only been recently that researchers have begun to actively study its finite-time behavior, i…
External link:
http://arxiv.org/abs/2306.09746
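As context for the entry above, the TD(0) update it studies can be sketched in its standard textbook form. This is a generic illustration, not code from the cited paper; the sample format and parameter values are assumptions:

```python
import numpy as np

def td0_evaluate(transitions, n_states, alpha=0.1, gamma=0.99):
    """One pass of tabular TD(0) policy evaluation.

    `transitions` is a list of (s, r, s_next, done) samples collected
    under the policy being evaluated (a hypothetical input format).
    """
    V = np.zeros(n_states)
    for s, r, s_next, done in transitions:
        # Bootstrapped target: immediate reward plus discounted next-state value
        target = r + (0.0 if done else gamma * V[s_next])
        # Move V[s] a small step alpha toward the target (the TD(0) update)
        V[s] += alpha * (target - V[s])
    return V
```

Finite-time analyses of the kind referenced above bound how fast `V` approaches the true value function as a function of `alpha` and the number of samples.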
Author:
Lim, Han-Dong, Lee, Donghwan
Off-policy learning ability is an important feature of reinforcement learning (RL) for practical applications. However, even one of the most elementary RL algorithms, temporal-difference (TD) learning, is known to suffer from a divergence issue when th…
External link:
http://arxiv.org/abs/2302.09875
Author:
Lim, Han-Dong, Lee, Donghwan
Q-learning has long been one of the most popular reinforcement learning algorithms, and theoretical analysis of Q-learning has been an active research topic for decades. Although research on the asymptotic convergence of Q-learning has a long…
External link:
http://arxiv.org/abs/2207.12217
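The entry above concerns convergence analysis of Q-learning; for reference, the tabular Q-learning update being analyzed is the classic one below. This is a standard textbook sketch, not the paper's method, and the function name and parameters are illustrative:

```python
import numpy as np

def q_learning_step(Q, s, a, r, s_next, done, alpha=0.1, gamma=0.99):
    """Single tabular Q-learning update on table Q of shape (n_states, n_actions)."""
    # Greedy bootstrapped target: reward plus discounted max over next-state actions
    target = r if done else r + gamma * np.max(Q[s_next])
    # Move Q[s, a] a step of size alpha toward the target
    Q[s, a] += alpha * (target - Q[s, a])
    return Q
```

Asymptotic convergence results of the kind referenced above show that, under suitable step-size conditions and sufficient exploration, `Q` converges to the optimal action-value function.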
Author:
Lim, Han-Dong, Lee, Donghwan
Q-learning is a widely used algorithm in the reinforcement learning community. In the lookup-table setting, its convergence is well established. However, its behavior is known to be unstable in the linear function approximation case. This paper develo…
External link:
http://arxiv.org/abs/2202.05404
Author:
Lim, Han Yin, Dolzhenko, Anton V.
Published in:
In European Journal of Medicinal Chemistry 5 October 2024 276
Published in:
In Ophthalmology Science September-October 2024 4(5)