Výsledky vyhledávání - "Choe, Jean Seong Bjorn"

Report

Average-Reward Maximum Entropy Reinforcement Learning for Underactuated Double Pendulum Tasks

Autor: Choe, Jean Seong Bjorn, Choi, Bumkyu, Kim, Jong-kook

This report presents a solution for the swing-up and stabilisation tasks of the acrobot and the pendubot, developed for the AI Olympics competition at IROS 2024. Our approach employs the Average-Reward Entropy Advantage Policy Optimization (AR-EAPO),

Externí odkaz: http://arxiv.org/abs/2409.08938

Zobrazit plný text záznamu

Report

Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation

Autor: Choe, Jean Seong Bjorn, Kim, Jong-Kook

Entropy Regularisation is a widely adopted technique that enhances policy optimisation performance and stability. A notable form of entropy regularisation is augmenting the objective with an entropy term, thereby simultaneously optimising the expecte

Externí odkaz: http://arxiv.org/abs/2407.18143

Zobrazit plný text záznamu

Report

The Bid Picture: Auction-Inspired Multi-player Generative Adversarial Networks Training

Autor: Shim, Joo Yong, Choe, Jean Seong Bjorn, Kim, Jong-Kook

This article proposes auction-inspired multi-player generative adversarial networks training, which mitigates the mode collapse problem of GANs. Mode collapse occurs when an over-fitted generator generates a limited range of samples, often concentrat

Externí odkaz: http://arxiv.org/abs/2403.13866

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání