Výsledky vyhledávání - "Chung, Wesley"

Report

Parseval Regularization for Continual Reinforcement Learning

Autor: Chung, Wesley, Cherif, Lynn, Meger, David, Precup, Doina

Loss of plasticity, trainability loss, and primacy bias have been identified as issues arising when training deep neural networks on sequences of tasks -- all referring to the increased difficulty in training on new tasks. We propose to use Parseval

Externí odkaz: http://arxiv.org/abs/2412.07224

Zobrazit plný text záznamu

Report

The Role of Baselines in Policy Gradient Optimization

Autor: Mei, Jincheng, Chung, Wesley, Thomas, Valentin, Dai, Bo, Szepesvari, Csaba, Schuurmans, Dale

We study the effect of baselines in on-policy stochastic policy gradient optimization, and close the gap between the theory and practice of policy optimization methods. Our first contribution is to show that the \emph{state value} baseline allows on-

Externí odkaz: http://arxiv.org/abs/2301.06276

Zobrazit plný text záznamu

Report

Beyond variance reduction: Understanding the true impact of baselines on policy optimization

Autor: Chung, Wesley, Thomas, Valentin, Machado, Marlos C., Roux, Nicolas Le

Bandit and reinforcement learning (RL) problems can often be framed as optimization problems where the goal is to maximize average performance while having access only to stochastic estimates of the true gradient. Traditionally, stochastic optimizati

Externí odkaz: http://arxiv.org/abs/2008.13773

Zobrazit plný text záznamu

Report

Incrementally Learning Functions of the Return

Autor: Bennett, Brendan, Chung, Wesley, Zaheer, Muhammad, Liu, Vincent

Temporal difference methods enable efficient estimation of value functions in reinforcement learning in an incremental fashion, and are of broader interest because they correspond learning as observed in biological systems. Standard value functions c

Externí odkaz: http://arxiv.org/abs/1907.04651

Zobrazit plný text záznamu

Report

Importance Resampling for Off-policy Prediction

Autor: Schlegel, Matthew, Chung, Wesley, Graves, Daniel, Qian, Jian, White, Martha

Importance sampling (IS) is a common reweighting strategy for off-policy prediction in reinforcement learning. While it is consistent and unbiased, it can result in high variance updates to the weights for the value function. In this work, we explore

Externí odkaz: http://arxiv.org/abs/1906.04328

Zobrazit plný text záznamu

Report

High-confidence error estimates for learned value functions

Autor: Sajed, Touqir, Chung, Wesley, White, Martha

Estimating the value function for a fixed policy is a fundamental problem in reinforcement learning. Policy evaluation algorithms---to estimate value functions---continue to be developed, to improve convergence rates, improve stability and handle var

Externí odkaz: http://arxiv.org/abs/1808.09127

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Additional file 1 of Communication and role clarity inform TeleICU use: a qualitative analysis of opportunities and barriers in an established program using AACN framework

Autor: Krupp, Anna, Martino, Michael Di, Chung, Wesley, Krisda Chaiyachati, Anish K. Agarwal, Huffenberger, Ann Marie, Laudanski, Krzysztof

Additional file 1. “Tele-medicine – BMC Semi-structured Interview” – Provider and Patient Perspective on the Value of Direct-to-consumer Telehealth for Urgent Care: Telemedicine Provider Semi-structured Interview Guide – V4.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::232b25e81a44cdb4947ec20c8a1b4d0b

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Vyhledávací nástroje:

Upřesnit hledání