Výsledky vyhledávání - "Rafiee, Banafsheh"

Report

Auxiliary task discovery through generate-and-test

Autor: Rafiee, Banafsheh, Ghiassian, Sina, Jin, Jun, Sutton, Richard, Luo, Jun, White, Adam

In this paper, we explore an approach to auxiliary task discovery in reinforcement learning based on ideas from representation learning. Auxiliary tasks tend to improve data efficiency by forcing the agent to learn auxiliary prediction and control ob

Externí odkaz: http://arxiv.org/abs/2210.14361

Zobrazit plný text záznamu

Report

What makes useful auxiliary tasks in reinforcement learning: investigating the effect of the target policy

Autor: Rafiee, Banafsheh, Jin, Jun, Luo, Jun, White, Adam

Auxiliary tasks have been argued to be useful for representation learning in reinforcement learning. Although many auxiliary tasks have been empirically shown to be effective for accelerating learning on the main task, it is not yet clear what makes

Externí odkaz: http://arxiv.org/abs/2204.00565

Zobrazit plný text záznamu

Report

From Eye-blinks to State Construction: Diagnostic Benchmarks for Online Representation Learning

Autor: Rafiee, Banafsheh, Abbas, Zaheer, Ghiassian, Sina, Kumaraswamy, Raksha, Sutton, Richard, Ludvig, Elliot, White, Adam

We present three new diagnostic prediction problems inspired by classical-conditioning experiments to facilitate research in online prediction learning. Experiments in classical conditioning show that animals such as rabbits, pigeons, and dogs can ma

Externí odkaz: http://arxiv.org/abs/2011.04590

Zobrazit plný text záznamu

Report

Improving Performance in Reinforcement Learning by Breaking Generalization in Neural Networks

Autor: Ghiassian, Sina, Rafiee, Banafsheh, Lo, Yat Long, White, Adam

Reinforcement learning systems require good representations to work well. For decades practical success in reinforcement learning was limited to small domains. Deep reinforcement learning systems, on the other hand, are scalable, not dependent on dom

Externí odkaz: http://arxiv.org/abs/2003.07417

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Report

Two geometric input transformation methods for fast online reinforcement learning with neural nets

Autor: Ghiassian, Sina, Yu, Huizhen, Rafiee, Banafsheh, Sutton, Richard S.

We apply neural nets with ReLU gates in online reinforcement learning. Our goal is to train these networks in an incremental manner, without the computationally expensive experience replay. By studying how individual neural nodes behave in online tra

Externí odkaz: http://arxiv.org/abs/1805.07476

Zobrazit plný text záznamu

Report

A First Empirical Study of Emphatic Temporal Difference Learning

Autor: Ghiassian, Sina, Rafiee, Banafsheh, Sutton, Richard S.

In this paper we present the first empirical study of the emphatic temporal-difference learning algorithm (ETD), comparing it with conventional temporal-difference learning, in particular, with linear TD(0), on on-policy and off-policy variations of

Externí odkaz: http://arxiv.org/abs/1705.04185

Zobrazit plný text záznamu