Search results - "Machado, A. C."

Report

Plastic Learning with Deep Fourier Features

Authors: Lewandowski, Alex; Schuurmans, Dale; Machado, Marlos C.

Deep neural networks can struggle to learn continually in the face of non-stationarity. This phenomenon is known as loss of plasticity. In this paper, we identify underlying principles that lead to plastic algorithms. In particular, we provide theore

External link: http://arxiv.org/abs/2410.20634

Report

Demystifying the Recency Heuristic in Temporal-Difference Learning

Authors: Daley, Brett; Machado, Marlos C.; White, Martha

Published in: Reinforcement Learning Journal, vol. 1, no. 1, 2024

The recency heuristic in reinforcement learning is the assumption that stimuli that occurred closer in time to an acquired reward should be more heavily reinforced. The recency heuristic is one of the key assumptions made by TD($\lambda$), which rein

External link: http://arxiv.org/abs/2406.12284

Report

Learning Continually by Spectral Regularization

Authors: Lewandowski, Alex; Bortkiewicz, Michał; Kumar, Saurabh; György, András; Schuurmans, Dale; Ostaszewski, Mateusz; Machado, Marlos C.

Loss of plasticity is a phenomenon where neural networks can become more difficult to train over the course of learning. Continual learning algorithms seek to mitigate this effect by sustaining good performance while maintaining network trainability.

External link: http://arxiv.org/abs/2406.06811

Report

Radiation hardness of open Fabry-Perot microcavities

Authors: Rodrigues-Machado, Fernanda C.; Janitz, Erika; Bernard, Simon; Bekerat, Hamed; McEwen, Malcolm; Renaud, James; Enger, Shirin A.; Childress, Lilian; Sankey, Jack C.

High-finesse microcavities offer a platform for compact, high-precision sensing by employing high-reflectivity, low-loss mirrors to create effective optical path lengths that are orders of magnitude larger than the device geometry. Here, we investiga

External link: http://arxiv.org/abs/2404.08787

Report

Averaging $n$-step Returns Reduces Variance in Reinforcement Learning

Authors: Daley, Brett; White, Martha; Machado, Marlos C.

Multistep returns, such as $n$-step returns and $\lambda$-returns, are commonly used to improve the sample efficiency of reinforcement learning (RL) methods. The variance of the multistep returns becomes the limiting factor in their length; looking t

External link: http://arxiv.org/abs/2402.03903

Report

GVFs in the Real World: Making Predictions Online for Water Treatment

Authors: Janjua, Muhammad Kamran; Shah, Haseeb; White, Martha; Miahi, Erfan; Machado, Marlos C.; White, Adam

Published in: Machine Learning (2023): 1-31

In this paper we investigate the use of reinforcement-learning based prediction approaches for a real drinking-water treatment plant. Developing such a prediction system is a critical step on the path to optimizing and automating water treatment. Bef

External link: http://arxiv.org/abs/2312.01624

Zobrazit plný text záznamu

Report

Harnessing Discrete Representations For Continual Reinforcement Learning

Authors: Meyer, Edan; White, Adam; Machado, Marlos C.

Reinforcement learning (RL) agents make decisions using nothing but observations from the environment, and consequently, heavily rely on the representations of those observations. Though some recent breakthroughs have used vector-based categorical re

External link: http://arxiv.org/abs/2312.01203

Report

Directions of Curvature as an Explanation for Loss of Plasticity

Authors: Lewandowski, Alex; Tanaka, Haruto; Schuurmans, Dale; Machado, Marlos C.

Loss of plasticity is a phenomenon in which neural networks lose their ability to learn from new experience. Despite being empirically observed in several problem settings, little is understood about the mechanisms that lead to loss of plasticity. In

External link: http://arxiv.org/abs/2312.00246

Report

AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning

Authors: Pramanik, Subhojeet; Elelimy, Esraa; Machado, Marlos C.; White, Adam

In this paper we investigate transformer architectures designed for partially observable online reinforcement learning. The self-attention mechanism in the transformer architecture is capable of capturing long-range dependencies and it is the main re

External link: http://arxiv.org/abs/2310.15719

Report

Proper Laplacian Representation Learning

Authors: Gomez, Diego; Bowling, Michael; Machado, Marlos C.

The ability to learn good representations of states is essential for solving large reinforcement learning problems, where exploration, generalization, and transfer are particularly challenging. The Laplacian representation is a promising approach to

External link: http://arxiv.org/abs/2310.10833
