Combining Backpropagation with Equilibrium Propagation to improve an Actor-Critic RL framework

Authors:	Yoshimasa Kubo, Eric Chalmers, Artur Luczak
Publication year:	2022
DOI:	10.1101/2022.06.21.496871
Description:	Backpropagation (BP) has been used to train neural networks for many years, allowing them to solve a wide variety of tasks such as image classification, speech recognition, and reinforcement learning. But the biological plausibility of backpropagation as a mechanism of neural learning has been questioned. Equilibrium Propagation (EP) has been proposed as a more biologically plausible alternative and achieves accuracy comparable to backpropagation on the CIFAR-10 image classification task. This study proposes the first EP-based reinforcement learning architecture: an actor-critic architecture with the actor network trained by EP. We show that this model can solve the basic control tasks often used as benchmarks for BP-based models. Interestingly, our trained model demonstrates more consistent high-reward behavior than a comparable model trained exclusively by backpropagation.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::63bc5755e9891037fb27892202c8231d https://doi.org/10.1101/2022.06.21.496871