A robotic model of hippocampal reverse replay for reinforcement learning.

Autor:	Whelan MT; Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom.; Sheffield Robotics, Sheffield, United Kingdom., Jimenez-Rodriguez A; Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom.; Sheffield Robotics, Sheffield, United Kingdom., Prescott TJ; Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom.; Sheffield Robotics, Sheffield, United Kingdom., Vasilaki E; Department of Computer Science, The University of Sheffield, Sheffield, United Kingdom.; Sheffield Robotics, Sheffield, United Kingdom.
Jazyk:	angličtina
Zdroj:	Bioinspiration & biomimetics [Bioinspir Biomim] 2022 Dec 02; Vol. 18 (1). Date of Electronic Publication: 2022 Dec 02.
DOI:	10.1088/1748-3190/ac9ffc
Abstrakt:	Hippocampal reverse replay, a phenomenon in which recently active hippocampal cells reactivate in the reverse order, is thought to contribute to learning, particularly reinforcement learning (RL), in animals. Here, we present a novel computational model which exploits reverse replay to improve stability and performance on a homing task. The model takes inspiration from the hippocampal-striatal network, and learning occurs via a three-factor RL rule. To augment this model with hippocampal reverse replay, we derived a policy gradient learning rule that associates place-cell activity with responses in cells representing actions and a supervised learning rule of the same form, interpreting the replay activity as a 'target' frequency. We evaluated the model using a simulated robot spatial navigation task inspired by the Morris water maze. Results suggest that reverse replay can improve performance stability over multiple trials. Our model exploits reverse reply as an additional source for propagating information about desirable synaptic changes, reducing the requirements for long-time scales in eligibility traces combined with low learning rates. We conclude that reverse replay can positively contribute to RL, although less stable learning is possible in its absence. Analogously, we postulate that reverse replay may enhance RL in the mammalian hippocampal-striatal system rather than provide its core mechanism. (Creative Commons Attribution license.)
Databáze:	MEDLINE
Externí odkaz:	Zobrazit plný text záznamu