A general model of hippocampal and dorsal striatal learning and decision making.

Autor: Geerts JP; Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London W1T 4JG, United Kingdom.; Institute of Cognitive Neuroscience, University College London, London WC1N 3AZ, United Kingdom., Chersi F; Institute of Cognitive Neuroscience, University College London, London WC1N 3AZ, United Kingdom.; GrAI Matter Labs, 75012 Paris, France., Stachenfeld KL; DeepMind, London N1C 4AG, United Kingdom., Burgess N; Institute of Cognitive Neuroscience, University College London, London WC1N 3AZ, United Kingdom; n.burgess@ucl.ac.uk.; Sainsbury Wellcome Centre for Neural Circuits and Behaviour, University College London, London W1T 4JG, United Kingdom.
Jazyk: angličtina
Zdroj: Proceedings of the National Academy of Sciences of the United States of America [Proc Natl Acad Sci U S A] 2020 Dec 08; Vol. 117 (49), pp. 31427-31437. Date of Electronic Publication: 2020 Nov 23.
DOI: 10.1073/pnas.2007981117
Abstrakt: Humans and other animals use multiple strategies for making decisions. Reinforcement-learning theory distinguishes between stimulus-response (model-free; MF) learning and deliberative (model-based; MB) planning. The spatial-navigation literature presents a parallel dichotomy between navigation strategies. In "response learning," associated with the dorsolateral striatum (DLS), decisions are anchored to an egocentric reference frame. In "place learning," associated with the hippocampus, decisions are anchored to an allocentric reference frame. Emerging evidence suggests that the contribution of hippocampus to place learning may also underlie its contribution to MB learning by representing relational structure in a cognitive map. Here, we introduce a computational model in which hippocampus subserves place and MB learning by learning a "successor representation" of relational structure between states; DLS implements model-free response learning by learning associations between actions and egocentric representations of landmarks; and action values from either system are weighted by the reliability of its predictions. We show that this model reproduces a range of seemingly disparate behavioral findings in spatial and nonspatial decision tasks and explains the effects of lesions to DLS and hippocampus on these tasks. Furthermore, modeling place cells as driven by boundaries explains the observation that, unlike navigation guided by landmarks, navigation guided by boundaries is robust to "blocking" by prior state-reward associations due to learned associations between place cells. Our model, originally shaped by detailed constraints in the spatial literature, successfully characterizes the hippocampal-striatal system as a general system for decision making via adaptive combination of stimulus-response learning and the use of a cognitive map.
Competing Interests: The authors declare no competing interest.
(Copyright © 2020 the Author(s). Published by PNAS.)
Databáze: MEDLINE