Superstitious learning of abstract order from random reinforcement.

Autor: Jin Y; Department of Biological Sciences, Columbia University, New York, NY 10027.; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027., Jensen G; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027.; Department of Psychology, Reed College, Portland, OR 97202.; Department of Neuroscience, Columbia University, New York, NY 10027., Gottlieb J; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027.; Department of Neuroscience, Columbia University, New York, NY 10027.; Kavli Institute for Brain Science, Columbia University, New York, NY 10027., Ferrera V; Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027.; Department of Neuroscience, Columbia University, New York, NY 10027.; Kavli Institute for Brain Science, Columbia University, New York, NY 10027.
Jazyk: angličtina
Zdroj: Proceedings of the National Academy of Sciences of the United States of America [Proc Natl Acad Sci U S A] 2022 Aug 30; Vol. 119 (35), pp. e2202789119. Date of Electronic Publication: 2022 Aug 23.
DOI: 10.1073/pnas.2202789119
Abstrakt: Humans and other animals often infer spurious associations among unrelated events. However, such superstitious learning is usually accounted for by conditioned associations, raising the question of whether an animal could develop more complex cognitive structures independent of reinforcement. Here, we tasked monkeys with discovering the serial order of two pictorial sets: a "learnable" set in which the stimuli were implicitly ordered and monkeys were rewarded for choosing the higher-rank stimulus and an "unlearnable" set in which stimuli were unordered and feedback was random regardless of the choice. We replicated prior results that monkeys reliably learned the implicit order of the learnable set. Surprisingly, the monkeys behaved as though some ordering also existed in the unlearnable set, showing consistent choice preference that transferred to novel untrained pairs in this set, even under a preference-discouraging reward schedule that gave rewards more frequently to the stimulus that was selected less often. In simulations, a model-free reinforcement learning algorithm ( Q -learning) displayed a degree of consistent ordering among the unlearnable set but, unlike the monkeys, failed to do so under the preference-discouraging reward schedule. Our results suggest that monkeys infer abstract structures from objectively random events using heuristics that extend beyond stimulus-outcome conditional learning to more cognitive model-based learning mechanisms.
Databáze: MEDLINE