Humans forage for reward in reinforcement learning tasks.

Autor: Zid M; Department of Neuroscience, University of Montreal, Montreal, QC , H3T 1J4, Canada., Laurie VJ; Department of Neuroscience, University of Montreal, Montreal, QC , H3T 1J4, Canada., Levine-Champagne A; Department of Neuroscience, University of Montreal, Montreal, QC , H3T 1J4, Canada., Shourkeshti A; Department of Neuroscience, University of Montreal, Montreal, QC , H3T 1J4, Canada., Harrell D; Department of Psychiatry, University of Minnesota, Minneapolis, MN, 55455, USA., Herman AB; Department of Psychiatry, University of Minnesota, Minneapolis, MN, 55455, USA., Ebitz RB; Department of Neuroscience, University of Montreal, Montreal, QC , H3T 1J4, Canada.
Jazyk: angličtina
Zdroj: BioRxiv : the preprint server for biology [bioRxiv] 2024 Jul 08. Date of Electronic Publication: 2024 Jul 08.
DOI: 10.1101/2024.07.08.602539
Abstrakt: How do we make good decisions in uncertain environments? In psychology and neuroscience, the classic answer is that we calculate the value of each option and then compare the values to choose the most rewarding, modulo some exploratory noise. An ethologist, conversely, would argue that we commit to one option until its value drops below a threshold, at which point we start exploring other options. In order to determine which view better describes human decision-making, we developed a novel, foraging-inspired sequential decision-making model and used it to ask whether humans compare to threshold ("Forage") or compare alternatives ("Reinforcement-Learn" [RL]). We found that the foraging model was a better fit for participant behavior, better predicted the participants' tendency to repeat choices, and predicted the existence of held-out participants with a pattern of choice that was almost impossible under RL. Together, these results suggest that humans use foraging computations, rather than RL, even in classic reinforcement learning tasks.
Competing Interests: Declaration of Interest The authors have no competing interests to disclose.
Databáze: MEDLINE