Pigeon and human performance in a multi-armed bandit task in response to changes in variable interval schedules

Authors: Michael E. Young, Dennis Garlick, Aaron P. Blaisdell, Deborah Racey, Jennifer Pham
Year of publication: 2011
Subject:
Source: Learning & Behavior. 39:245-258
ISSN: 1543-4508, 1543-4494
Description: The tension between exploitation of the best options and exploration of alternatives is a ubiquitous problem that all organisms face. To examine this trade-off across species, pigeons and people were trained on an eight-armed bandit task in which the options were rewarded on a variable interval (VI) schedule. At regular intervals, each option's VI changed, thus encouraging dynamic increases in exploration in response to these anticipated changes. Both species showed sensitivity to the payoffs that was often well modeled by Luce's (1963) decision rule. For pigeons, exploration of alternative options was driven by experienced changes in the payoff schedules, not the beginning of a new session, even though each session signaled a new schedule. In contrast, people quickly learned to explore in response to signaled changes in the payoffs.
Database: OpenAIRE
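
Note: the description refers to Luce's decision rule without stating it. In its basic ratio form, the rule says that the probability of choosing an option equals that option's strength divided by the summed strengths of all options. The Python sketch below illustrates only this basic form for eight options. The strength values, the function names, and the use of a seeded random generator are assumptions made for illustration; they are not taken from the article, and the authors' fitted model may differ (for example, by including additional parameters).

```python
import random


def luce_choice_probabilities(strengths):
    """Return choice probabilities under the basic Luce ratio rule.

    Each probability is the option's strength divided by the sum of
    all strengths. Strengths must be non-negative with a positive sum.
    """
    if any(s < 0 for s in strengths):
        raise ValueError("strengths must be non-negative")
    total = sum(strengths)
    if total <= 0:
        raise ValueError("at least one strength must be positive")
    return [s / total for s in strengths]


def choose_option(strengths, rng):
    """Sample one option index according to the Luce probabilities."""
    probs = luce_choice_probabilities(strengths)
    return rng.choices(range(len(strengths)), weights=probs, k=1)[0]


# Hypothetical strengths for an eight-armed bandit (illustration only).
strengths = [8.0, 4.0, 2.0, 1.0, 1.0, 0.0, 0.0, 0.0]
probs = luce_choice_probabilities(strengths)

rng = random.Random(0)  # seeded so the simulated choices are repeatable
choices = [choose_option(strengths, rng) for _ in range(1000)]
```

Here the first option holds half of the total strength, so the rule assigns it a choice probability of 0.5, while options with zero strength are never predicted to be chosen.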