Human Dorsal Striatal Activity during Choice Discriminates Reinforcement Learning Behavior from the Gambler's Fallacy
- Creators
- Jessup, Ryan K.
- O'Doherty, John P.
Abstract
Reinforcement learning theory has generated substantial interest in neurobiology, particularly because of the resemblance between phasic dopamine and reward prediction errors. Actor–critic theories have been adapted to account for the functions of the striatum, with parts of the dorsal striatum equated to the actor. Here, we specifically test whether the human dorsal striatum—as predicted by an actor–critic instantiation—is used on a trial-to-trial basis at the time of choice to choose in accordance with reinforcement learning theory, as opposed to a competing strategy: the gambler's fallacy. Using a partial-brain functional magnetic resonance imaging scanning protocol focused on the striatum and other ventral brain areas, we found that the dorsal striatum is more active when choosing consistent with reinforcement learning compared with the competing strategy. Moreover, an overlapping area of dorsal striatum along with the ventral striatum was found to be correlated with reward prediction errors at the time of outcome, as predicted by the actor–critic framework. These findings suggest that the same region of dorsal striatum involved in learning stimulus–response associations may contribute to the control of behavior during choice, thereby using those learned associations. Intriguingly, neither reinforcement learning nor the gambler's fallacy conformed to the optimal choice strategy on the specific decision-making task we used. Thus, the dorsal striatum may contribute to the control of behavior according to reinforcement learning even when the prescriptions of such an algorithm are suboptimal in terms of maximizing future rewards.
Additional Information
© 2011 The Authors. Received Dec. 9, 2010; revised Feb. 7, 2011; accepted March 4, 2011. This work was funded by Science Foundation Ireland Grant 08/IN.1/B1844 to J.P.O. We thank Donal Cahill for his help recruiting participants and collecting data, and Charlotte Prevost for advice on the manuscript. Author contributions: R.K.J. and J.P.O. designed research; R.K.J. performed research; R.K.J. analyzed data; R.K.J. and J.P.O. wrote the paper.Attached Files
Published - Jessup2011p13832J_Neurosci.pdf
Files
Name | Size | Download all |
---|---|---|
md5:a308bbe314819374ae16ea956d1c5cb0
|
853.5 kB | Preview Download |
Additional details
- PMCID
- PMC6622656
- Eprint ID
- 23740
- Resolver ID
- CaltechAUTHORS:20110519-152522041
- Science Foundation, Ireland
- 08/IN1/B1854
- Created
-
2011-05-19Created from EPrint's datestamp field
- Updated
-
2021-11-09Created from EPrint's last_modified field