Using deep reinforcement learning to reveal how the brain encodes abstract state-space representations in high-dimensional environments
Abstract
Humans possess an exceptional aptitude to efficiently make decisions from high-dimensional sensory observations. However, it is unknown how the brain compactly represents the current state of the environment to guide this process. The deep Q-network (DQN) achieves this by capturing highly nonlinear mappings from multivariate inputs to the values of potential actions. We deployed DQN as a model of brain activity and behavior in participants playing three Atari video games during fMRI. Hidden layers of DQN exhibited a striking resemblance to voxel activity in a distributed sensorimotor network, extending throughout the dorsal visual pathway into posterior parietal cortex. Neural state-space representations emerged from nonlinear transformations of the pixel space bridging perception to action and reward. These transformations reshape axes to reflect relevant high-level features and strip away information about task-irrelevant sensory features. Our findings shed light on the neural encoding of task representations for decision-making in real-world situations.
Additional Information
© 2020 Elsevier. Received 10 July 2020, Revised 15 October 2020, Accepted 17 November 2020, Available online 15 December 2020. This work is supported by NIDA (grant R01DA040011 to J.O.D. and L.C.) and the NIMH Caltech Conte Center for the Neurobiology of Social Decision-Making (grant P50 MH094258 to J.O.D.). We would like to thank Kiyohito Iigaya and other members of the O'Doherty lab for helpful feedback and discussions. Author Contributions. L.C., J.C., and J.P.O. designed the project. L.C. and J.C. developed experimental protocol and collected data. L.C. performed the analyses and wrote the draft of the manuscript. L.C., J.C., Y.Y., and J.P.O. discussed analyses and edited the manuscript. J.P.O. acquired funding. The authors declare no competing interests.Attached Files
Accepted Version - nihms-1650354.pdf
Supplemental Material - 1-s2.0-S0896627320308990-mmc1.pdf
Files
Name | Size | Download all |
---|---|---|
md5:b16453d113861512882cca7559aca716
|
1.9 MB | Preview Download |
md5:4895bbdcf1bff5e126dbaffcbed8dbfe
|
3.0 MB | Preview Download |
Additional details
- PMCID
- PMC7897245
- Eprint ID
- 107165
- Resolver ID
- CaltechAUTHORS:20201217-133745087
- NIH
- R01DA040011
- NIH
- P50 MH094258
- Created
-
2020-12-17Created from EPrint's datestamp field
- Updated
-
2022-03-03Created from EPrint's last_modified field
- Caltech groups
- Tianqiao and Chrissy Chen Institute for Neuroscience