Welcome to the new version of CaltechAUTHORS. Login is currently restricted to library staff. If you notice any issues, please email coda@library.caltech.edu
Published February 17, 2021 | Supplemental Material + Accepted Version
Journal Article Open

Using deep reinforcement learning to reveal how the brain encodes abstract state-space representations in high-dimensional environments

Abstract

Humans possess an exceptional aptitude to efficiently make decisions from high-dimensional sensory observations. However, it is unknown how the brain compactly represents the current state of the environment to guide this process. The deep Q-network (DQN) achieves this by capturing highly nonlinear mappings from multivariate inputs to the values of potential actions. We deployed DQN as a model of brain activity and behavior in participants playing three Atari video games during fMRI. Hidden layers of DQN exhibited a striking resemblance to voxel activity in a distributed sensorimotor network, extending throughout the dorsal visual pathway into posterior parietal cortex. Neural state-space representations emerged from nonlinear transformations of the pixel space bridging perception to action and reward. These transformations reshape axes to reflect relevant high-level features and strip away information about task-irrelevant sensory features. Our findings shed light on the neural encoding of task representations for decision-making in real-world situations.

Additional Information

© 2020 Elsevier. Received 10 July 2020, Revised 15 October 2020, Accepted 17 November 2020, Available online 15 December 2020. This work is supported by NIDA (grant R01DA040011 to J.O.D. and L.C.) and the NIMH Caltech Conte Center for the Neurobiology of Social Decision-Making (grant P50 MH094258 to J.O.D.). We would like to thank Kiyohito Iigaya and other members of the O'Doherty lab for helpful feedback and discussions. Author Contributions. L.C., J.C., and J.P.O. designed the project. L.C. and J.C. developed experimental protocol and collected data. L.C. performed the analyses and wrote the draft of the manuscript. L.C., J.C., Y.Y., and J.P.O. discussed analyses and edited the manuscript. J.P.O. acquired funding. The authors declare no competing interests.

Attached Files

Accepted Version - nihms-1650354.pdf

Supplemental Material - 1-s2.0-S0896627320308990-mmc1.pdf

Files

1-s2.0-S0896627320308990-mmc1.pdf
Files (5.0 MB)
Name Size Download all
md5:b16453d113861512882cca7559aca716
1.9 MB Preview Download
md5:4895bbdcf1bff5e126dbaffcbed8dbfe
3.0 MB Preview Download

Additional details

Created:
August 22, 2023
Modified:
October 23, 2023