Machine learning to design integral membrane channelrhodopsins for efficient eukaryotic expression and plasma membrane localization
Abstract
There is growing interest in studying and engineering integral membrane proteins (MPs) that play key roles in sensing and regulating cellular response to diverse external signals. An MP must be expressed, correctly inserted and folded in a lipid bilayer, and trafficked to the proper cellular location in order to function. The sequence and structural determinants of these processes are complex and highly constrained. Here we describe a predictive, machine-learning approach that captures this complexity to facilitate successful MP engineering and design. Machine learning on carefully chosen training sequences made by structure-guided SCHEMA recombination has enabled us to accurately predict the rare sequences in a diverse library of channelrhodopsins (ChRs) that express and localize to the plasma membrane of mammalian cells. These light-gated channel proteins of microbial origin are of interest for neuroscience applications, where expression and localization to the plasma membrane are prerequisites for function. We trained Gaussian process (GP) classification and regression models with expression and localization data from 218 ChR chimeras chosen from a 118,098-variant library designed by SCHEMA recombination of three parent ChRs. We used these GP models to identify ChRs that express and localize well and show that our models can elucidate sequence and structure elements important for these processes. We also used the predictive models to convert a naturally occurring ChR incapable of mammalian localization into one that localizes well.
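To make the modeling step in the abstract concrete, the following is a minimal sketch of GP regression over recombination chimeras. It is not the authors' implementation. The library layout (10 blocks per chimera, two recombination designs, so that 2 × 3^10 = 118,098), the one-hot block encoding, the squared-exponential kernel, its length scale, the noise level, and the synthetic labels are all assumptions chosen for illustration. The classification model mentioned in the abstract is omitted.

```python
import numpy as np

# Assumed layout: each chimera picks one of 3 parents for each of 10 blocks.
N_BLOCKS, N_PARENTS = 10, 3


def one_hot(chimeras):
    """Encode block-to-parent assignments, shape (n, N_BLOCKS), as flat one-hot rows."""
    chimeras = np.asarray(chimeras)
    return np.eye(N_PARENTS)[chimeras].reshape(len(chimeras), -1)


def rbf_kernel(a, b, length_scale=2.0):
    """Squared-exponential kernel between the rows of a and the rows of b."""
    sq_dist = (a**2).sum(1)[:, None] + (b**2).sum(1)[None, :] - 2.0 * a @ b.T
    return np.exp(-0.5 * np.maximum(sq_dist, 0.0) / length_scale**2)


def gp_posterior(x_train, y_train, x_test, noise=1e-2):
    """Posterior mean and variance of a zero-mean GP at x_test."""
    k_tt = rbf_kernel(x_train, x_train) + noise * np.eye(len(x_train))
    k_st = rbf_kernel(x_test, x_train)
    chol = np.linalg.cholesky(k_tt)
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, y_train))
    mean = k_st @ alpha
    v = np.linalg.solve(chol, k_st.T)
    var = 1.0 - (v**2).sum(0)  # prior variance k(x, x) is 1 for this kernel
    return mean, var


# Synthetic stand-in for measured data: the label is the fraction of blocks
# inherited from parent 0. Real labels would be expression or localization.
rng = np.random.default_rng(0)
train = rng.integers(0, N_PARENTS, size=(40, N_BLOCKS))
y = (train == 0).mean(axis=1)

# Score unseen candidates and rank them by predicted property.
candidates = rng.integers(0, N_PARENTS, size=(5, N_BLOCKS))
mean, var = gp_posterior(one_hot(train), y - y.mean(), one_hot(candidates))
mean = mean + y.mean()
ranking = np.argsort(-mean)  # candidate indices, best predicted first
```

The posterior variance gives a per-candidate uncertainty, which is one reason a GP suits small training sets such as the 218 measured chimeras: candidates can be ranked by predicted value while still flagging those far from any training sequence.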
Additional Information
© 2017 Bedbrook et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Received: August 9, 2017; Accepted: September 21, 2017; Published: October 23, 2017.
We thank Twist Bioscience for synthesizing and cloning ChR sequences as part of their α and β manufacturing programs. We thank the Gradinaru and Arnold labs for helpful discussions. We also thank Dr. John Bedbrook for critical reading of the manuscript. Imaging was performed in the Biological Imaging Facility, with the support of the Caltech Beckman Institute and the Arnold and Mabel Beckman Foundation.
Data Availability: All relevant data are either within the paper and its Supporting Information files or published in ref 5.
This work is funded by the National Institute for Mental Health R21MH103824 (VG and FHA) and the Institute for Collaborative Biotechnologies through grant number W911F-09-0001 from the U.S. Army Research Office (FHA). The content is solely the responsibility of the authors and does not necessarily reflect the position or policy of the National Center for Research Resources, the National Institutes of Health, or the Government, and no official endorsement should be inferred. VG is a Heritage Principal Investigator supported by the Heritage Medical Research Institute. CNB and AJR are funded by Ruth L. Kirschstein National Research Service Awards (F31MH102913 and F32GM116319, respectively). KKY is a trainee in the Caltech Biotechnology Leadership Program, and has received financial support from the Donna and Benjamin M. Rosen Bioengineering Center. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Authors declare no competing interests.
Author Contributions: Conceptualization: Claire N. Bedbrook, Kevin K. Yang, Austin J. Rice, Viviana Gradinaru, Frances H. Arnold. Formal analysis: Claire N. Bedbrook, Kevin K. Yang. Methodology: Claire N. Bedbrook, Kevin K. Yang, Austin J. Rice. Project administration: Frances H. Arnold. Software: Claire N. Bedbrook, Kevin K. Yang. Supervision: Viviana Gradinaru, Frances H. Arnold. Visualization: Claire N. Bedbrook, Kevin K. Yang, Austin J. Rice. Writing – original draft: Claire N. Bedbrook, Kevin K. Yang. Writing – review & editing: Claire N. Bedbrook, Kevin K. Yang, Austin J. Rice, Viviana Gradinaru, Frances H. Arnold.
Attached Files
Published - journal.pcbi.1005786.pdf
Supplemental Material - journal.pcbi.1005786.s001.csv
Supplemental Material - journal.pcbi.1005786.s002.tif
Supplemental Material - journal.pcbi.1005786.s003.tif
Supplemental Material - journal.pcbi.1005786.s004.tif
Supplemental Material - journal.pcbi.1005786.s005.tif
Supplemental Material - journal.pcbi.1005786.s006.tif
Supplemental Material - journal.pcbi.1005786.s007.tif
Supplemental Material - journal.pcbi.1005786.s008.tif
Supplemental Material - journal.pcbi.1005786.s009.tif
Supplemental Material - journal.pcbi.1005786.s010.tif
Supplemental Material - journal.pcbi.1005786.s011.tif
Supplemental Material - journal.pcbi.1005786.s012.tif
Supplemental Material - journal.pcbi.1005786.s013.tif
Files
Checksum | Size |
---|---|
md5:48eb03ee7592689e972514a07ebef521 | 544.5 kB |
md5:d425d26102d0f664ac826fc828881523 | 1.4 MB |
md5:b90c1a2f38a198c261456a93622365ac | 5.2 MB |
md5:ed9b7dfe07111fb039cfbd8fb30f0c34 | 2.0 MB |
md5:4a2be7f48c85aad72449e6e0b9f327ae | 54.0 kB |
md5:69fae4cc3c2a7fff51d67c2eb0f8e506 | 989.9 kB |
md5:8c27bef00b29f82c3b1bf3747333c101 | 1.5 MB |
md5:048b41a782ab56bf237c84ce4f3393d1 | 1.7 MB |
md5:a1ba791e7b19d38cc56f30c1e156df10 | 442.1 kB |
md5:93d7eeb4848bf308af4981ae31f3d2f1 | 4.3 MB |
md5:dbf9789a191c72d9efd64a6416a01165 | 223.3 kB |
md5:b00908bc67338f8483416bcbd073f64b | 824.5 kB |
md5:ea47c76247dd58a26aa22d0ca530c188 | 3.2 MB |
md5:86b870f0a04e6ede2d9311f0c6a53592 | 2.1 MB |
Additional details
- PMCID
- PMC5695628
- Eprint ID
- 82781
- Resolver ID
- CaltechAUTHORS:20171030-140148566
- NIH
- R21MH103824
- Army Research Office (ARO)
- W911F-09-0001
- Heritage Medical Research Institute
- NIH Predoctoral Fellowship
- F31MH102913
- NIH Postdoctoral Fellowship
- F32GM116319
- Donna and Benjamin M. Rosen Bioengineering Center
- Caltech Beckman Institute
- Arnold and Mabel Beckman Foundation
- Created
- 2017-10-30 (created from EPrint's datestamp field)
- Updated
- 2023-06-01 (created from EPrint's last_modified field)
- Caltech groups
- Heritage Medical Research Institute, Rosen Bioengineering Center