A Caltech Library Service

Protein sequence design with deep generative models

Wu, Zachary and Johnston, Kadina E. and Arnold, Frances H. and Yang, Kevin K. (2021) Protein sequence design with deep generative models. (Unpublished)

PDF - Submitted Version
Creative Commons Attribution Non-commercial No Derivatives.


Protein engineering seeks to identify protein sequences with optimized properties. When guided by machine learning, protein sequence generation methods can draw on prior knowledge and experimental efforts to improve this process. In this review, we highlight recent applications of machine learning to generate protein sequences, focusing on the emerging field of deep generative methods.

Item Type: Report or Paper (Discussion Paper)
Related URLs:
URL, URL Type, Description: Paper
ORCID:
Wu, Zachary: 0000-0003-2429-9812
Arnold, Frances H.: 0000-0002-4027-364X
Yang, Kevin K.: 0000-0001-9045-6826
Additional Information: Attribution-NonCommercial-NoDerivatives 4.0 International (CC BY-NC-ND 4.0). The authors wish to thank members Lucas Schaus and Sabine Brinkmann-Chen for feedback on early drafts. This work is supported by the Camille and Henry Dreyfus Foundation (ML-20-194) and the NSF Division of Chemical, Bioengineering, Environmental, and Transport Systems (1937902).
Funding Agency: Camille and Henry Dreyfus Foundation
Grant Number: ML-20-194
Record Number: CaltechAUTHORS:20210413-080510593
Persistent URL:
Usage Policy: No commercial reproduction, distribution, display or performance rights in this work are provided.
ID Code: 108709
Deposited By: Tony Diaz
Deposited On: 13 Apr 2021 21:45
Last Modified: 19 Apr 2021 22:44
