Adaptive machine learning for protein engineering.

Journal: Current opinion in structural biology

Published Date: Dec 9, 2021

Abstract

Machine-learning models that learn from data to predict how protein sequence encodes function are emerging as a useful protein engineering tool. However, when using these models to suggest new protein designs, one must deal with the vast combinatorial complexity of protein sequences. Here, we review how to use a sequence-to-function machine-learning surrogate model to select sequences for experimental measurement. First, we discuss how to select sequences through a single round of machine-learning optimization. Then, we discuss sequential optimization, where the goal is to discover optimized sequences and improve the model across multiple rounds of training, optimization, and experimental measurement.

Authors

Brian L Hie

Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA, 02139, USA.
Kevin K Yang

Division of Chemistry and Chemical Engineering; California Institute of Technology; Pasadena, California; United States of America.

Keywords

Amino Acid Sequence Machine Learning Protein Engineering Proteins

External Resources

View on PubMed Access via DOI PubMed (34896756)

Adaptive machine learning for protein engineering.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals