Machine learning for evolutionary-based and physics-inspired protein design: Current and future synergies.

Journal: Current opinion in structural biology
Published Date:

Abstract

Computational protein design facilitates the discovery of novel proteins with prescribed structure and functionality. Exciting designs were recently reported using novel data-driven methodologies that can be roughly divided into two categories: evolutionary-based and physics-inspired approaches. The former infer characteristic sequence features shared by sets of evolutionary-related proteins, such as conserved or coevolving positions, and recombine them to generate candidates with similar structure and function. The latter approaches estimate key biochemical properties, such as structure free energy, conformational entropy, or binding affinities using machine learning surrogates, and optimize them to yield improved designs. Here, we review recent progress along both tracks, discuss their strengths and weaknesses, and highlight opportunities for synergistic approaches.

Authors

  • Cyril Malbranke
    Laboratory of Physics of the Ecole Normale Supérieure, PSL Research, CNRS UMR 8023, Sorbonne Université, Université de Paris, Paris, France; Institut Pasteur, Université Paris Cité, CNRS UMR 6047, Synthetic Biology, 75015 Paris, France. Electronic address: cyril.malbranke@phys.ens.fr.
  • David Bikard
    Synthetic Biology Group, Microbiology Department, Institut Pasteur, Paris, France.
  • Simona Cocco
    Laboratory of Physics of the Ecole Normale Supérieure, CNRS and PSL Research, 75005 Paris, France cocco@lps.ens.fr.
  • Rémi Monasson
    Laboratory of Physics of the Ecole Normale Supérieure, CNRS and PSL Research, 75005 Paris, France monasson@lpt.ens.fr.
  • Jérôme Tubiana
    Laboratory of Physics of the Ecole Normale Supérieure, CNRS and PSL Research, 75005 Paris, France jertubiana@gmail.com.