New Frontiers for Machine Learning in Protein Science.

Journal: Journal of molecular biology
PMID:

Abstract

Protein function is fundamentally reliant on inter-molecular interactions that underpin the ability of proteins to form complexes driving biological processes in living cells. Increasingly, such interactions are recognised as being formed between proteins that exist on a broad spectrum of dynamic conformational states and levels of intrinsic disorder. Additionally, the sizes of the structures formed can range from simple binary complexes to large dynamic biomolecular condensates measuring 100 nm or more. Understanding the parameters that govern such interactions, how they form, how they lead to function and what happens when they take place in unintended manners and lead to disease, represent some of the core questions for molecular biosciences. In light of recent advances made in solving the protein folding problem by machine learning methods, we discuss here the challenges and opportunities brought by these new data-driven approaches for the next frontiers of biomolecular science.

Authors

  • Alexey S Morgunov
    Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom.
  • Kadi L Saar
    Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, United Kingdom.
  • Michele Vendruscolo
    Centre for Misfolding Diseases, Department of Chemistry, University of Cambridge, UK. Electronic address: mv245@cam.ac.uk.
  • Tuomas P J Knowles
    Yusuf Hamied Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK.