Improved protein structure prediction using potentials from deep learning.

Journal: Nature
Published Date:

Abstract

Protein structure prediction can be used to determine the three-dimensional shape of a protein from its amino acid sequence. This problem is of fundamental importance as the structure of a protein largely determines its function; however, protein structures can be difficult to determine experimentally. Considerable progress has recently been made by leveraging genetic information. It is possible to infer which amino acid residues are in contact by analysing covariation in homologous sequences, which aids in the prediction of protein structures. Here we show that we can train a neural network to make accurate predictions of the distances between pairs of residues, which convey more information about the structure than contact predictions. Using this information, we construct a potential of mean force that can accurately describe the shape of a protein. We find that the resulting potential can be optimized by a simple gradient descent algorithm to generate structures without complex sampling procedures. The resulting system, named AlphaFold, achieves high accuracy, even for sequences with fewer homologous sequences. In the recent Critical Assessment of Protein Structure Prediction (CASP13)-a blind assessment of the state of the field-AlphaFold created high-accuracy structures (with template modelling (TM) scores of 0.7 or higher) for 24 out of 43 free modelling domains, whereas the next best method, which used sampling and contact information, achieved such accuracy for only 14 out of 43 domains. AlphaFold represents a considerable advance in protein-structure prediction. We expect this increased accuracy to enable insights into the function and malfunction of proteins, especially in cases for which no structures for homologous proteins have been experimentally determined.

Authors

  • Andrew W Senior
    DeepMind, London, UK.
  • Richard Evans
    DeepMind, London, UK.
  • John Jumper
    DeepMind, London, UK.
  • James Kirkpatrick
    DeepMind, London EC4 5TW, United Kingdom; kirkpatrick@google.com.
  • Laurent Sifre
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Tim Green
    DeepMind, London, UK.
  • Chongli Qin
    DeepMind, London, UK.
  • Augustin Žídek
    DeepMind, London, UK.
  • Alexander W R Nelson
    DeepMind, London, UK.
  • Alex Bridgland
    DeepMind, London, UK.
  • Hugo Penedones
    DeepMind, London, UK.
  • Stig Petersen
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Karen Simonyan
    DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Steve Crossan
    DeepMind, London, UK.
  • Pushmeet Kohli
    DeepMind, London, UK.
  • David T Jones
    Department of Computer Science, Bioinformatics Group, University College London, Gower Street, London, WC1E 6BT, United Kingdom. d.t.jones@ucl.ac.uk.
  • David Silver
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Koray Kavukcuoglu
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.
  • Demis Hassabis
    Google DeepMind, 5 New Street Square, London EC4A 3TW, UK.