Protein structure prediction with in-cell photo-crosslinking mass spectrometry and deep learning.

Journal: Nature biotechnology
Published Date:

Abstract

While AlphaFold2 can predict accurate protein structures from the primary sequence, challenges remain for proteins that undergo conformational changes or for which few homologous sequences are known. Here we introduce AlphaLink, a modified version of the AlphaFold2 algorithm that incorporates experimental distance restraint information into its network architecture. By employing sparse experimental contacts as anchor points, AlphaLink improves on the performance of AlphaFold2 in predicting challenging targets. We confirm this experimentally by using the noncanonical amino acid photo-leucine to obtain information on residue-residue contacts inside cells by crosslinking mass spectrometry. The program can predict distinct conformations of proteins on the basis of the distance restraints provided, demonstrating the value of experimental data in driving protein structure prediction. The noise-tolerant framework for integrating data in protein structure prediction presented here opens a path to accurate characterization of protein structures from in-cell data.

Authors

  • Kolja Stahl
    Robotics and Biology Laboratory, Technische Universität Berlin, Berlin, Germany.
  • Andrea Graziadei
    Chair of Bioanalytics, Technische Universität Berlin, Berlin, Germany.
  • Therese Dau
    Technische Universität Berlin, Chair of Bioanalytics, Berlin, Germany.
  • Oliver Brock
    Robotics and Biology Laboratory, Technische Universität Berlin, 10587 Berlin, Germany. Electronic address: oliver.brock@tu-berlin.de.
  • Juri Rappsilber
    Wellcome Trust Centre for Cell Biology, School of Biological Sciences, University of Edinburgh, Edinburgh EH9 3BF, United Kingdom Bill.Earnshaw@ed.ac.uk Juri.Rappsilber@ed.ac.uk.