Automated model building and protein identification in cryo-EM maps.

Journal: Nature
PMID:

Abstract

Interpreting electron cryo-microscopy (cryo-EM) maps with atomic models requires high levels of expertise and labour-intensive manual intervention in three-dimensional computer graphics programs. Here we present ModelAngelo, a machine-learning approach for automated atomic model building in cryo-EM maps. By combining information from the cryo-EM map with information from protein sequence and structure in a single graph neural network, ModelAngelo builds atomic models for proteins that are of similar quality to those generated by human experts. For nucleotides, ModelAngelo builds backbones with similar accuracy to those built by humans. By using its predicted amino acid probabilities for each residue in hidden Markov model sequence searches, ModelAngelo outperforms human experts in the identification of proteins with unknown sequences. ModelAngelo will therefore remove bottlenecks and increase objectivity in cryo-EM structure determination.

Authors

  • Kiarash Jamali
    MRC Laboratory of Molecular Biology, Cambridge, UK. kjamali@mrc-lmb.cam.ac.uk.
  • Lukas Käll
    Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology and Health, Royal Institute of Technology─KTH, Box 1031, SE-17121 Solna, Sweden.
  • Rui Zhang
    Department of Cardiology, Zhongda Hospital, Medical School of Southeast University, Nanjing, China.
  • Alan Brown
    Blavatnik Institute, Harvard Medical School, Boston, MA, USA.
  • Dari Kimanius
    MRC Laboratory of Molecular Biology, Cambridge, UK. dari@mrc-lmb.cam.ac.uk.
  • Sjors H W Scheres