Amortized template matching of molecular conformations from cryoelectron microscopy images using simulation-based inference.

Journal: Proceedings of the National Academy of Sciences of the United States of America
Published Date:

Abstract

Characterizing the conformational ensemble of biomolecular systems is key to understand their functions. Cryoelectron microscopy (cryo-EM) captures two-dimensional snapshots of biomolecular ensembles, giving in principle access to thermodynamics. However, these images are very noisy and show projections of the molecule in unknown orientations, making it very difficult to identify the biomolecule's conformation in each individual image. Here, we introduce cryo-EM simulation-based inference (cryoSBI) to infer the conformations of biomolecules and the uncertainties associated with the inference from individual cryo-EM images. CryoSBI builds on simulation-based inference, a merger of physics-based simulations and probabilistic deep learning, allowing us to use Bayesian inference even when likelihoods are too expensive to calculate. We begin with an ensemble of conformations, templates from experiments, and molecular modeling, serving as structural hypotheses. We train a neural network approximating the Bayesian posterior using simulated images from these templates and then use it to accurately infer the conformation of the biomolecule from each experimental image. Training is only done once on simulations, and after that, it takes just a few milliseconds to make inference on an image, making cryoSBI suitable for arbitrarily large datasets and direct analysis on micrographs. CryoSBI eliminates the need to estimate particle pose and imaging parameters, significantly enhancing the computational speed compared to explicit likelihood methods. Importantly, we obtain interpretable machine learning models by integrating physics-based approaches with deep neural networks, ensuring that our results are transparent and reliable. We illustrate and benchmark cryoSBI on synthetic data and showcase its promise on experimental single-particle cryo-EM data.

Authors

  • Lars Dingeldein
    Institute of Physics, Faculty of Physics, Goethe University Frankfurt, Frankfurt am Main 60438, Germany.
  • David Silva-Sánchez
    Department of Applied Mathematics, Yale University, New Haven, CT 06520.
  • Luke Evans
    Center for Computational Mathematics, Flatiron Institute, New York, NY 10010.
  • Edoardo D'Imprima
    Istituto di Ricovero e Cura a Carattere Scientifico Humanitas Research Hospital, Unit of Correlative Light Emission Microscopy Core, Rozzano, Milan 20089, Italy.
  • Nikolaus Grigorieff
    RNA Therapeutics Institute, University of Massachusetts Chan Medical School, Worcester, MA 01605.
  • Roberto Covino
    Frankfurt Institute for Advanced Studies, Frankfurt am Main 60438, Germany.
  • Pilar Cossio
    Biophysics of Tropical Diseases, Max Planck Tandem Group, University of Antioquia UdeA, Calle 70 No. 52-21, Medellin, Colombia.