A Fast and Interpretable Deep Learning Approach for Accurate Electrostatics-Driven p Predictions in Proteins.

Journal: Journal of chemical theory and computation

PMID: 35837736

Abstract

Existing computational methods for estimating p values in proteins rely on theoretical approximations and lengthy computations. In this work, we use a data set of 6 million theoretically determined p shifts to train deep learning models, which are shown to rival the physics-based predictors. These neural networks managed to infer the electrostatic contributions of different chemical groups and learned the importance of solvent exposure and close interactions, including hydrogen bonds. Although trained only using theoretical data, our pKAI+ model displayed the best accuracy in a test set of ∼750 experimental values. Inference times allow speedups of more than 1000× compared to physics-based methods. By combining speed, accuracy, and a reasonable understanding of the underlying physics, our models provide a game-changing solution for fast estimations of macroscopic p values from ensembles of microscopic values as well as for many downstream applications such as molecular docking and constant-pH molecular dynamics simulations.

Authors

Pedro B P S Reis

Machine Learning Research, Bayer A.G., Berlin 13353, Germany.
Marco Bertolini

Medical Physics, Azienda USL-IRCCS di Reggio Emilia, Italy. Electronic address: marco.bertolini@ausl.re.it.
Floriane Montanari

Department of Bioinformatics , Bayer AG , Berlin , Germany . Email: robin.winter@bayer.com.
Walter Rocchia

CONCEPT Lab, Istituto Italiano di Tecnologia, via Morego 30, 16163 Genova, Italy.
Miguel Machuqueiro

BioISI - Instituto de Biosistemas e Ciências Integrativas, Faculdade de Ciências, Universidade de Lisboa, Portugal. Electronic address: machuque@ciencias.ulisboa.pt.
Djork-Arné Clevert

Department of Bioinformatics , Bayer AG , Berlin , Germany . Email: robin.winter@bayer.com.

Keywords

Deep Learning Molecular Docking Simulation Molecular Dynamics Simulation Proteins Static Electricity

External Resources

View on PubMed Access via DOI PubMed (35837736)

A Fast and Interpretable Deep Learning Approach for Accurate Electrostatics-Driven p Predictions in Proteins.

Abstract

Authors

Keywords

External Resources

Popular Topics

Recent Journals