Improving Predictions with a Multitask Convolutional Siamese Network.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

The lead optimization phase of drug discovery refines an initial hit molecule for desired properties, especially potency. Synthesis and experimental testing of the small perturbations during this refinement can be quite costly and time-consuming. Relative binding free energy (RBFE, also referred to as ) methods allow the estimation of binding free energy changes after small changes to a ligand scaffold. Here, we propose and evaluate a Siamese convolutional neural network (CNN) for the prediction of RBFE between two bound ligands. We show that our multitask loss is able to improve on a previous state-of-the-art Siamese network for RBFE prediction via increased regularization of the latent space. The Siamese network architecture is well suited to the prediction of RBFE in comparison to a standard CNN trained on the same data (Pearson's R of 0.553 and 0.5, respectively). When evaluated on a left-out protein family, our Siamese CNN shows variability in its RBFE predictive performance depending on the protein family being evaluated (Pearson's R ranging from -0.44 to 0.97). RBFE prediction performance can be improved during generalization by injecting only a few examples (few-shot learning) from the evaluation data set during model training.

Authors

  • Andrew T McNutt
    Department of Medicine, Division of Nephrology and Hypertension, Indiana University School of Medicine, Indianapolis, Indiana, USA.
  • David Ryan Koes
    Department of Computational and Systems Biology, School of Medicine, University of Pittsburgh, Suite 3064, Biomedical Science Tower 3 (BST3), 3501 Fifth Avenue, Pittsburgh, PA, 15260, USA. dkoes@pitt.edu.