iDCNNPred: an interpretable deep learning model for virtual screening and identification of PI3Ka inhibitors against triple-negative breast cancer.

Journal: Molecular diversity
Published Date:

Abstract

Triple-negative breast cancer (TNBC) lacks estrogen, progesterone, and HER2 expression, accounting for 15-20% of breast cancer cases. It is challenging due to low therapeutic response, heterogeneity, and aggressiveness. The PI3Ka isoform is a promising therapeutic target, often hyperactivated in TNBC, contributing to uncontrolled growth and cancer cell formation. We have proposed an interpretable deep convolutional neural network prediction (iDCNNPred) system using 2D molecular images to classify bioactivity and identify potential PI3Ka inhibitors. We built Custom-DCNN models and pre-trained models such as AlexNet, SqueezeNet, and VGG19 by using the Bayesian optimization algorithm, and found that our Custom-DCNN model performed better than a pre-trained model with lower complexity and memory usage. All top-performed models were screened with the Maybridge Chemical library to find predictive hit molecules. The screened molecules were further evaluated for protein-ligand interaction with molecular docking and finally 12 promising hits were shortlisted for biological validation using in-vitro PI3K inhibition studies. After biological evaluation, 4 potent molecules with different structural moieties were identified, and these molecules present new starting scaffolds for further improvement in terms of their potency and selectivity as PI3K inhibitors with the help of medicinal chemistry efforts. Furthermore, we also showed the significance of the interpretation and visualization of the model's predictions by the Grad-CAM technique, enhancing the robustness, transparency, and interpretability of the model's predictions. The data and script files and prediction run of models used for this study to reproduce the experiment are available in the GitHub repository at https://github.com/ravishankar1307/iDCNNPred.git .

Authors

  • Ravishankar Jaiswal
    Biochemistry and Structural Biology Division, CSIR-Central Drug Research Institute, Sector 10, Jankipuram Extension, Sitapur Road, Lucknow, 226031, India.
  • Girdhar Bhati
    Biochemistry and Structural Biology Division, CSIR-Central Drug Research Institute, Sector 10, Jankipuram Extension, Sitapur Road, Lucknow 226031, India; Academy of Scientific and Innovative Research (AcSIR), Ghaziabad 201002, India.
  • Shakil Ahmed
    International Center for Chemical and Biological Sciences, H.E.J. Research Institute of Chemistry, University of Karachi, Karachi, 75270, Pakistan.
  • Mohammad Imran Siddiqi
    Molecular and Structural Biology Division, CSIR-Central Drug Research Institute, Lucknow 226031, India. mi_siddiqi@cdri.res.in and Academy of Scientific and Innovative Research (AcSIR), CSIR-Central Drug Research Institute, Campus, Lucknow 226031, India.