DeepBindBC: A practical deep learning method for identifying native-like protein-ligand complexes in virtual screening.

Journal: Methods (San Diego, Calif.)
Published Date:

Abstract

Identifying native-like protein-ligand complexes (PLCs) from an abundance of docking decoys is critical for large-scale virtual drug screening in early-stage drug discovery lead searching efforts. Providing reliable prediction is still a challenge for most current affinity predicting models because of a lack of non-binding data during model training, lost critical physical-chemical features, and difficulties in learning abstract information with limited neural layers. In this work, we proposed a deep learning model, DeepBindBC, for classifying putative ligands as binding or non-binding. Our model incorporates information on non-binding interactions, making it more suitable for real applications. ResNet model architecture and more detailed atom type representation guarantee implicit features can be learned more accurately. Here, we show that DeepBindBC outperforms Autodock Vina, Pafnucy, and DLSCORE for three DUD.E testing sets. Moreover, DeepBindBC identified a novel human pancreatic α-amylase binder validated by a fluorescence spectral experiment (K = 1.0 × 10 M). Furthermore, DeepBindBC can be used as a core component of a hybrid virtual screening pipeline that incorporating many other complementary methods, such as DFCNN, Autodock Vina docking, and pocket molecular dynamics simulation. Additionally, an online web server based on the model is available at http://cbblab.siat.ac.cn/DeepBindBC/index.php for the user's convenience. Our model and the web server provide alternative tools in the early steps of drug discovery by providing accurate identification of native-like PLCs.

Authors

  • Haiping Zhang
    Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Key Laboratory of Oilseeds processing, Ministry of Agriculture, Oil Crops and Lipids Process Technology National & Local Joint Engineering Laboratory, Hubei Key Laboratory of Lipid Chemistry and Nutrition, Wuhan 430062, China.
  • Tingting Zhang
    Department of Environmental Science and Engineering, College of Chemical Engineering, Beijing University of Chemical Technology, Beijing 100029, China. Electronic address: zhangtt@mail.buct.edu.cn.
  • Konda Mani Saravanan
    Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Center for High Performance Computing, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, China.
  • Linbu Liao
    Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong Province 518055, PR China; College of Software Technology, Zhejiang University, Zhejiang Province 315048, PR China.
  • Hao Wu
    Zhejiang Institute of Tianjin University (Shaoxing), Shaoxing, China.
  • Haishan Zhang
    Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Center for High Performance Computing, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, China.
  • Huiling Zhang
    Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Center for High Performance Computing, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, China.
  • Yi Pan
    Department of Neurosis and Psychosomatic Diseases, Huzhou Third Municipal Hospital, The Affiliated Hospital of Huzhou University, Huzhou, Zhejiang, China.
  • Xuli Wu
    School of Medicine, Shenzhen University, Shenzhen, Guangdong Province 518060, PR China. Electronic address: wxl@szu.edu.cn.
  • Yanjie Wei
    Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Center for High Performance Computing, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong, China.