IVS2vec: A tool of Inverse Virtual Screening based on word2vec and deep learning techniques.

Journal: Methods (San Diego, Calif.)
Published Date:

Abstract

Inverse Virtual Screening is a powerful technique in the early stage of drug discovery process. This technique can provide important clues for biologically active molecules, which is useful in the following researches of durg discovery. In this work, combining with Word2vec, a natural language processing technique, dense fully connected neural network (DFCNN) algorithm is utilized to build up a prediction model. This model is able to perform a binary classification. Based on the query molecule, the input protein candidates can be classified into two subsets. One set is that potential targets with high possibilities to bind with the query molecule and the other one is that the proteins with low possibilities to bind with the query molecule. This model is named as IVS2vec. IVS2vec also can output a score reflecting binding possibility of the association between a protein and a molecule, which is useful to improve efficiency of research. We applied IVS2vec on several databases related to drug development and shown that our model can detect possible therapeutic targets. In addition, our model can identify targets related to adverse drug reactions which is useful to improve medication safety and repurpose drugs. Moreover, IVS2vec can give a very fast speed to perform prediction jobs. It is suitable for processing a large number of compounds in the chemical databases. We also find that IVS2vec has potential capabilities and outperform other state-of-the-art docking tools such as Autodock vina. In this study, IVS2vec brings many convincing results than Autodock vina in the reverse target searching case of Quercetin.

Authors

  • Haiping Zhang
    Oil Crops Research Institute, Chinese Academy of Agricultural Sciences, Key Laboratory of Oilseeds processing, Ministry of Agriculture, Oil Crops and Lipids Process Technology National & Local Joint Engineering Laboratory, Hubei Key Laboratory of Lipid Chemistry and Nutrition, Wuhan 430062, China.
  • Linbu Liao
    Joint Engineering Research Center for Health Big Data Intelligent Analysis Technology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen, Guangdong Province 518055, PR China; College of Software Technology, Zhejiang University, Zhejiang Province 315048, PR China.
  • Yunting Cai
    Department of Biology, Southern University of Science and Technology, Shenzhen, Guangdong Province 518055, PR China.
  • Yuhui Hu
    Department of Biology, Southern University of Science and Technology, Shenzhen, Guangdong Province 518055, PR China.
  • Hao Wang
    Department of Cardiology, Second Medical Center, Chinese PLA General Hospital, Beijing, China.