A combined drug discovery strategy based on machine learning and molecular docking.

Journal: Chemical biology & drug design
Published Date:

Abstract

Data mining methods based on machine learning play an increasingly important role in drug design and discovery. In the current work, eight machine learning methods including decision trees, k-Nearest neighbor, support vector machines, random forests, extremely randomized trees, AdaBoost, gradient boosting trees, and XGBoost were evaluated comprehensively through a case study of ACC inhibitor data sets. Internal and external data sets were employed for cross-validation of the eight machine learning methods. Results showed that the extremely randomized trees model performed best and was adopted as the first step of virtual screening. Together with structure-based virtual screening in the second step, this combined strategy obtained desirable results. This work indicates that the combination of machine learning methods with traditional structure-based virtual screening can effectively strengthen the ability in finding potential hits from large compound database for a given target.

Authors

  • Yanmin Zhang
    Department of Paediatric Cardiology, Shaanxi Institute for Pediatric Diseases, Affiliate Children's Hospital of Xi'an Jiaotong University, Xi'an, China.
  • Yuchen Wang
    College of Management, University of Massachusetts Boston, Boston, MA, USA.
  • Weineng Zhou
    Laboratory of Molecular Design and Drug Discovery, School of Science, China; Pharmaceutical University, 639 Longmian Avenue, Nanjing, 211198 Jiangsu, China.
  • Yuanrong Fan
    Laboratory of Molecular Design and Drug Discovery, School of Science, China; Pharmaceutical University, 639 Longmian Avenue, Nanjing, 211198 Jiangsu, China.
  • Junnan Zhao
    Laboratory of Molecular Design and Drug Discovery, School of Science, China; Pharmaceutical University, 639 Longmian Avenue, Nanjing, 211198 Jiangsu, China.
  • Lu Zhu
    Laboratory of Molecular Design and Drug Discovery, School of Science, China; Pharmaceutical University, 639 Longmian Avenue, Nanjing, 211198 Jiangsu, China.
  • Shuai Lu
    Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, Nanjing, China.
  • Tao Lu
    Laboratory of Molecular Design and Drug Discovery, School of Science, China Pharmaceutical University, Nanjing, China.
  • Yadong Chen
    Laboratory of Molecular Design and Drug Discovery, School of Science, China; Pharmaceutical University, 639 Longmian Avenue, Nanjing, 211198 Jiangsu, China.
  • Haichun Liu
    Laboratory of Molecular Design and Drug Discovery, School of Science, China; Pharmaceutical University, 639 Longmian Avenue, Nanjing, 211198 Jiangsu, China.