Incorporating Explicit Water Molecules and Ligand Conformation Stability in Machine-Learning Scoring Functions.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Structure-based drug design is critically dependent on accuracy of molecular docking scoring functions, and there is of significant interest to advance scoring functions with machine learning approaches. In this work, by judiciously expanding the training set, exploring new features related to explicit mediating water molecules as well as ligand conformation stability, and applying extreme gradient boosting (XGBoost) with Δ-Vina parametrization, we have improved robustness and applicability of machine-learning scoring functions. The new scoring function ΔXGB can not only perform consistently among the top compared to classical scoring functions for the CASF-2016 benchmark but also achieves significantly better prediction accuracy in different types of structures that mimic real docking applications.

Authors

  • Jianing Lu
    Department of Chemistry , New York University , New York , New York 10003 , United States.
  • Xuben Hou
    Department of Chemistry , New York University , New York , New York 10003 , United States.
  • Cheng Wang
    Department of Pathology, Dalhousie University, Halifax, NS, Canada.
  • Yingkai Zhang
    Department of Chemistry , New York University , New York , New York 10003 , United States.