MILCDock: Machine Learning Enhanced Consensus Docking for Virtual Screening in Drug Discovery.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Molecular docking tools are regularly used to computationally identify new molecules in virtual screening for drug discovery. However, docking tools suffer from inaccurate scoring functions with widely varying performance on different proteins. To enable more accurate ranking of active over inactive ligands in virtual screening, we created a machine learning consensus docking tool, MILCDock, that uses predictions from five traditional molecular docking tools to predict the probability a ligand binds to a protein. MILCDock was trained and tested on data from both the DUD-E and LIT-PCBA docking datasets and shows improved performance over traditional molecular docking tools and other consensus docking methods on the DUD-E dataset. LIT-PCBA targets proved to be difficult for all methods tested. We also find that DUD-E data, although biased, can be effective in training machine learning tools if care is taken to avoid DUD-E's biases during training.

Authors

  • Connor J Morris
    Department of Physics and Astronomy, Brigham Young University, Provo, UT, USA.
  • Jacob A Stern
    Department of Physics and Astronomy, Brigham Young University, Provo, Utah84602, United States.
  • Brenden Stark
    Department of Physics and Astronomy, Brigham Young University, Provo, Utah84602, United States.
  • Max Christopherson
    Department of Physics and Astronomy, Brigham Young University, Provo, Utah84602, United States.
  • Dennis Della Corte
    Department of Physics and Astronomy, Brigham Young University, Provo, UT, USA. Dennis.DellaCorte@byu.edu.