Knowledge-Based Methods To Train and Optimize Virtual Screening Ensembles.

Journal: Journal of chemical information and modeling
Published Date:

Abstract

Ensemble docking can be a successful virtual screening technique that addresses the innate conformational heterogeneity of macromolecular drug targets. Yet, lacking a method to identify a subset of conformational states that effectively segregates active and inactive small molecules, ensemble docking may result in the recommendation of a large number of false positives. Here, three knowledge-based methods that construct structural ensembles for virtual screening are presented. Each method selects ensembles by optimizing an objective function calculated using the receiver operating characteristic (ROC) curve: either the area under the ROC curve (AUC) or a ROC enrichment factor (EF). As the number of receptor conformations, N, becomes large, the methods differ in their asymptotic scaling. Given a set of small molecules with known activities and a collection of target conformations, the most resource intense method is guaranteed to find the optimal ensemble but scales as O(2(N)). A recursive approximation to the optimal solution scales as O(N(2)), and a more severe approximation leads to a faster method that scales linearly, O(N). The techniques are generally applicable to any system, and we demonstrate their effectiveness on the androgen nuclear hormone receptor (AR), cyclin-dependent kinase 2 (CDK2), and the peroxisome proliferator-activated receptor δ (PPAR-δ) drug targets. Conformations that consisted of a crystal structure and molecular dynamics simulation cluster centroids were used to form AR and CDK2 ensembles. Multiple available crystal structures were used to form PPAR-δ ensembles. For each target, we show that the three methods perform similarly to one another on both the training and test sets.

Authors

  • Robert V Swift
    Department of Chemistry and Biochemistry, University of California, San Diego , La Jolla, California 92093-0340, United States.
  • Siti A Jusoh
    Faculty of Pharmacy, Universiti Teknologi MARA , 42300 Bandar Puncak Alam, Malaysia.
  • Tavina L Offutt
    Department of Chemistry & Biochemistry and the National Biomedical Computation Resource, University of California, San Diego , La Jolla, California 92093, United States.
  • Eric S Li
    Department of Chemistry and Biochemistry, University of California, San Diego , La Jolla, California 92093-0340, United States.
  • Rommie E Amaro
    Department of Chemistry & Biochemistry and the National Biomedical Computation Resource, University of California, San Diego , La Jolla, California 92093, United States.