Machine Learning on DNA-Encoded Libraries: A New Paradigm for Hit Finding.

Journal: Journal of medicinal chemistry
PMID:

Abstract

DNA-encoded small molecule libraries (DELs) have enabled discovery of novel inhibitors for many distinct protein targets of therapeutic value. We demonstrate a new approach applying machine learning to DEL selection data by identifying active molecules from large libraries of commercial and easily synthesizable compounds. We train models using only DEL selection data and apply automated or automatable filters to the predictions. We perform a large prospective study (∼2000 compounds) across three diverse protein targets: sEH (a hydrolase), ERα (a nuclear receptor), and c-KIT (a kinase). The approach is effective, with an overall hit rate of ∼30% at 30 μM and discovery of potent compounds (IC < 10 nM) for every target. The system makes useful predictions even for molecules dissimilar to the original DEL, and the compounds identified are diverse, predominantly drug-like, and different from known ligands. This work demonstrates a powerful new approach to hit-finding.

Authors

  • Kevin McCloskey
    Google Inc., 1600 Amphitheatre Pkwy, Mountain View, CA, 94043, USA.
  • Eric A Sigel
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Steven Kearnes
    Stanford University, 318 Campus Dr. S296, Stanford, CA, 94305, USA. kearnes@stanford.edu.
  • Ling Xue
  • Xia Tian
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Dennis Moccia
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Diana Gikunju
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Sana Bazzaz
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Betty Chan
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Matthew A Clark
    X-Chem, Waltham, Massachusetts 02453, United States.
  • John W Cuozzo
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Marie-Aude Guié
    X-Chem, Waltham, Massachusetts 02453, United States.
  • John P Guilinger
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Christelle Huguet
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Christopher D Hupp
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Anthony D Keefe
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Christopher J Mulhern
    X-Chem, Waltham, Massachusetts 02453, United States.
  • Ying Zhang
    Department of Nephrology, Nanchong Central Hospital Affiliated to North Sichuan Medical College, Nanchong, China.
  • Patrick Riley
    Google Inc., 1600 Amphitheatre Pkwy, Mountain View, CA, 94043, USA.