Artificial intelligence-enabled virtual screening of ultra-large chemical libraries with deep docking.

Journal: Nature protocols
Published Date:

Abstract

With the recent explosion of chemical libraries beyond a billion molecules, more efficient virtual screening approaches are needed. The Deep Docking (DD) platform enables up to 100-fold acceleration of structure-based virtual screening by docking only a subset of a chemical library, iteratively synchronized with a ligand-based prediction of the remaining docking scores. This method results in hundreds- to thousands-fold virtual hit enrichment (without significant loss of potential drug candidates) and hence enables the screening of billion molecule-sized chemical libraries without using extraordinary computational resources. Herein, we present and discuss the generalized DD protocol that has been proven successful in various computer-aided drug discovery (CADD) campaigns and can be applied in conjunction with any conventional docking program. The protocol encompasses eight consecutive stages: molecular library preparation, receptor preparation, random sampling of a library, ligand preparation, molecular docking, model training, model inference and the residual docking. The standard DD workflow enables iterative application of stages 3-7 with continuous augmentation of the training set, and the number of such iterations can be adjusted by the user. A predefined recall value allows for control of the percentage of top-scoring molecules that are retained by DD and can be adjusted to control the library size reduction. The procedure takes 1-2 weeks (depending on the available resources) and can be completely automated on computing clusters managed by job schedulers. This open-source protocol, at https://github.com/jamesgleave/DD_protocol , can be readily deployed by CADD researchers and can significantly accelerate the effective exploration of ultra-large portions of a chemical space.

Authors

  • Francesco Gentile
    Vancouver Prostate Centre, Department of Urologic Sciences, The University of British Columbia, Vancouver, BC V6H 3Z6, Canada.
  • Jean Charle Yaacoub
    Vancouver Prostate Centre, Department of Urologic Sciences, The University of British Columbia, Vancouver, BC V6H 3Z6, Canada.
  • James Gleave
    Vancouver Prostate Centre, Department of Urologic Sciences, The University of British Columbia, Vancouver, BC V6H 3Z6, Canada.
  • Michael Fernandez
    Data61, CSIRO , 343 Royal Parade, Parkville, Victoria 3052, Australia.
  • Anh-Tien Ton
    Vancouver Prostate Centre, Department of Urologic Sciences, The University of British Columbia, Vancouver, BC, Canada.
  • Fuqiang Ban
    Vancouver Prostate Centre, Department of Urologic Sciences , Faculty of Medicine, University of British Columbia , 2660 Oak Street , Vancouver , British Columbia V6H 3Z6 , Canada.
  • Abraham Stern
    NVIDIA Corporation, Santa Clara, CA 95051, USA.
  • Artem Cherkasov
    Vancouver Prostate Centre, Department of Urologic Sciences , Faculty of Medicine, University of British Columbia , 2660 Oak Street , Vancouver , British Columbia V6H 3Z6 , Canada.