Is Multitask Deep Learning Practical for Pharma?

Journal: Journal of chemical information and modeling
PMID:

Abstract

Multitask deep learning has emerged as a powerful tool for computational drug discovery. However, despite a number of preliminary studies, multitask deep networks have yet to be widely deployed in the pharmaceutical and biotech industries. This lack of acceptance stems from both software difficulties and lack of understanding of the robustness of multitask deep networks. Our work aims to resolve both of these barriers to adoption. We introduce a high-quality open-source implementation of multitask deep networks as part of the DeepChem open-source platform. Our implementation enables simple python scripts to construct, fit, and evaluate sophisticated deep models. We use our implementation to analyze the performance of multitask deep networks and related deep models on four collections of pharmaceutical data (three of which have not previously been analyzed in the literature). We split these data sets into train/valid/test using time and neighbor splits to test multitask deep learning performance under challenging conditions. Our results demonstrate that multitask deep networks are surprisingly robust and can offer strong improvement over random forests. Our analysis and open-source implementation in DeepChem provide an argument that multitask deep networks are ready for widespread use in commercial drug discovery.

Authors

  • Bharath Ramsundar
    Department of Computer Science , Stanford University , Stanford , CA 94305 , USA.
  • Bowen Liu
    Department of Physics, Shanghai University of Electric Power, Shanghai 200090, China.
  • Zhenqin Wu
    Department of Chemistry , Stanford University , Stanford , CA 94305 , USA . Email: pande@stanford.edu.
  • Andreas Verras
    Merck & Co., Inc. , Kenilworth, New Jersey 07033, United States.
  • Matthew Tudor
    Chemistry Capabilities and Screening, Merck & Co., Inc. , 770 Sumneytown Pike, West Point, Pennsylvania 19846, United States.
  • Robert P Sheridan
  • Vijay Pande
    Department of Chemistry , Stanford University , Stanford , CA 94305 , USA . Email: pande@stanford.edu.