MEDICASCY: A Machine Learning Approach for Predicting Small-Molecule Drug Side Effects, Indications, Efficacy, and Modes of Action.

Journal: Molecular pharmaceutics
Published Date:

Abstract

To improve the drug discovery yield, a method which is implemented at the beginning of drug discovery that accurately predicts drug side effects, indications, efficacy, and mode of action based solely on the input of the drug's chemical structure is needed. In contrast, extant predictive methods do not comprehensively address these aspects of drug discovery and rely on features derived from extensive, often unavailable experimental information for novel molecules. To address these issues, we developed MEDICASCY, a multilabel-based boosted random forest machine learning method that only requires the small molecule's chemical structure for the drug side effect, indication, efficacy, and probable mode of action target predictions; however, it has comparable or even significantly better performance than existing approaches requiring far more information. In retrospective benchmarking on high confidence predictions, MEDICASCY shows about 78% precision and recall for predicting at least one severe side effect and 72% precision drug efficacy. Experimental validation of MEDICASCY's efficacy predictions on novel molecules shows close to 80% precision for the inhibition of growth in ovarian, breast, and prostate cancer cell lines. Thus, MEDICASCY should improve the success rate for new drug approval. A web service for academic users is available at http://pwp.gatech.edu/cssb/MEDICASCY.

Authors

  • Hongyi Zhou
    Department of Urology, The Affiliated Wuxi People's Hospital of Nanjing Medical University, Wuxi People's Hospital, Wuxi Medical Center, Nanjing Medical University, Wuxi, China.
  • Hongnan Cao
    Center for the Study of Systems Biology, School of Biological Sciences, Georgia Institute of Technology, 950 Atlantic Drive, N.W., Atlanta, Georgia 30332, United States.
  • Lilya Matyunina
    School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332-0230, United States.
  • Madelyn Shelby
    School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332-0230, United States.
  • Lauren Cassels
    School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia 30332-0230, United States.
  • John F McDonald
    School of Biological Sciences, Georgia Institute of Technology, Atlanta, Georgia, United States of America.
  • Jeffrey Skolnick
    School of Biology, Georgia Institute of Technology, Atlanta, GA 30332, USA.