Machine learning on drug-specific data to predict small molecule teratogenicity.

Journal: Reproductive toxicology (Elmsford, N.Y.)
PMID:

Abstract

Pregnant women are an especially vulnerable population, given the sensitivity of a developing fetus to chemical exposures. However, prescribing behavior for the gravid patient is guided on limited human data and conflicting cases of adverse outcomes due to the exclusion of pregnant populations from randomized, controlled trials. These factors increase risk for adverse drug outcomes and reduce quality of care for pregnant populations. Herein, we propose the application of artificial intelligence to systematically predict the teratogenicity of a prescriptible small molecule from information inherent to the drug. Using unsupervised and supervised machine learning, our model probes all small molecules with known structure and teratogenicity data published in research-amenable formats to identify patterns among structural, meta-structural, and in vitro bioactivity data for each drug and its teratogenicity score. With this workflow, we discovered three chemical functionalities that predispose a drug towards increased teratogenicity and two moieties with potentially protective effects. Our models predict three clinically-relevant classes of teratogenicity with AUC = 0.8 and nearly double the predictive accuracy of a blind control for the same task, suggesting successful modeling. We also present extensive barriers to translational research that restrict data-driven studies in pregnancy and therapeutically "orphan" pregnant populations. Collectively, this work represents a first-in-kind platform for the application of computing to study and predict teratogenicity.

Authors

  • Anup P Challa
    Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville 37203, TN, United States; Department of Biomedical Informatics, Harvard Medical School, Boston 02115, MA, United States; National Center for Advancing Translational Sciences, National Institutes of Health, Rockville 20850, MD, United States; Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville 37212, TN, United States. Electronic address: anup.p.challa.1@vumc.org.
  • Andrew L Beam
    Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA.
  • Min Shen
    National Center for Advancing Translational Sciences, National Institutes of Health, Rockville 20850, MD, United States.
  • Tyler Peryea
    National Center for Advancing Translational Sciences (NCATS) , National Institutes of Health , 9800 Medical Center Drive , Rockville , Maryland 20850 , United States.
  • Robert R Lavieri
    Vanderbilt Institute for Clinical and Translational Research, Vanderbilt University Medical Center, Nashville 37203, TN, United States.
  • Ethan S Lippmann
    Department of Chemical and Biomolecular Engineering, Vanderbilt University, Nashville 37212, TN, United States.
  • David M Aronoff
    Department of Medicine, Division of Infectious Disease, Vanderbilt University Medical Center, Nashville, Tennessee.