ToxMPNN: A deep learning model for small molecule toxicity prediction.

Journal: Journal of applied toxicology : JAT
Published Date:

Abstract

Machine learning (ML) has shown a great promise in predicting toxicity of small molecules. However, the availability of data for such predictions is often limited. Because of the unsatisfactory performance of models trained on a single toxicity endpoint, we collected toxic small molecules with multiple toxicity endpoints from previous study. The dataset comprises 27 toxic endpoints categorized into seven toxicity classes, namely, carcinogenicity and mutagenicity, acute oral toxicity, respiratory toxicity, irritation and corrosion, cardiotoxicity, CYP450, and endocrine disruption. In addition, a binary classification Common-Toxicity task was added based on the aforementioned dataset. To improve the performance of the models, we added marketed drugs as negative samples. This study presents a toxicity predictive model, ToxMPNN, based on the message passing neural network (MPNN) architecture, aiming to predict the toxicity of small molecules. The results demonstrate that ToxMPNN outperforms other models in capturing toxic features within the molecular structure, resulting in more precise predictions with the ROC_AUC testing score of 0.886 for the Toxicity_drug dataset. Furthermore, it was observed that adding marketed drugs as negative samples not only improves the predictive performance of the binary classification Common-Toxicity task but also enhances the stability of the model prediction. It shows that the graph-based deep learning (DL) algorithms in this study can be used as a trustworthy and effective tool to assess small molecule toxicity in the development of new drugs.

Authors

  • Yini Zhou
    The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha, China.
  • Chao Ning
    The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha, China.
  • Yijun Tan
    School of Computer and Communication Engineering, Changsha University of Science and Technology, Changsha, China.
  • Yaqi Li
    The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha, China.
  • Jiaxu Wang
    School of Aeronautics and Astronautics, Sichuan University, Chengdu 610065, China.
  • Yuanyuan Shu
    The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha, China.
  • Songping Liang
    The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha, China.
  • Zhonghua Liu
    The National and Local Joint Engineering Laboratory of Animal Peptide Drug Development, College of Life Sciences, Hunan Normal University, Changsha 410081, People's Republic of China. Electronic address: Liuzh@hunnu.edu.cn.
  • Ying Wang
    Key Laboratory of Macromolecular Science of Shaanxi Province, School of Chemistry & Chemical Engineering, Shaanxi Normal University, Xi'an, Shaanxi 710062, China.