Deep Learning Algorithm Based on Molecular Fingerprint for Prediction of Drug-Induced Liver Injury.

Journal: Toxicology
Published Date:

Abstract

Drug-induced liver injury (DILI) is one the rare adverse drug reaction (ADR) and multifactorial endpoints. Current preclinical animal models struggle to anticipate it, and in silico methods have emerged as a way with significant potential for doing so. In this study, a high-quality dataset of 1573 compounds was assembled. The 48 classification models, which depended on six different molecular fingerprints, were built via deep neural network (DNN) and seven machine learning algorithms. Comparing the results of the DNN and machine learning models, the optional performing model was found as the one developed based on the DNN with ECFP_6 as input, which achieved the area under the receiver operating characteristic curve (AUC) of 0.713, balanced accuracy (BA) of 0.680, and F1 of 0.753. In addition, we used the SHapley Additive exPlanations (SHAP) algorithm to interpret the models, identified the crucial structural fragments related to DILI risk, and selected the top ten substructures with the highest contribution rankings to serve as warning indicators for subsequent drug hepatotoxicity screening studies. The study demonstrates that the DNN models developed based on molecular fingerprints can be a trustworthy and efficient tool for determining the risk of DILI during the pre-development of novel medications.

Authors

  • Qiong Yang
    Institute of Modern Physics, Chinese Academy of Science, Lanzhou 730000, China.
  • Shuwei Zhang
    State Key Laboratory of Fine Chemicals, Dalian University of Technology, Dalian, Liaoning 116024, China. Electronic address: zswei@dlut.edu.cn.
  • Yan Li
    Interdisciplinary Research Center for Biology and Chemistry, Liaoning Normal University, Dalian, China.