Prediction of acute toxicity of organic contaminants to fish: Model development and a novel approach to identify reactive substructures.

Journal: Journal of hazardous materials
Published Date:

Abstract

In this study, count-based Morgan fingerprints (CMF) were employed to represent the fundamental chemical structures of contaminants, and a neural network model (R² = 0.76) was developed to predict acute fish toxicity (AFT) of organic compounds. Models based on CMF consistently outperformed those based on binary Morgan fingerprints (BMF), likely due to the latter's inefficiency in describing homologous structures. The similarity of CMF was calculated using an improved method based on Tanimoto distance, which was used for calculation of dataset partition and application domain. The similarity-based dataset partitioning method ensured structural diversity within the training set and improved performance on the validation set, demonstrating its potential for toxicological structure analysis and priority screening. Toxic substructures identified by Shapley additive explanation (SHAP) method were substituted benzenes, long carbon chains, unsaturated carbons and halogen atoms. By incorporating K and monitoring shifts in feature importance, the influence of substructures on AFT was further delineated, revealing their roles in facilitating exposure (e.g.: long carbon chains) and reactive toxicity (e.g.: methyl). Additionally, we compared the toxicity of similar substructures and the same substructure in different chemical environments as well. To address SHAP's insensitivity to low-variance features, this study introduced a novel metric termed the toxicity index (TI), designed to pinpoint substructures that are present in minimal quantities yet potentially exhibit high toxicity. With TI, we identified several important substructures, such as parathion and polycyclic substituents. Finally, prevalent toxic substructures and potential highly toxic substances were identified in two external datasets.

Authors

  • Shangyu Li
    School of Environmental Science and Engineering, Tianjin University, Tianjin 300072, China.
  • Mingming Zhang
    Department of Gastroenterology, Qilu Hospital of Shandong University, Jinan, Shandong, China.
  • Peizhe Sun
    School of Environmental Science and Engineering, Tianjin University, Tianjin 300072, China. Electronic address: sunpeizhe@tju.edu.cn.