Graph neural networks-enhanced relation prediction for ecotoxicology (GRAPE).

Journal: Journal of hazardous materials
PMID:

Abstract

Exposure to toxic chemicals threatens species and ecosystems. This study introduces a novel approach using Graph Neural Networks (GNNs) to integrate aquatic toxicity data, providing an alternative to complement traditional in vivo ecotoxicity testing. This study pioneers the application of GNN in ecotoxicology by formulating the problem as a relation prediction task. GRAPE's key innovation lies in simultaneously modelling 444 aquatic species and 2826 chemicals within a graph, leveraging relations from existing datasets where informative species and chemical features are augmented to make informed predictions. Extensive evaluations demonstrate the superiority of GRAPE over Logistic Regression (LR) and Multi-Layer Perceptron (MLP) models, achieving remarkable improvements of up to a 30% increase in recall values. GRAPE consistently outperforms LR and MLP in predicting novel chemicals and new species. In particular, GRAPE showcases substantial enhancements in recall values, with improvements of ≥ 100% for novel chemicals and up to 13% for new species. Specifically, GRAPE correctly predicts the effects of novel chemicals (104 out of 126) and effects on new species (7 out of 8). Moreover, the study highlights the effectiveness of the proposed chemical features and induced network topology through GNN for accurately predicting metallic (74 out of 86) and organic (612 out of 674) chemicals, showcasing the broad applicability and robustness of the GRAPE model in ecotoxicological investigations. The code/data are provided at https://github.com/csiro-robotics/GRAPE.

Authors

  • Gaurangi Anand
    Environment, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Dutton Park 4102, QLD, Australia.
  • Piotr Koniusz
    Data61, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Black Mountain 2601, ACT, Australia. Electronic address: piotr.koniusz@csiro.au.
  • Anupama Kumar
    Department of Medicine, Division of Hematology and Oncology, University of California San Francisco, San Francisco, CA.
  • Lisa A Golding
    Environment, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Dutton Park 4102, QLD, Australia.
  • Matthew J Morgan
    Environment, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Black Mountain 2601, ACT, Australia.
  • Peyman Moghadam
    Data61, Commonwealth Scientific and Industrial Research Organisation (CSIRO), Pullenvale 4069, QLD, Australia.