Detecting Technical Anomalies in High-Frequency Water-Quality Data Using Artificial Neural Networks.

Journal: Environmental science & technology
Published Date:

Abstract

Anomaly detection (AD) in high-volume environmental data requires one to tackle a series of challenges associated with the typical low frequency of anomalous events, the broad-range of possible anomaly types, and local nonstationary environmental conditions, suggesting the need for flexible statistical methods that are able to cope with unbalanced high-volume data problems. Here, we aimed to detect anomalies caused by technical errors in water-quality (turbidity and conductivity) data collected by automated in situ sensors deployed in contrasting riverine and estuarine environments. We first applied a range of artificial neural networks that differed in both learning method and hyperparameter values, then calibrated models using a Bayesian multiobjective optimization procedure, and selected and evaluated the "best" model for each water-quality variable, environment, and anomaly type. We found that semi-supervised classification was better able to detect sudden spikes, sudden shifts, and small sudden spikes, whereas supervised classification had higher accuracy for predicting long-term anomalies associated with drifts and periods of otherwise unexplained high variability.

Authors

  • Javier Rodriguez-Perez
    Univ. Pau & Pays de l'Adour E2S UPPALaboratoire des Mathématiques et de leurs applications, CNRS, 64600 Anglet, France.
  • Catherine Leigh
    Biosciences and Food Technology Discipline, School of Science, RMIT University, 3000 Bundoora, Australia.
  • Benoit Liquet
    Univ. Pau & Pays de l'Adour E2S UPPALaboratoire des Mathématiques et de leurs applications, CNRS, 64600 Anglet, France.
  • Claire Kermorvant
    Univ. Pau & Pays de l'Adour E2S UPPALaboratoire des Mathématiques et de leurs applications, CNRS, 64600 Anglet, France.
  • Erin Peterson
    Australian Research Council Centre of Excellence for Mathematical and Statistical Frontiers (ACEMS), 4000 Brisbane, Australia.
  • Damien Sous
    Université de Toulon, Aix Marseille Université, CNRS, IRD, Mediterranean Institute of Oceanography (MIO), 83062 La Garde, France.
  • Kerrie Mengersen
    ARC Centre of Excellence for Mathematical & Statistical Frontiers (ACEMS), Queensland University of Technology (QUT), 2 George St, Brisbane QLD 4000, Australia. k.mengersen@qut.edu.au.