Machine learning methods for anomaly classification in wastewater treatment plants.

Journal: Journal of environmental management
Published Date:

Abstract

Modern wastewater treatment plants base their biological processes on advanced control systems which ensure compliance with discharge limits and minimize energy consumption responding to information from on-line probes. The correct readings of probes are particularly crucial for intermittent aeration controllers, which rely on real-time measurements of ammonia and oxygen in biological tanks. These data are also an important resource for developing artificial intelligence algorithms that can identify process or sensor anomalies, thus guiding the choices of plant operators and automatic process controllers. However, using anomaly detection and classification algorithms in real-time wastewater treatment is challenging because of the noisy nature of sensor measurements, the difficulty of obtaining labeled real-plant data, and the complex and interdependent mechanisms that govern biological processes. This work aims at thoroughly exploring the performance of machine learning methods in detecting and classifying the main anomalies in plants operating with intermittent aeration. Using oxygen, ammonia and aeration power measurements from a set of plants in Italy, we perform both binary and multiclass classification, and we compare them through a rigorous validation procedure that includes a test on an unknown dataset, proposing a new evaluation protocol. The classification methods explored are support vector machine, multilayer perceptron, random forest, and two gradient boosting methods (LightGBM and XGBoost). The best performance was achieved using the gradient boosting ensemble algorithms, with up to 96% of anomalies detected and up to 84% and 62% of anomalies classified correctly on the first and second datasets respectively.

Authors

  • Francesca Bellamoli
    University of Trento, Department of Information Engineering and Computer Science, via Sommarive 9, Trento, 38123, Italy; ETC Sustainable Solutions Srl, via dei Palustei 16, Trento, 38121, Italy. Electronic address: francesca.bellamoli@unitn.it.
  • Mattia Di Iorio
    D-3 Srl, via dei Palustei 16, Trento, 38121, Italy.
  • Marco Vian
    ETC Sustainable Solutions Srl, via dei Palustei 16, Trento, 38121, Italy.
  • Farid Melgani
    Department of Information Engineering and Computer Science, University of Trento, Via Sommarive 9, 38123 Trento, Italy.