Machine learning-based evolution of water quality prediction model: An integrated robust framework for comparative application on periodic return and jitter data.
Journal:
Environmental pollution (Barking, Essex : 1987)
Published Date:
Feb 9, 2025
Abstract
Accurate water quality prediction is paramount for the sustainable management of surface water resources. Current deep learning models face challenges in reliably forecasting water quality due to the non-stationarity of environmental conditions and the intricate interactions among various environmental factors. This study introduces a novel, multi-level coupled machine learning framework that integrates data denoising, feature selection, and Long Short-Term Memory (LSTM) networks to enhance predictive accuracy. The findings demonstrate that the LSTM model incorporates data denoising pre-processing, capturing non-stationary water quality patterns more effectively than the baseline model, enhancing prediction performance (R increased by 1.01%). The most adept model with wavelet transform exhibited superior adaptability and predictability, achieving a modest but statistically significant increase in R values of 0.81% and 0.51% relative to incorporate moving average and complete ensemble empirical mode decomposition with adaptive noise techniques, respectively. The integrated models varied in their suitability for time series characterized by different patterns of variability (stability vs. instability, periodicity vs. non-periodicity). We conducted multi-step ahead predictions (t+1 and t+3 days) and employed two training configurations (80-20% and 70-30% splits) for dissolved oxygen and the permanganate index across four monitoring stations within the world's largest long-distance inter-basin water diversion project, to assess the reliability and robustness of the proposed water quality prediction models under varying conditions. The integration of data denoising techniques with LSTM networks substantially improves the prediction of dynamic water quality indices in complex environmental settings. Future research should explore the scalability of this framework across different geographical and climatic conditions to further validate its effectiveness and utility in global water resource management.