Machine Learning-Driven Dynamic Measurement of Environmental Indicators in Multiple Scenes and Multiple Disturbances.
Journal:
Environmental science & technology
Published Date:
Jul 24, 2025
Abstract
Digital city water management systems require extensive data sensing for various environmental indicators, yet measurement accuracy often falls short under diverse extreme conditions. This study proposes a chemical oxygen demand (COD) measurement method based on ultraviolet-visible spectrum analysis and machine learning (ML), taking into account the removal of interferences, including temperature, pH, turbidity, common anions and cations, as well as COD composition and different water environments. The data collected from the river and wastewater were processed through principal component analysis, and random forest (RF) performed the best among the multiclass models with a mean absolute percentage error (MAPE) of only 6.73% for total COD (TCOD), dissolved COD (SCOD), and particulate COD (PCOD). RF has excellent transferability with an average MAPE of 8.17% for TCOD, PCOD, and COD in another real wastewater and river. Interpretability analysis elucidates the mechanism of PCA downscaling on the model. Techno-economic assessment revealed that this method incurs only 60.9% of the costs of laboratory monitoring and 49.3% of the costs of conventional automatic monitoring stations. Life cycle assessment showed that the introduction of ML can reduce environmental impacts by 31.32%. The study concludes with a discussion of the dynamic feasibility of this approach in future urban water systems.