A novel hybrid variable cross layer-based machine learning model improves the accuracy and interpretation of energy intensity prediction of wastewater treatment plant.

Journal: Journal of environmental management
Published Date:

Abstract

Energy intensity (EI) prediction in wastewater treatment plants (WWTPs) suffers from inaccuracy and non-interpretability due to poor data quality, complex mechanisms and various confounding variables. In this study, the novel hybrid variable cross layer-based machine learning (VCL-ML) model was devised, which generates new knowledge with monitoring indicators (e.g., COD, etc.) and then embeds both domain knowledge and monitoring indicators into the ML model. This novel hybrid VCL-ML model achieves a root-mean-square error (RMSE) of 0.021 kW h/m³ with an 8.7% improvement over the conventional ML (Con-ML) model. The Shapley additive explanation demonstrated that domain knowledge features are ranked high and have important interpretable implications for the model, such as capacity utilization (CU), which measures the efficiency of resource use, and total nitrogen remaining rate (TN_rr), which indicates the nitrogen retention in a system. Partially dependent interactions between domain knowledge (e.g., sludge yield) and monitoring indexes (e.g., influent pH) could contribute to the interpretation of reality. By comparing the feature categorization between VCL-ML and Con-ML models, temporal information (e.g., month) and removal information (e.g., TN_rr) played an important role in the model's performance improvement. This result highlights the strong correlation between wastewater treatment plant energy intensity with pollutant removal and temporal information while weakening the contribution of other redundant features. This VCL-ML model improves the predicting accuracy and interpretation of the EI of WWTPs, which can be used in the optimal operation and sustainable management of WWTPs.

Authors

  • Yucheng Li
    Department of Radiation Therapy, Zhejiang Provincial People's Hospital, Affiliated People's Hospital, Hangzhou Medical College, Hangzhou, 310014, Zhejiang, People's Republic of China. liyucheng0402@163.com.
  • Chen Cai
    College of Environmental Science and Engineering, Tongji University, Shanghai, 200092, PR China; Institute of Carbon Neutrality, Tongji University, Shanghai, 200092, PR China. Electronic address: caic@tongji.edu.cn.
  • Erwu Liu
    College of Electronic Information and Engineering, Tongji University, Shanghai, 200092, PR China.
  • Xiaofeng Lin
    Department of Medical Imaging, Sun Yat-sen University Cancer Center, State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangzhou 510060, China.
  • Ying Zhang
    Department of Nephrology, Nanchong Central Hospital Affiliated to North Sichuan Medical College, Nanchong, China.
  • Hongjing Chen
    Fuzhou Water Group Co., Ltd, Fujian, 350001, PR China.
  • Zhongqing Wei
    Fuzhou Water Group Co., Ltd, Fujian, 350001, PR China.
  • Xiangfeng Huang
    College of Environmental Science and Engineering, Tongji University, Shanghai, 200092, PR China; Institute of Carbon Neutrality, Tongji University, Shanghai, 200092, PR China.
  • Ru Guo
    Division of Neurosurgery, The University of British Columbia, Vancouver, Canada. Electronic address: rucheng@student.ubc.ca.
  • Kaiming Peng
    College of Environmental Science and Engineering, Tongji University, Shanghai, 200092, PR China; Institute of Carbon Neutrality, Tongji University, Shanghai, 200092, PR China.
  • Jia Liu
    Department of Colorectal Oncology, Tianjin Medical University Cancer Institute and Hospital, National Clinical Research Center for Cancer, Tianjin's Clinical Research Center for Cancer, Tianjin Key Laboratory of Digestive Cancer, Tianjin, China.