Prediction of urban surface water quality scenarios using hybrid stacking ensembles machine learning model in Howrah Municipal Corporation, West Bengal.

Journal: Journal of environmental management
Published Date:

Abstract

In the pursuit of understanding surface water quality for sustainable urban management, we created a machine learning modeling framework that utilized Random Forest (RF), Cubist, Extreme Gradient Boosting (XGB), Multivariate Adaptive Regression Splines (MARS), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), and their hybrid stacking ensemble RF (SE-RF), as well as stacking Cubist (SE-Cubist), to predict the distribution of water quality in the Howrah Municipal Corporation (HMC) area in West Bengal, India. Additionally, we employed the ReliefF and Shapley Additive exPlanations (SHAP) methods to elucidate the underlying factors driving water quality. We first estimated the water quality index (WQI) to model seven water quality parameters: total hardness (TH), pH, total dissolved solids (TDS), dissolved oxygen (DO), biochemical oxygen demand (BOD), calcium (Ca), magnesium (Mg). Then six independent factors were utilized (i.e. Precipitation (Pr), Maximum Temperature (Tmax), Minimum Temperature (Tmin), Normalized Difference Turbidity Index (NDTI), Normalized Difference Chlorophyll Index (NDCI), and Total Dissolved Solids (TDS)) for predicting the WQI mapping through the different ML models. This study demonstrated that the SE-Cubist model outperforms other ML models. During the testing phase, it achieved the best modeling results with an R = 0.975, RMSE = 0.351, and MAE = 0.197. The ReliefF and SHAP analyses identified Pr and Tmax as the most significant factors influencing WQI within the study area.

Authors

  • Chiranjit Singha
    Department of Agricultural Engineering, Institute of Agriculture, Visva-Bharati (A Central University), Sriniketan, Birbhum, West Bengal, 731236, India.
  • Ishita Bhattacharjee
    Maulana Abul Kalam Azad University of Technology, Kolkata, India. Electronic address: ishitabhattacharjee09@gmail.com.
  • Satiprasad Sahoo
    Prajukti Research Private Limited, Baruipur, West Bengal, India; International Center for Agricultural Research in the Dry Areas (ICARDA), Maadi, Cairo, Egypt. Electronic address: satispss@gmail.com.
  • Kamal Abdelrahman
    Department of Geology & Geophysics, College of Science, King Saud University, Riyadh, Saudi Arabia. Electronic address: khassanein@ksu.edu.sa.
  • Md Galal Uddin
    School of Engineering, University of Galway, Ireland; Ryan Institute, University of Galway, Ireland; MaREI Research Centre, University of Galway, Ireland; Eco-HydroInformatics Research Group (EHIRG), Civil Engineering, University of Galway, Ireland. Electronic address: mdgalal.uddin@universityofgalway.ie.
  • Mohammed S Fnais
    Department of Geology & Geophysics, College of Science, King Saud University, Riyadh, Saudi Arabia. Electronic address: mfnais@ksu.edu.sa.
  • Ajit Govind
    International Center for Agricultural Research in the Dry Areas (ICARDA), Maadi, Cairo, Egypt. Electronic address: a.govind@cgiar.org.
  • Mohamed Abioui
    Geosciences, Environment and Geomatics Laboratory (GEG), Department of Earth Sciences, Faculty of Sciences, Ibnou Zohr University, Agadir, Morocco; MARE-Marine and Environmental Sciences Centre - Sedimentary Geology Group, Department of Earth Sciences, Faculty of Sciences and Technology, University of Coimbra, Coimbra, Portugal; Laboratory for Sustainable Innovation and Applied Research, Universiapolis-International University of Agadir, Agadir, Morocco. Electronic address: m.abioui@uiz.ac.ma.