Spatial prediction of groundwater salinity in multiple aquifers of the Mekong Delta region using explainable machine learning models.

Journal: Water research
PMID:

Abstract

Groundwater salinization is a prevalent issue in coastal regions, yet accurately predicting and understanding its causal factors remains challenging due to the complexity of the groundwater system. Therefore, this study predicted groundwater salinity in multi-layered aquifers spanning the entire Mekong Delta (MD) region using machine learning (ML) models based on an in situ dataset and using three indicators (Cl, pH, and HCO). We applied nine different decision tree-based models and evaluated their prediction performances. The models were trained using 13 input variables: weather (2), hydrogeological conditions (4), water levels (3), groundwater usage (2), and relative distance from water sources (2). Subsequently, by employing model interpretation techniques, we quantified the significance of factors within the model prediction. Performance evaluations of the ML models demonstrated that the Extra Trees model exhibited superior performance and demonstrated generalization capabilities in predicting Cl concentration, whereas the Bagging and Random Forest models outperformed the other models in predicting pH and HCO concentration. The coefficients of determination were determined to be 0.94, 0.67, and 0.78 for Cl, pH, and HCO, respectively Additionally, the model interpretation effectively identified significant factors that depended on the target variables and aquifers. In particular, salinity indicators and aquifers that were strongly influenced by the artificial usage of groundwater were identified. Therefore, our research, which provides accurate spatial predictions and interpretations of groundwater salinity in the MD, has the potential to establish a foundation for formulating effective groundwater management policies to control groundwater salinization.

Authors

  • Heewon Jeong
    Department of Civil Urban Earth and Environmental Engineering, Ulsan National Institute of Science and Technology (UNIST), UNIST-gil 50, Ulsan, 44919, Republic of Korea.
  • Ather Abbas
    Department of Urban and Environmental Engineering, Ulsan National Institute of Science and Technology (UNIST), 50, UNIST-gil, Eonyang-eup, Ulju-gun, Ulsan, 44919 South Korea.
  • Hyo Gyeom Kim
    Future and Fusion Lab of Architectural, Civil and Environmental Engineering, Korea University, Seoul 02841, South Korea.
  • Hoang Van Hoan
    National Center for Water Resources Planning and Investigation, Sai Dong Ward, Long Bien District, 1000 Hanoi, Vietnam.
  • Pham Van Tuan
    Division for Water Resources Planning and Investigation for the South of Vietnam, An Khanh Ward, Thu Duc City, Hochiminh 71300, Vietnam.
  • Phan Thang Long
    Division for Water Resources Planning and Investigation for the South of Vietnam, An Khanh Ward, Thu Duc City, Hochiminh 71300, Vietnam.
  • Eunhee Lee
    Korea Institute of Geoscience and Mineral Resources, 124 Gwahak-ro, Yuseong-gu, Daejeon 34132, South Korea. Electronic address: eunheelee@kigam.re.kr.
  • Kyung Hwa Cho
    School of Urban and Environmental Engineering, Ulsan National Institute of Science and Technology (UNIST), 50 UNIST-gil, Eonyang-eup, Ulju-gun, Ulsan 689-798, Republic of Korea.