Interpretable machine learning reveals the importance of geography and landscape arrangement for surface water quality across China.

Journal: Water research
Published Date:

Abstract

Elucidating the influence of land use patterns on surface water quality is crucial for effective watershed management. Despite numerous studies in individual watersheds, factors influencing water quality in diverse geographical environments are less understood due to data and methodological constraints in large-scale studies. This study employs Interpretable Machine Learning (IML) to explore the drivers of water quality variations across 234 watersheds in China. Results reveal that urban land is the primary source of nitrogen pollution, while rural residences contribute substantially to phosphorus pollution. Water bodies are key sinks for both nutrients. Climate and land use compositions show substantial variations across watersheds with distinct geographical locations. These geography-related factors together contributed 82 %-89 % relative importance to water quality variations across China, implicating the dominant role of geography in shaping water quality. Additionally, the spatial arrangements of source-sink landscapes exhibit greater variations within the same geographic zone, whose impact on water quality is also inevitable. This highlights the potential to enhance water quality via optimizing landscape spatial arrangements given current land use composition and production routines that have been adapted to geographical conditions. Our study demonstrates the utility of IML in discerning key factors affecting water quality in large-scale assessments, offering valuable insights for targeted watershed management strategies.

Authors

  • Kerong Huo
    Hubei Provincial Engineering Research Center of Non-Point Source Pollution Control, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan 430077, China; University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory for Environment and Disaster Monitoring and Evaluation of Hubei, Jianghan Plain-Honghu Lake Station for Wetland Ecosystem Research, Wuhan 430077, China.
  • Wangzheng Shen
    Hubei Provincial Engineering Research Center of Non-Point Source Pollution Control, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan 430077, China; University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory for Environment and Disaster Monitoring and Evaluation of Hubei, Jianghan Plain-Honghu Lake Station for Wetland Ecosystem Research, Wuhan 430077, China.
  • Junchong Wei
    Hubei Provincial Engineering Research Center of Non-Point Source Pollution Control, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan 430077, China; University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory for Environment and Disaster Monitoring and Evaluation of Hubei, Jianghan Plain-Honghu Lake Station for Wetland Ecosystem Research, Wuhan 430077, China.
  • Liang Zhang
  • Qingyu Feng
    Research Center for Eco-Environmental Sciences, Chinese Academy of Sciences, Beijing 100085, China.
  • Yanhua Zhuang
    Hubei Provincial Engineering Research Center of Non-Point Source Pollution Control, Innovation Academy for Precision Measurement Science and Technology, Chinese Academy of Sciences, Wuhan 430077, China; University of Chinese Academy of Sciences, Beijing 100049, China; Key Laboratory for Environment and Disaster Monitoring and Evaluation of Hubei, Jianghan Plain-Honghu Lake Station for Wetland Ecosystem Research, Wuhan 430077, China.
  • Sisi Li
    Department of Gastroenterology, Shanghai First People's Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai, People's Republic of China.