Development and application of machine learning models for prediction of soil available cadmium based on soil properties and climate features.

Journal: Environmental pollution (Barking, Essex : 1987)
PMID:

Abstract

Identifying the key influencing factors in soil available cadmium (Cd) is crucial for preventing the Cd accumulation in the food chain. However, current experimental methods and traditional prediction models for assessing available Cd are time-consuming and ineffective. In this study, machine learning (ML) models were developed to investigate the intricate interactions among soil properties, climate features, and available Cd, aiming to identify the key influencing factors. The optimal model was obtained through a combination of stratified sampling, Bayesian optimization, and 10-fold cross-validation. It was further explained through the utilization of permutation feature importance, 2D partial dependence plot, and 3D interaction plot. The findings revealed that pH, surface pressure, sensible heat net flux and organic matter content significantly influenced the Cd accumulation in the soil. By utilizing historical soil surveys and climate change data from China, this study predicted the spatial distribution trend of available Cd in the Chinese region, highlighting the primary areas with heightened Cd activity. These areas were primarily located in the eastern, southern, central, and northeastern China. This study introduces a novel methodology for comprehending the process of available Cd accumulation in soil. Furthermore, it provides recommendations and directions for the remediation and control of soil Cd pollution.

Authors

  • Zhihui Yang
    Institute of Artificial Intelligence, School of Computer Science, Wuhan University, China. Electronic address: zhy@whu.edu.cn.
  • Hui Xia
    Key Laboratory of Environmental Medicine and Engineering of Ministry of Education, School of Public Health, Southeast University, Nanjing 210009, China.
  • Ziyun Guo
    Institute of Environmental Science and Engineering, School of Metallurgy and Environment, Central South University, 410083, Changsha, China.
  • Yanyan Xie
    Department of Gastrointestinal Surgery, Hernia Center, West China Hospital, Sichuan University, Chengdu, China.
  • Qi Liao
    Department of Preventative Medicine, Zhejiang Provincial Key Laboratory of Pathophysiology, Medical School of Ningbo University, Ningbo, China.
  • Weichun Yang
    Institute of Environmental Science and Engineering, School of Metallurgy and Environment, Central South University, 410083, Changsha, China; Chinese National Engineering Research Center for Control & Treatment of Heavy Metal Pollution, 410083, Changsha, China.
  • Qingzhu Li
    School of Metallurgy and Environment, Central South University, Changsha, 410083, China; Chinese National Engineering Research Center for Control & Treatment of Heavy Metal Pollution, Changsha, 410083, China.
  • ChunHua Dong
    Soil and Fertilizer Institute of Hunan Province, 410125, Changsha, China.
  • Mengying Si
    Institute of Environmental Science and Engineering, School of Metallurgy and Environment, Central South University, 410083, Changsha, China; Chinese National Engineering Research Center for Control & Treatment of Heavy Metal Pollution, 410083, Changsha, China. Electronic address: simysmile@csu.edu.cn.