Mapping surface soil organic carbon density of cultivated land using machine learning in Zhengzhou.

Journal: Environmental geochemistry and health
PMID:

Abstract

Research on soil organic carbon (SOC) is crucial for improving soil carbon sinks and achieving the "double-carbon" goal. This study introduces ten auxiliary variables based on the data from a 2021 land quality survey in Zhengzhou and a multi-objective regional geochemical survey. It uses geostatistical ordinary kriging (OK) interpolation, as well as classical machine learning (ML) models, including random forest (RF) and support vector machine (SVM), to map soil organic carbon density (SOCD) in the topsoil layer (0 - 20 cm) of cultivated land. It partitions the sampling data to assess the generalization capability of the machine learning models, with Zhongmu County designated as an independent test set (dataset2) and the remaining data as the training set (dataset1). The three models are trained using dataset1, and the trained machine learning models are directly applied to dataset2 to evaluate and compare their generalization performance. The distribution of SOCD and SOCS in soils of various types and textures is analyzed using the optimal interpolation method. The results indicated that: (1) The average SOC densities predicted by OK interpolation, RF, and SVM are 3.70, 3.74, and 3.63 kg/m, with test set precisions (R) of 0.34, 0.60, and 0.81, respectively. (2) ML achieves a significantly higher predictive precision than traditional OK interpolation. The RF model's precision is 0.21 higher than the SVM model and more precise in estimating carbon stock. (3) When applied to the dataset2, the RF model exhibited superior generalization capabilities (R = 0.52, MSE = 0.32) over the SVM model (R = 0.32, MSE = 0.45). (4) The spatial distribution of surface SOCD in the study area exhibits a decreasing gradient from west to east and from south to north. The total carbon stock in the study area is estimated at approximately 10.76 × 10t. (5) The integration of soil attribute variables, climatic variables, remote sensing data, and machine learning techniques holds significant promise for the high-precision and high-quality mapping of soil organic carbon density (SOCD) in agricultural soils.

Authors

  • Hengliang Guo
    National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou, 450001, China.
  • Jinyang Wang
    Department of Clinical Medicine, Xinjiang Medical University, Urumqi, 830017, China.
  • Dujuan Zhang
    National Supercomputing Center in Zhengzhou, Zhengzhou University, Zhengzhou, 450001, China.
  • Jian Cui
    Department of Thoracic Surgery, Beijing Chuiyangliu Hospital, Beijing, China.
  • Yonghao Yuan
    School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China.
  • Haoming Bao
    School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China.
  • Mengjiao Yang
    School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China.
  • Jiahui Guo
    School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China.
  • Feng Chen
    Department of Integrated Care Management Center, West China Hospital, Sichuan University, Chengdu, Sichuan, China.
  • Wenge Zhou
    School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China.
  • Gang Wu
    State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023, Jiangsu, P. R. China.
  • Yang Guo
    Innovation Research Institute of Combined Acupuncture and Medicine, Shaanxi University of CM, Xianyang 712046, China.
  • Haitao Wei
    Department of Endocrinology, First People's Hospital of Nanning, Nanning 530021, China.
  • Baojin Qiao
    School of Geoscience and Technology, Zhengzhou University, Zhengzhou, 450001, China.
  • Shan Zhao
    Department of Mathematics, University of Alabama, Tuscaloosa, AL 35487-0350, USA.