Rapid identification of soil organic matter level via visible and near-infrared spectroscopy: Effects of two-dimensional correlation coefficient and extreme learning machine.

Journal: The Science of the total environment
Published Date:

Abstract

Accurate estimation of soil organic matter (SOM) is essential in understanding the spatial distribution of SOM to identify areas that need fertilization and the required grade of those fertilizers. Visible and near-infrared spectroscopy is a promising alternative to time consuming and costly conventional soil assessment methods. However, this approach is highly dependent on selecting suitable preprocessing strategies and data mining techniques for regression analysis. In this study, 2D correlation coefficients, including ratio, difference, and normalized difference indices, were introduced to select sensitive spectral parameters. The performance of extreme learning machine (ELM) was evaluated via comparison with that of support vector machine (SVM) for SOM estimation. A total of 257 soil samples were collected from Hubei Province, Central China, with SOM contents and reflectance spectra measured in the laboratory. Five spectral pretreatments, except for the raw spectra, were applied. SVM and ELM models were calibrated on spectral parameters selected by one-dimensional and 2D correlation coefficients and subsequently applied to predict SOM. Results showed that 2D correlation coefficient can effectively highlight the detailed SOM information compared with that of one-dimensional correlation coefficient. The ELM models yielded superior predictability relative to SVM models in all eight established models. The most excellent estimation accuracy was obtained by 2D ratio index and ELM (TRI-ELM) method, with an independent validation R and a ratio of performance to interquartile range of 0.83 and 3.49, respectively. The SOM fertility levels of predicted SOM showed that TRI-ELM method presented the largest similarity to laboratory-measured SOM levels, and misclassified samples were all concentrated within one error level. In summary, our study indicates that the TRI-ELM model is a rapid, inexpensive, and relatively accurate method for identifying SOM fertility level.

Authors

  • Yongsheng Hong
    School of Resource and Environmental Sciences, Wuhan University, Wuhan 430079, China; State Key Laboratory of Soil and Sustainable Agriculture, Chinese Academy of Sciences, Nanjing 210008, China.
  • Songchao Chen
    INRA, Unité InfoSol, 45075 Orléans, France; UMR SAS, INRA, Agrocampus Ouest, 35042 Rennes, France.
  • Yong Zhang
    Outpatient Department of Hepatitis, The Sixth Affiliated People's Hospital of Dalian Medical University, Dalian, Liaoning, China.
  • Yiyun Chen
    School of Resource and Environmental Sciences, Wuhan University, Wuhan 430079, China; State Key Laboratory of Soil and Sustainable Agriculture, Chinese Academy of Sciences, Nanjing 210008, China. Electronic address: chenyy@whu.edu.cn.
  • Lei Yu
    School of Urban and Environmental Sciences, Central China Normal University, Wuhan 430079, China; Key Laboratory for Geographical Process Analysis & Simulation of Hubei Province, Central China Normal University, Wuhan 430079, China.
  • Yanfang Liu
    School of Resource and Environmental Sciences, Wuhan University, Wuhan 430079, China. Electronic address: yfliu610@sina.com.
  • Yaolin Liu
    School of Resource and Environmental Sciences, Wuhan University, Wuhan 430079, China.
  • Hang Cheng
    School of Resource and Environmental Sciences, Wuhan University, Wuhan 430079, China; State Key Laboratory of Soil and Sustainable Agriculture, Chinese Academy of Sciences, Nanjing 210008, China.
  • Yi Liu
    Department of Interventional Therapy, Ningbo No. 2 Hospital, Ningbo, China.

Keywords

No keywords available for this article.