SMRFR: A global multilayer soil moisture dataset generated using Random Forest from multi-source data.

Journal: Scientific data
Published Date:

Abstract

Accurate and continuous monitoring of soil moisture (SM) is crucial for a wide range of applications in agriculture, hydrology, and climate modelling. In this study, we present a novel machine learning (ML) based framework for generating a continuously updated, multilayer global SM dataset: SMRFR (Soil Moisture via Random Forest Regression). Leveraging publicly available reanalysis and remote sensing data, SMRFR provides daily SM estimates at five soil layers (0-5, 5-10, 10-30, 30-50 and 50-100 cm) with a spatial resolution of 9 km, covering the period from 2000 to 2023. Evaluation results demonstrate that SMRFR effectively captures both spatial and temporal SM variability. It also exhibits strong generalization capacity, successfully transferring knowledge across continents and accurately capturing transient and seasonal SM dynamics following rainfall events. SMRFR achieved an unbiased root mean square error of 0.0339 m/m on the validation set. Our novel SM dataset offers a basis and valuable reference for agricultural, hydrological, and ecological research, enabling improved analysis and modelling of SM dynamics at regional to global scales.

Authors

  • Yuhan Liu
    School of Basic Medical Sciences, Fujian Medical University, Fuzhou, China.
  • Yuanyuan Zha
    State Key Laboratory of Water Resources Engineering and Management, Wuhan University, Wuhan, 430072, China. zhayuan87@whu.edu.cn.
  • Gulin Ran
    State Key Laboratory of Water Resources Engineering and Management, Wuhan University, Wuhan, 430072, China.
  • Yonggen Zhang
    Institute of Surface-Earth System Science, School of Earth System Science, Tianjin University, Tianjin, 300072, China.
  • Liangsheng Shi
    State Key Laboratory of Water Resources Engineering And Management, Wuhan University, Wuhan, Hubei, China.

Keywords

No keywords available for this article.