Physically-constrained evapotranspiration models with machine learning parameterization outperform pure machine learning: Critical role of domain knowledge.

Journal: PloS one
Published Date:

Abstract

Physics-informed machine learning techniques have emerged to tackle challenges inherent in pure machine learning (ML) approaches. One such technique, the hybrid approach, has been introduced to estimate terrestrial evapotranspiration (ET), a crucial variable linking water, energy, and carbon cycles. A key advantage of these hybrid ET models is their improved performance, particularly under extreme conditions, compared to ET estimates relying solely on ML. However, the mechanisms driving their improved performance are not well understood. To address this gap, we developed six hybrid approaches based on different physical formulations of ET and compared them with a pure ML model. All models employed the random forest algorithm and were trained on daily-scale ET observations, in-situ meteorological data and satellite remote sensing. We found a strong correlation (r = 0.93) between the sensitivity of ET estimates to machine-learned parameters and model error (root-mean-square error; RMSE), indicating that reduced sensitivity minimizes error propagation and improves performance. Notably, the most accurate hybrid model (RMSE = 17.8 W m-2 in energy unit) utilized a novel empirical parameter, which is relatively stable due to land-atmosphere equilibrium, outperforming both the pure ML model and hybrid models requiring conventional parameters (e.g., surface conductance). These results imply that conventional parameterizations may require reevaluated to effectively integrate physical models with machine learning, as conventional choices may not be optimal for this new, hybrid, paradigm. This study underscores the critical role of domain knowledge in setting up hybrid models, potentially guiding future hybrid model developments beyond ET estimation.

Authors

  • Yeonuk Kim
    Institute for Resources, Environment and Sustainability, University of British Columbia, Vancouver, Canada.
  • Monica Garcia
    Estación Experimental de Zonas Áridas. Consejo Superior de Investigaciones Científicas (EEZA-CSIC), Almería, Spain.
  • T Andrew Black
    Faculty of Land and Food Systems, University of British Columbia, Vancouver, Canada.
  • Mark S Johnson
    Biochemistry, Faculty of Science and Engineering, Åbo Akademi University, Turku, Finland.