Air temperature estimation and modeling using data driven techniques based on best subset regression model in Egypt.

Journal: Scientific reports
Published Date:

Abstract

Air temperature plays a critical role in estimating agricultural water requirements, hydrological processes, and the climate change impacts. This study aims to identify the most accurate forecasting model for daily minimum (T) and maximum (T) temperatures in a semi-arid environment. Five machine learning models-linear regression (LR), additive regression (AR), support vector machine (SVM), random subspace (RSS), and M5 pruned (M5P)-were compared for T and T forecasting in Gharbia Governorate, Egypt, using data from 1979 to 2014. The dataset was divided into 75% for training and 25% for testing. Model input combinations were selected based on best subset regression analysis, result shows the best combination was T, T, T, T, T, T, T and T, T, T, T, T, T, T for daily minimum maximum air temperature forecasting, respectively. The M5P model outperformed the other models in predicting both T and T. For T, the M5P model achieved the lowest root mean square error (RMSE) of 2.4881 °C, mean absolute error (MAE) of 1.9515, and relative absolute error (RAE) of 40.4887, alongside the highest Nash-Sutcliffe efficiency (NSE) of 0.8048 and Pearson correlation coefficient (PCC) of 0.8971. In T forecasting, M5P showed a lower RMSE of 2.7696 °C, MAE of 1.9867, RAE of 29.5440, and higher NSE of 0.8720 and R² of 0.8720. These results suggest that M5P is a robust and precise model for temperature forecasting, significantly outperforming LR, AR, RSS, and SVM models. The findings provide valuable insights for improving decision-making in areas such as water resource management, irrigation systems, and agricultural productivity, offering a reliable tool for enhancing operational efficiency and sustainability in semi-arid regions. The Friedman ANOVA and Dunn's test confirm significant differences among temperature forecasting models. Additive Regression overestimates, while Linear Regression and SVM align closely with actual values. Random Subspace and M5P exhibit high variability, with SVM differing significantly. For maximum temperature, Random Subspace and M5P perform similarly, while SVM remains distinct.

Authors

  • Ahmed Elbeltagi
    Agricultural Engineering Department, Faculty of Agriculture, Mansoura University, Mansoura, 35516, Egypt; College of Environmental and Resource Sciences, Zhejiang University, Hangzhou, 310058, China.
  • Dinesh Kumar Vishwakarma
    Biometric Research Laboratory, Department of Information Technology, Delhi Technological University, Bawana Road, Delhi, 110042, India.
  • Okan Mert Katipoğlu
    Department of Civil Engineering, Faculty of Engineering and Architecture, Erzincan Binali Yıldırım University, Erzincan, Turkey. okatipoglu@erzincan.edu.tr.
  • Kallem Sushanth
    Department of Agricultural and Food Engineering, IIT Kharagpur, Kharagpur 721302, West Bengal, India. Electronic address: kallemsushi@gmail.com.
  • Salim Heddam
    Faculty of Science, Agronomy Department, Hydraulics Division, University 20 Août 1955 Skikda, Skikda, Algeri.
  • Bhaskar Pratap Singh
    ANDUAT-Krishi Vigyan Kendra, Haidergarh, Barabanki, Uttar Pradesh, 225124, India.
  • Abhishek Shukla
    Department of Irrigation and Drainage Engineering, College of Technology, G. B. Pant University of Agriculture and Technology, Pantnagar, Uttarakhand, 263145, India.
  • Vinay Kumar Gautam
    School of Natural Resource Management, College of Post Graduate Studies in Agricultural Sciences, Central Agricultural University, Imphal, Umiam, 793103, Meghalaya, India.
  • Chaitanya Baliram Pande
    Institute of Energy Infrastructure, Universiti Tenaga Nasional, Kajang, 43000, Malaysia; New Era and Development in Civil Engineering Research Group, Scientific Research Center, Al-Ayen University, Thi-Qar, Nasiriyah, 64001, Iraq. Electronic address: chaitanay45@gmail.com.
  • Saddam Hussain
    School of Electrical Engineering, University Technology Malaysia, Johor Bahru 81310, Malaysia.
  • Subhankar Ghosh
    CVPR Unit, Indian Statistical Institute, Kolkata, India.
  • Hossein Dehghanisanij
    Agricultural Research, Education and Extension Organization, Agricultural Engineering Research Institute, Post Box 31585-845, Karaj, Alborz, Iran. h.dehghanisanij@areeo.ac.ir.
  • Ali Salem
    Civil Engineering Department, Faculty of Engineering, Minia University, Minia, 61111, Egypt. salem.ali@mik.pte.hu.

Keywords

No keywords available for this article.