Explainable AI analysis for smog rating prediction.

Journal: Scientific reports
Published Date:

Abstract

Smog poses a direct threat to human health and the environment. Addressing this issue requires understanding how smog is formed. While major contributors include industries, fossil fuels, crop burning, and ammonia from fertilizers, vehicles play a significant role. Individually, a vehicle's contribution to smog may be small, but collectively, the vast number of vehicles has a substantial impact. Manually assessing the contribution of each vehicle to smog is impractical. However, advancements in machine learning make it possible to quantify this contribution. By creating a dataset with features such as vehicle model, year, fuel consumption (city), and fuel type, a predictive model can classify vehicles based on their smog impact, rating them on a scale from 1 (poor) to 8 (excellent). This study proposes a novel approach using Random Forest and Explainable Boosting Classifier models, along with SMOTE (Synthetic Minority Oversampling Technique), to predict the smog contribution of individual vehicles. The results outperform previous studies, with the proposed model achieving an accuracy of 86%. Key performance metrics include a Mean Squared Error of 0.2269, R-Squared (R) of 0.9624, Mean Absolute Error of 0.2104, Explained Variance Score of 0.9625, and a Max Error of 4.3500. These results incorporate explainable AI techniques, using both agnostic and specific models, to provide clear and actionable insights. This work represents a significant step forward, as the dataset was last updated only five months ago, underscoring the timeliness and relevance of the research.

Authors

  • Yazeed Yasin Ghadi
    Department of Computer Science and Software Engineering, Al Ain University, Abu Dhabi, UAE.
  • Sheikh Muhammad Saqib
    Department of Computing and Information Technology, Gomal University, Dera Ismail Khan, 29050, Pakistan. saqibsheikh4@gu.edu.pk.
  • Tehseen Mazhar
    Department of Computer Science, Virtual University of Pakistan, Lahore, Punjab, Pakistan.
  • Ahmad Almogren
    Chia of Pervasive and Mobile Computing, College of Computer and Information Sciences, King Saud University, Riyadh, 11543, Saudi Arabia.
  • Wajahat Waheed
    Department of Electrical and Computer Engineering, Purdue University, Indiana, 46323, USA.
  • Ayman Altameem
    Department of Natural and Engineering Sciences, College of Applied Studies and Community Services, King Saud University, 11543, Riyadh, Saudi Arabia.
  • Habib Hamam
    School of Electrical Engineering, Department of Electrical and Electronic Engineering Science, University of Johannesburg, Johannesburg, South Africa.