Accurate and efficient prediction of atmospheric PM, PM, PM, and O concentrations using a customized software package based on a machine-learning algorithm.

Journal: Chemosphere
PMID:

Abstract

Particulate matter (PM) and ozone (O) pollution have been attracting increasing attention recently due to their severe harm to human health. PM and O are secondary pollutants, and there remain significant challenges in accurately and efficiently predicting their concentrations in the atmosphere. In this study, one-year monitoring data of PM, PM, PM, and O concentrations as well as meteorological parameters and concentrations of various precursors (i.e., nitrogen oxides, SO, CO, alkanes, aldehydes, and ketones) are obtained at a monitoring site in central China's Hunan province. The eXtreme Gradient Boosting model is trained and tested to achieve efficient and accurate predictions of the concentrations of the four pollutants. The effects of different datasets, input features, and model parameters on the prediction accuracy are investigated. Principal component analysis is employed to further reduce the dimensions of features, increasing the prediction efficiency. Finally, all model training and prediction processes are incorporated in an executable application using the PyQt5 framework to build a user interface. The customized software supports user-defined modeling. The software can simultaneously predict PM, PM, PM, and O concentrations, making the prediction process convenient.

Authors

  • Le Xie
    Research Institute of Med-X, Shanghai Jiao Tong University, Shanghai, China.
  • Jiawei He
    Department of Critical Care Medicine, Beijing Friendship Hospital, Capital Medical University, Beijing, China.
  • Ruiqi Lei
    College of Chemistry and Chemical Engineering, Central South University, Changsha, 410083, China.
  • Maoqing Fan
    Atmospheric Environment Monitoring department, Changsha Environmental Monitoring Centre of Hunan Province, Changsha, 410001, China.
  • Huimin Huang