Forecasting air pollutant concentration using a novel spatiotemporal deep learning model based on clustering, feature selection and empirical wavelet transform.

Journal: The Science of the total environment
Published Date:

Abstract

Accurate forecasting of air pollutant concentration is of great importance since it is an essential part of the early warning system. However, it still remains a challenge due to the limited information of emission source and high uncertainties of the dynamic processes. In order to improve the accuracy of air pollutant concentration forecast, this study proposes a novel hybrid model using clustering, feature selection, real-time decomposition by empirical wavelet transform, and deep learning neural network. First, all air pollutant time series are decomposed by empirical wavelet transform based on real-time decomposition, and subsets of output data are constructed by combining corresponding decomposed components. Second, each subset of output data is classified into several clusters by clustering algorithm, and then appropriate inputs are selected by feature selection method. Third, a deep learning-based predictor, which uses three dimensional convolutional neural network and bidirectional long short-term memory neural network, is applied to predict decomposition components of each cluster. Last, air pollutant concentration forecast for each monitoring station is obtained by reconstructing predicted values of all the decomposition components. PM concentration data of Beijing, China is used to validate and test our model. Results show that the proposed model outperforms other models used in this study. In our model, mean absolute percentage error for 1, 6, 10 h ahead PM concentration prediction is 4.03%, 6.87%, and 8.98%, respectively. These outcomes demonstrate that the proposed hybrid model is a powerful tool to provide highly accurate forecast for air pollutant concentration.

Authors

  • Jusong Kim
    Tianjin Key Laboratory of Hazardous Waste Safety Disposal and Recycling Technology, School of Environmental Science and Safety Engineering, Tianjin University of Technology, Tianjin 300384, China; Department of Mathematics, University of Science, Pyongyang 999091, DPR Korea.
  • Xiaoli Wang
    Demonstration Center of Future Product, Beijing Aircraft Technology Research Institute, COMAC, Beijing, China.
  • Chollyong Kang
    Department of Mathematics, University of Science, Pyongyang 999091, DPR Korea.
  • Jinwon Yu
    Tianjin Key Laboratory of Hazardous Waste Safety Disposal and Recycling Technology, School of Environmental Science and Safety Engineering, Tianjin University of Technology, Tianjin 300384, China; Department of Mathematics, University of Science, Pyongyang 999091, DPR Korea.
  • Penghui Li
    Tianjin Key Laboratory of Hazardous Waste Safety Disposal and Recycling Technology, School of Environmental Science and Safety Engineering, Tianjin University of Technology, Tianjin 300384, China. Electronic address: lipenghui406@163.com.