Improving the construction and prediction strategy of the Air Quality Health Index (AQHI) using machine learning: A case study in Guangzhou, China.

Journal: Ecotoxicology and environmental safety
PMID:

Abstract

Effectively capturing the risk of air pollution and informing residents is vital to public health. The widely used Air Quality Index (AQI) has been criticized for failing to accurately represent the non-threshold linear relationship between air pollution and health outcomes. Although the Air Quality Health Index (AQHI) was developed to address these limitations, it lacks comprehensive construction criteria. This work proposed a novel construction and prediction strategy of AQHI using machine learning methods. Our RF-Alasso-QGC method integrated Random Forest (RF), Adaptive Lasso (Alasso), and Quantile-based G-Computation (QGC) for effective pollutant selection and AQHI construction. The RF-Alasso method excluded CO, while identified PM, PM, NO, SO, and O as major contributors to mortality. The QGC method controlled the additive and synergistic effects among these air pollutants. Compared to the Standard-AQHI, the new RF-Alasso-QGC-AQHI demonstrated a stronger correlation with health outcomes, with an interquartile (IQR) increase associated with a 1.80 % (1.44 %, 2.17 %) increase in total mortality, and the best goodness of fit. Additionally, the hybrid Auto Regressive Moving Average-Long Short Term Memory (ARIMA-LSTM) successfully forecast the new AQHI, achieving a coefficient of determination (R²) of 0.961. The work demonstrated that the improved AQHI construction and prediction strategy more efficiently communicate and provide early warnings of the health risks of multiple air pollutants.

Authors

  • Lei Zhang
    Division of Gastroenterology, Union Hospital, Tongji Medical College Medical College, Huazhong University of Science and Technology, Wuhan, China.
  • Yuanyuan Chen
    Center for Radiation Oncology, Affiliated Hangzhou Cancer Hospital, Zhejiang University School of Medicine, Hangzhou 310001, China.
  • Hang Dong
    Centre for Medical Informatics, Usher Institute of Population Health Sciences and Informatics, University of Edinburgh, Edinburgh, United Kingdom; Health Data Research UK, London, United Kingdom. Electronic address: hang.dong@ed.ac.uk.
  • Di Wu
    University of Melbourne, Melbourne, VIC 3010 Australia.
  • Sili Chen
    Department of Preventive Medicine, School of Public Health, Guangzhou Medical University, Xinzao, Panyu District, Guangzhou 511436, China.
  • Xin Li
    Veterinary Diagnostic Center, Shanghai Animal Disease Control Center, Shanghai, China.
  • Boheng Liang
    Guangzhou Center for Disease Control and Prevention, Guangzhou 510440, China. Electronic address: 14927462@qq.com.
  • Qiaoyuan Yang
    Department of Preventive Medicine, School of Public Health, Guangzhou Medical University, Xinzao, Panyu District, Guangzhou 511436, China; Guangdong Provincial Key Laboratory of Major Obstetric Diseases, Guangdong Provincial Clinical Research Center for Obstetrics and Gynecology, The Third Affiliated Hospital of Guangzhou Medical University, Guangzhou, China. Electronic address: qiaoyuan_yang@gzhmu.edu.cn.