Machine learning-based prediction of ambient CO and CH concentrations with high temporal resolution in Seoul metropolitan area.

Journal: Environmental pollution (Barking, Essex : 1987)
Published Date:

Abstract

Machine learning has the potential to support the growing need for high-resolution greenhouse gas monitoring in urban and industrial environments, where deploying extensive sensor networks is often limited by cost and operational challenges. This study presents a novel approach for estimating greenhouse gas (GHG) concentrations using routinely collected air quality and meteorological data from existing monitoring stations. Focusing on the Seoul metropolitan area in the Republic of Korea, we developed and evaluated three machine learning models - Random Forest, Long Short-Term Memory (LSTM), and an ensemble learning approach - to predict CO and CH concentrations without relying on additional GHG monitoring equipment. Among these, the ensemble learning model outperformed the individual models, consistently achieving lower error metrics, even in data-limited scenarios. Feature importance analysis identifies NO, CO, O, and temperature as key predictors of CO and CH level variations, highlighting the influence of combustion-related pollutants and photochemical processes. Cross-validation results confirm the model's out-of-sample capabilities; however, local factors, such as traffic density, industrial activities, and meteorology, can affect performance. Consequently, model retraining or transfer learning may be required when applying the model to new locations with comparable emission profiles or atmospheric conditions. These findings emphasize the importance of localized context in model application while also demonstrating the broader applicability of the approach. By utilizing data already available through urban monitoring networks, this study offers a scalable and cost-effective strategy to support high-resolution GHG monitoring and inform targeted climate policies in complex urban-industrial regions.

Authors

  • Seongjun Park
    Department of Materials Science and Engineering, Korea Advanced Institute of Science and Technology (KAIST), Daejeon, 34141, Republic of Korea.
  • Kwang-Joo Moon
    Climate Change Research Division, Climate Change and Carbon Research Department, National Institute of Environmental Research, Incheon, Republic of Korea. Electronic address: iamiyan@korea.kr.
  • Hyo-Jin Eom
    Climate Change Research Division, Climate Change and Carbon Research Department, National Institute of Environmental Research, Incheon, Republic of Korea. Electronic address: hjeom2017@korea.kr.
  • Seung-Muk Yi
    Department of Environmental Health Sciences, Graduate School of Public Health, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea; Institute of Health and Environment, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea.
  • Youngkwon Kim
    Department of Environmental Health Sciences, Graduate School of Public Health, Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea.
  • Moonkyung Kim
    Department of Environmental Health Sciences, Graduate School of Public Health, Seoul National University, Seoul, Republic of Korea. Electronic address: strikingirl@snu.ac.kr.
  • Donghyun Rim
    Department of Architectural Engineering, Pennsylvania State University, University Park, PA, USA. Electronic address: dxr51@psu.edu.
  • Young Su Lee
    Department of Energy and Environmental Engineering, Soonchunhyang University, Soonchunhyang-ro, Sinchang-myeon, Asan-si, Chungcheongnam-do, 31538, Republic of Korea.