Understanding the Driving Forces of Summer PM1 Composition in Seoul, Korea, with Explainable Machine Learning

Qihua Hu, Jihye Moon and Hwajin Kim*, 
{"title":"Understanding the Driving Forces of Summer PM1 Composition in Seoul, Korea, with Explainable Machine Learning","authors":"Qihua Hu,&nbsp;Jihye Moon and Hwajin Kim*,&nbsp;","doi":"10.1021/acsestair.3c0011610.1021/acsestair.3c00116","DOIUrl":null,"url":null,"abstract":"<p >This study leverages explainable machine learning, specifically XGBoost models with Shapley Additive Explanations (SHAP), to explore the chemical properties of atmospheric aerosols in Seoul, Korea, during the summer of 2019. Focusing on non-refractory particulate matter (NR-PM<sub>1</sub>) properties measured by high-resolution time-of-flight aerosol mass spectrometry (HR-ToF-AMS), the research extends to organic aerosol (OA) sources identified via positive matrix factorization of high-resolution MS data. The models achieved good predictive accuracy (<i>R</i><sup>2</sup> &gt; 0.90) for all species concentrations, except for hydrocarbon-like OA (HOA) due to frequent concentration fluctuations. The model outcomes aligned well with those previously achieved using conventional methods (chemical transport model and correlational analysis), confirming that relative humidity is associated with nocturnal nitrate concentration and photochemistry associated with sulfate concentration in the summertime in Seoul. Importantly, the models revealed mostly nonlinear relationships between atmospheric factors, such as temperature and particulate matter (PM) components, thereby deepening the understanding of formation processes. Notably, different potential formation mechanisms were discerned for more oxidized oxygenated OA (MO-OOA) and oxidized primary OA (OPOA). For MO-OOA, SHAP analysis showed a plateau in SHAP values at an O<sub><i>x</i></sub> concentration of 0.085 ppm, which suggested potential fragmentation from further oxidation and agreed with previous chamber experiments. Conversely, the lack of a plateau in the O<sub><i>x</i></sub> values for OPOA implied potential ongoing oxidation, suggesting a higher and longer atmospheric oxidation potential. This approach offers rapid and potential insights into complex atmospheric aerosol formation processes. It is essential to acknowledge that SHAP values do not establish causality, and knowledge of the underlying physical and chemical processes was required to conclude valid and comprehensive interpretations of the ML results.</p><p >This study represents a pioneering effort in applying explainable machine learning techniques to HR-ToF-AMS data, achieving rapid results yet providing potential insights of OA formation mechanisms, such as the oxidation turning point for MO-OOA and OPOA.</p>","PeriodicalId":100014,"journal":{"name":"ACS ES&T Air","volume":"1 9","pages":"960–972 960–972"},"PeriodicalIF":0.0000,"publicationDate":"2024-07-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.acs.org/doi/epdf/10.1021/acsestair.3c00116","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS ES&T Air","FirstCategoryId":"1085","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acsestair.3c00116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This study leverages explainable machine learning, specifically XGBoost models with Shapley Additive Explanations (SHAP), to explore the chemical properties of atmospheric aerosols in Seoul, Korea, during the summer of 2019. Focusing on non-refractory particulate matter (NR-PM1) properties measured by high-resolution time-of-flight aerosol mass spectrometry (HR-ToF-AMS), the research extends to organic aerosol (OA) sources identified via positive matrix factorization of high-resolution MS data. The models achieved good predictive accuracy (R2 > 0.90) for all species concentrations, except for hydrocarbon-like OA (HOA) due to frequent concentration fluctuations. The model outcomes aligned well with those previously achieved using conventional methods (chemical transport model and correlational analysis), confirming that relative humidity is associated with nocturnal nitrate concentration and photochemistry associated with sulfate concentration in the summertime in Seoul. Importantly, the models revealed mostly nonlinear relationships between atmospheric factors, such as temperature and particulate matter (PM) components, thereby deepening the understanding of formation processes. Notably, different potential formation mechanisms were discerned for more oxidized oxygenated OA (MO-OOA) and oxidized primary OA (OPOA). For MO-OOA, SHAP analysis showed a plateau in SHAP values at an Ox concentration of 0.085 ppm, which suggested potential fragmentation from further oxidation and agreed with previous chamber experiments. Conversely, the lack of a plateau in the Ox values for OPOA implied potential ongoing oxidation, suggesting a higher and longer atmospheric oxidation potential. This approach offers rapid and potential insights into complex atmospheric aerosol formation processes. It is essential to acknowledge that SHAP values do not establish causality, and knowledge of the underlying physical and chemical processes was required to conclude valid and comprehensive interpretations of the ML results.

This study represents a pioneering effort in applying explainable machine learning techniques to HR-ToF-AMS data, achieving rapid results yet providing potential insights of OA formation mechanisms, such as the oxidation turning point for MO-OOA and OPOA.

Abstract Image

利用可解释机器学习了解韩国首尔夏季 PM1 构成的驱动因素
本研究利用可解释的机器学习,特别是带有夏普利相加解释(SHAP)的 XGBoost 模型,探索 2019 年夏季韩国首尔大气气溶胶的化学特性。研究重点是通过高分辨率飞行时间气溶胶质谱(HR-ToF-AMS)测量的非难降解颗粒物(NR-PM1)特性,并扩展到通过高分辨率质谱数据的正矩阵因式分解确定的有机气溶胶(OA)来源。模型对所有物种的浓度都达到了良好的预测精度(R2 > 0.90),但碳氢化合物类有机气溶胶(HOA)除外,因为其浓度波动频繁。模型结果与之前使用传统方法(化学传输模型和相关分析)得出的结果非常吻合,证实相对湿度与首尔夏季的夜间硝酸盐浓度相关,光化学与硫酸盐浓度相关。重要的是,模型揭示了温度和颗粒物(PM)成分等大气因素之间的大部分非线性关系,从而加深了对形成过程的理解。值得注意的是,对于更氧化的含氧 OA(MO-OOA)和氧化的原生 OA(OPOA),发现了不同的潜在形成机制。对于 MO-OOA,SHAP 分析表明,当氧化物浓度为 0.085 ppm 时,SHAP 值会达到一个高点,这表明进一步氧化可能会导致碎裂,这与之前的室实验结果一致。相反,OPOA 的 Ox 值没有出现高原,这意味着可能正在发生氧化,表明大气中的氧化潜能更高、时间更长。这种方法可以快速、潜在地了解复杂的大气气溶胶形成过程。必须承认的是,SHAP 值并不能确定因果关系,要对 ML 结果做出有效而全面的解释,还需要了解基本的物理和化学过程。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信