A 1D CNN-based emulator of CMAQ: Predicting NO2 concentration over the most populated urban regions in Texas

Mahsa Payami, Yunsoo Choi, A. K. Salman, Seyedali Mousavinezhad, Jincheol Park, A. Pouyaei
{"title":"A 1D CNN-based emulator of CMAQ: Predicting NO2 concentration over the most populated urban regions in Texas","authors":"Mahsa Payami, Yunsoo Choi, A. K. Salman, Seyedali Mousavinezhad, Jincheol Park, A. Pouyaei","doi":"10.1175/aies-d-23-0055.1","DOIUrl":null,"url":null,"abstract":"\nIn this study, we developed an emulator of the Community Multiscale Air Quality (CMAQ) model by employing a 1-dimensional Convolutional Neural Network (CNN) algorithm to predict hourly surface nitrogen dioxide (NO2) concentrations over the most densely populated urban regions in Texas. The inputs for the emulator were the same as those for the CMAQ model, which includes emission, meteorology, and land use land cover data. We trained the model over June, July, and August (JJA) of 2011 and 2014 and then tested it on JJA of 2017, achieving an Index of Agreement (IOA) of 0.95 and a correlation of 0.90. We also employed temporal 3-fold cross-validation to evaluate the model’s performance, ensuring the robustness and generalizability of the results. To gain deeper insights and understand the factors influencing the model’s surface NO2 predictions, we conducted a Shapley Additive Explanations analysis. The results revealed solar radiation reaching the surface, Planetary Boundary Layer height, and NOx (NO + NO2) emissions are key variables driving the model’s predictions. These findings highlight the emulator’s ability to capture the individual impact of each variable on the model’s NO2 predictions. Furthermore, our emulator outperformed the CMAQ model in terms of computational efficiency, being more than 900 times faster in predicting NO2 concentrations, enabling the rapid assessment of various pollution management scenarios. This work offers a valuable resource for air pollution mitigation efforts, not just in Texas, but with appropriate regional data training, its utility could be extended to other regions and pollutants as well.","PeriodicalId":94369,"journal":{"name":"Artificial intelligence for the earth systems","volume":"3 2","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Artificial intelligence for the earth systems","FirstCategoryId":"0","ListUrlMain":"https://doi.org/10.1175/aies-d-23-0055.1","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

In this study, we developed an emulator of the Community Multiscale Air Quality (CMAQ) model by employing a 1-dimensional Convolutional Neural Network (CNN) algorithm to predict hourly surface nitrogen dioxide (NO2) concentrations over the most densely populated urban regions in Texas. The inputs for the emulator were the same as those for the CMAQ model, which includes emission, meteorology, and land use land cover data. We trained the model over June, July, and August (JJA) of 2011 and 2014 and then tested it on JJA of 2017, achieving an Index of Agreement (IOA) of 0.95 and a correlation of 0.90. We also employed temporal 3-fold cross-validation to evaluate the model’s performance, ensuring the robustness and generalizability of the results. To gain deeper insights and understand the factors influencing the model’s surface NO2 predictions, we conducted a Shapley Additive Explanations analysis. The results revealed solar radiation reaching the surface, Planetary Boundary Layer height, and NOx (NO + NO2) emissions are key variables driving the model’s predictions. These findings highlight the emulator’s ability to capture the individual impact of each variable on the model’s NO2 predictions. Furthermore, our emulator outperformed the CMAQ model in terms of computational efficiency, being more than 900 times faster in predicting NO2 concentrations, enabling the rapid assessment of various pollution management scenarios. This work offers a valuable resource for air pollution mitigation efforts, not just in Texas, but with appropriate regional data training, its utility could be extended to other regions and pollutants as well.
基于一维 CNN 的 CMAQ 仿真器:预测得克萨斯州人口最密集城市地区的二氧化氮浓度
在本研究中,我们采用一维卷积神经网络(CNN)算法开发了社区多尺度空气质量(CMAQ)模型的模拟器,用于预测德克萨斯州人口最稠密的城市地区每小时的地表二氧化氮(NO2)浓度。模拟器的输入与 CMAQ 模型的输入相同,其中包括排放、气象和土地利用土地覆盖数据。我们在 2011 年和 2014 年的 6 月、7 月和 8 月(JJA)对模型进行了训练,然后在 2017 年的 JJA 上对其进行了测试,获得了 0.95 的一致指数(IOA)和 0.90 的相关性。我们还采用了时态三重交叉验证来评估模型的性能,确保结果的稳健性和普适性。为了深入了解和理解影响模型地表二氧化氮预测结果的因素,我们进行了夏普利加法解释分析。结果显示,到达地表的太阳辐射、行星边界层高度和氮氧化物(NO + NO2)排放是驱动模型预测的关键变量。这些发现凸显了模拟器捕捉每个变量对模型二氧化氮预测的单独影响的能力。此外,我们的模拟器在计算效率方面优于 CMAQ 模型,在预测二氧化氮浓度方面比 CMAQ 模型快 900 多倍,从而能够快速评估各种污染管理方案。这项工作不仅为德克萨斯州的空气污染缓解工作提供了宝贵的资源,而且通过适当的区域数据培训,其实用性还可以扩展到其他地区和污染物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信