知识引导的机器学习揭示了大气硝酸盐气体到颗粒转化的关键驱动因素

IF 14 1区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Bo Xu , Haofei Yu , Zongbo Shi , Jinxing Liu , Yuting Wei , Zhongcheng Zhang , Yanqi Huangfu , Han Xu , Yue Li , Linlin Zhang , Yinchang Feng , Guoliang Shi
{"title":"知识引导的机器学习揭示了大气硝酸盐气体到颗粒转化的关键驱动因素","authors":"Bo Xu ,&nbsp;Haofei Yu ,&nbsp;Zongbo Shi ,&nbsp;Jinxing Liu ,&nbsp;Yuting Wei ,&nbsp;Zhongcheng Zhang ,&nbsp;Yanqi Huangfu ,&nbsp;Han Xu ,&nbsp;Yue Li ,&nbsp;Linlin Zhang ,&nbsp;Yinchang Feng ,&nbsp;Guoliang Shi","doi":"10.1016/j.ese.2023.100333","DOIUrl":null,"url":null,"abstract":"<div><p>Particulate nitrate, a key component of fine particles, forms through the intricate gas-to-particle conversion process. This process is regulated by the gas-to-particle conversion coefficient of nitrate (ε(NO<sub>3</sub><sup>−</sup>)). The mechanism between ε(NO<sub>3</sub><sup>−</sup>) and its drivers is highly complex and nonlinear, and can be characterized by machine learning methods. However, conventional machine learning often yields results that lack clear physical meaning and may even contradict established physical/chemical mechanisms due to the influence of ambient factors. It urgently needs an alternative approach that possesses transparent physical interpretations and provides deeper insights into the impact of ε(NO<sub>3</sub><sup>−</sup>). Here we introduce a supervised machine learning approach—the multilevel nested random forest guided by theory approaches. Our approach robustly identifies NH<sub>4</sub><sup>+</sup>, SO<sub>4</sub><sup>2−</sup>, and temperature as pivotal drivers for ε(NO<sub>3</sub><sup>−</sup>). Notably, substantial disparities exist between the outcomes of traditional random forest analysis and the anticipated actual results. Furthermore, our approach underscores the significance of NH<sub>4</sub><sup>+</sup> during both daytime (30%) and nighttime (40%) periods, while appropriately downplaying the influence of some less relevant drivers in comparison to conventional random forest analysis. This research underscores the transformative potential of integrating domain knowledge with machine learning in atmospheric studies.</p></div>","PeriodicalId":34434,"journal":{"name":"Environmental Science and Ecotechnology","volume":"19 ","pages":"Article 100333"},"PeriodicalIF":14.0000,"publicationDate":"2023-10-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2666498423000984/pdfft?md5=674a9d66ca6ffce1552e9af0a9839128&pid=1-s2.0-S2666498423000984-main.pdf","citationCount":"1","resultStr":"{\"title\":\"Knowledge-guided machine learning reveals pivotal drivers for gas-to-particle conversion of atmospheric nitrate\",\"authors\":\"Bo Xu ,&nbsp;Haofei Yu ,&nbsp;Zongbo Shi ,&nbsp;Jinxing Liu ,&nbsp;Yuting Wei ,&nbsp;Zhongcheng Zhang ,&nbsp;Yanqi Huangfu ,&nbsp;Han Xu ,&nbsp;Yue Li ,&nbsp;Linlin Zhang ,&nbsp;Yinchang Feng ,&nbsp;Guoliang Shi\",\"doi\":\"10.1016/j.ese.2023.100333\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Particulate nitrate, a key component of fine particles, forms through the intricate gas-to-particle conversion process. This process is regulated by the gas-to-particle conversion coefficient of nitrate (ε(NO<sub>3</sub><sup>−</sup>)). The mechanism between ε(NO<sub>3</sub><sup>−</sup>) and its drivers is highly complex and nonlinear, and can be characterized by machine learning methods. However, conventional machine learning often yields results that lack clear physical meaning and may even contradict established physical/chemical mechanisms due to the influence of ambient factors. It urgently needs an alternative approach that possesses transparent physical interpretations and provides deeper insights into the impact of ε(NO<sub>3</sub><sup>−</sup>). Here we introduce a supervised machine learning approach—the multilevel nested random forest guided by theory approaches. Our approach robustly identifies NH<sub>4</sub><sup>+</sup>, SO<sub>4</sub><sup>2−</sup>, and temperature as pivotal drivers for ε(NO<sub>3</sub><sup>−</sup>). Notably, substantial disparities exist between the outcomes of traditional random forest analysis and the anticipated actual results. Furthermore, our approach underscores the significance of NH<sub>4</sub><sup>+</sup> during both daytime (30%) and nighttime (40%) periods, while appropriately downplaying the influence of some less relevant drivers in comparison to conventional random forest analysis. This research underscores the transformative potential of integrating domain knowledge with machine learning in atmospheric studies.</p></div>\",\"PeriodicalId\":34434,\"journal\":{\"name\":\"Environmental Science and Ecotechnology\",\"volume\":\"19 \",\"pages\":\"Article 100333\"},\"PeriodicalIF\":14.0000,\"publicationDate\":\"2023-10-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2666498423000984/pdfft?md5=674a9d66ca6ffce1552e9af0a9839128&pid=1-s2.0-S2666498423000984-main.pdf\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Environmental Science and Ecotechnology\",\"FirstCategoryId\":\"93\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2666498423000984\",\"RegionNum\":1,\"RegionCategory\":\"环境科学与生态学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENVIRONMENTAL SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Science and Ecotechnology","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2666498423000984","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 1

摘要

颗粒硝酸盐是细颗粒的关键成分,通过复杂的气体到颗粒的转化过程形成。这一过程受硝态氮气粒转化系数ε(NO3−)的调控。ε(NO3−)及其驱动因素之间的机制是高度复杂和非线性的,可以用机器学习方法来表征。然而,由于环境因素的影响,传统的机器学习通常会产生缺乏明确物理意义的结果,甚至可能与已建立的物理/化学机制相矛盾。它迫切需要一种具有透明物理解释的替代方法,并提供对ε(NO3−)影响的更深入的见解。本文介绍了一种有监督的机器学习方法——基于理论方法的多层嵌套随机森林。我们的方法有力地确定了NH4+、SO42−和温度是ε(NO3−)的关键驱动因素。值得注意的是,传统随机森林分析的结果与预期的实际结果之间存在着巨大的差异。此外,我们的方法强调了NH4+在白天(30%)和夜间(40%)期间的重要性,同时与传统的随机森林分析相比,适当地淡化了一些不太相关的驱动因素的影响。这项研究强调了在大气研究中将领域知识与机器学习相结合的变革潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Knowledge-guided machine learning reveals pivotal drivers for gas-to-particle conversion of atmospheric nitrate

Knowledge-guided machine learning reveals pivotal drivers for gas-to-particle conversion of atmospheric nitrate

Particulate nitrate, a key component of fine particles, forms through the intricate gas-to-particle conversion process. This process is regulated by the gas-to-particle conversion coefficient of nitrate (ε(NO3)). The mechanism between ε(NO3) and its drivers is highly complex and nonlinear, and can be characterized by machine learning methods. However, conventional machine learning often yields results that lack clear physical meaning and may even contradict established physical/chemical mechanisms due to the influence of ambient factors. It urgently needs an alternative approach that possesses transparent physical interpretations and provides deeper insights into the impact of ε(NO3). Here we introduce a supervised machine learning approach—the multilevel nested random forest guided by theory approaches. Our approach robustly identifies NH4+, SO42−, and temperature as pivotal drivers for ε(NO3). Notably, substantial disparities exist between the outcomes of traditional random forest analysis and the anticipated actual results. Furthermore, our approach underscores the significance of NH4+ during both daytime (30%) and nighttime (40%) periods, while appropriately downplaying the influence of some less relevant drivers in comparison to conventional random forest analysis. This research underscores the transformative potential of integrating domain knowledge with machine learning in atmospheric studies.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
20.40
自引率
6.30%
发文量
11
审稿时长
18 days
期刊介绍: Environmental Science & Ecotechnology (ESE) is an international, open-access journal publishing original research in environmental science, engineering, ecotechnology, and related fields. Authors publishing in ESE can immediately, permanently, and freely share their work. They have license options and retain copyright. Published by Elsevier, ESE is co-organized by the Chinese Society for Environmental Sciences, Harbin Institute of Technology, and the Chinese Research Academy of Environmental Sciences, under the supervision of the China Association for Science and Technology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信