Machine learning identified molecular fragments responsible for infrared emission features of polycyclic aromatic hydrocarbons

Q1 Earth and Planetary Sciences
Zhisen Meng, Yong Zhang, E. Liang, Zhao Wang
{"title":"Machine learning identified molecular fragments responsible for infrared emission features of polycyclic aromatic hydrocarbons","authors":"Zhisen Meng, Yong Zhang, E. Liang, Zhao Wang","doi":"10.1093/mnrasl/slad089","DOIUrl":null,"url":null,"abstract":"\n Machine learning feature importance calculations are used to determine the molecular substructures that are responsible for mid- and far-infrared (IR) emission features of neutral polycyclic aromatic hydrocarbons (PAHs). Using the extended-connectivity fingerprint as a descriptor of chemical structure, a random forest model is trained on the spectra of 14 124 PAHs to evaluate the importance of 10 632 molecular fragments for each band within the range of 2.761 to $1172.745\\, \\mu$m. The accuracy of the results is confirmed by comparing them with previously studied unidentified infrared emission (UIE) bands. The results are summarized in two tables available as Supplementary Data, which can be used as a reference for assessing possible UIE carriers. We demonstrate that the tables can be used to explore the relation between the PAH structure and the spectra by discussing about the IR features of nitrogen-containing PAHs and superhydrogenated PAHs.","PeriodicalId":18951,"journal":{"name":"Monthly Notices of the Royal Astronomical Society: Letters","volume":"65 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Monthly Notices of the Royal Astronomical Society: Letters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/mnrasl/slad089","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 0

Abstract

Machine learning feature importance calculations are used to determine the molecular substructures that are responsible for mid- and far-infrared (IR) emission features of neutral polycyclic aromatic hydrocarbons (PAHs). Using the extended-connectivity fingerprint as a descriptor of chemical structure, a random forest model is trained on the spectra of 14 124 PAHs to evaluate the importance of 10 632 molecular fragments for each band within the range of 2.761 to $1172.745\, \mu$m. The accuracy of the results is confirmed by comparing them with previously studied unidentified infrared emission (UIE) bands. The results are summarized in two tables available as Supplementary Data, which can be used as a reference for assessing possible UIE carriers. We demonstrate that the tables can be used to explore the relation between the PAH structure and the spectra by discussing about the IR features of nitrogen-containing PAHs and superhydrogenated PAHs.
机器学习识别了多环芳烃红外发射特征的分子片段
机器学习特征重要性计算用于确定中性多环芳烃(PAHs)的中、远红外(IR)发射特征的分子亚结构。利用扩展连接指纹作为化学结构描述符,在14124个多环芳烃的光谱上训练随机森林模型,在2.761 ~ $1172.745\,\mu$m范围内对每个波段的10632个分子片段进行重要性评估。通过与先前研究的未识别红外发射(UIE)波段进行比较,证实了结果的准确性。结果总结在两个表中,作为补充数据,可作为评估可能的UIE携带者的参考。我们通过讨论含氮多环芳烃和超氢化多环芳烃的红外特征,证明了这些表可以用来探讨多环芳烃的结构与光谱之间的关系。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Monthly Notices of the Royal Astronomical Society: Letters
Monthly Notices of the Royal Astronomical Society: Letters Earth and Planetary Sciences-Space and Planetary Science
CiteScore
8.80
自引率
0.00%
发文量
136
期刊介绍: For papers that merit urgent publication, MNRAS Letters, the online section of Monthly Notices of the Royal Astronomical Society, publishes short, topical and significant research in all fields of astronomy. Letters should be self-contained and describe the results of an original study whose rapid publication might be expected to have a significant influence on the subsequent development of research in the associated subject area. The 5-page limit must be respected. Authors are required to state their reasons for seeking publication in the form of a Letter when submitting their manuscript.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信