{"title":"Machine learning identified molecular fragments responsible for infrared emission features of polycyclic aromatic hydrocarbons","authors":"Zhisen Meng, Yong Zhang, E. Liang, Zhao Wang","doi":"10.1093/mnrasl/slad089","DOIUrl":null,"url":null,"abstract":"\n Machine learning feature importance calculations are used to determine the molecular substructures that are responsible for mid- and far-infrared (IR) emission features of neutral polycyclic aromatic hydrocarbons (PAHs). Using the extended-connectivity fingerprint as a descriptor of chemical structure, a random forest model is trained on the spectra of 14 124 PAHs to evaluate the importance of 10 632 molecular fragments for each band within the range of 2.761 to $1172.745\\, \\mu$m. The accuracy of the results is confirmed by comparing them with previously studied unidentified infrared emission (UIE) bands. The results are summarized in two tables available as Supplementary Data, which can be used as a reference for assessing possible UIE carriers. We demonstrate that the tables can be used to explore the relation between the PAH structure and the spectra by discussing about the IR features of nitrogen-containing PAHs and superhydrogenated PAHs.","PeriodicalId":18951,"journal":{"name":"Monthly Notices of the Royal Astronomical Society: Letters","volume":"65 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-06-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Monthly Notices of the Royal Astronomical Society: Letters","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/mnrasl/slad089","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Earth and Planetary Sciences","Score":null,"Total":0}
引用次数: 0
Abstract
Machine learning feature importance calculations are used to determine the molecular substructures that are responsible for mid- and far-infrared (IR) emission features of neutral polycyclic aromatic hydrocarbons (PAHs). Using the extended-connectivity fingerprint as a descriptor of chemical structure, a random forest model is trained on the spectra of 14 124 PAHs to evaluate the importance of 10 632 molecular fragments for each band within the range of 2.761 to $1172.745\, \mu$m. The accuracy of the results is confirmed by comparing them with previously studied unidentified infrared emission (UIE) bands. The results are summarized in two tables available as Supplementary Data, which can be used as a reference for assessing possible UIE carriers. We demonstrate that the tables can be used to explore the relation between the PAH structure and the spectra by discussing about the IR features of nitrogen-containing PAHs and superhydrogenated PAHs.
期刊介绍:
For papers that merit urgent publication, MNRAS Letters, the online section of Monthly Notices of the Royal Astronomical Society, publishes short, topical and significant research in all fields of astronomy. Letters should be self-contained and describe the results of an original study whose rapid publication might be expected to have a significant influence on the subsequent development of research in the associated subject area. The 5-page limit must be respected. Authors are required to state their reasons for seeking publication in the form of a Letter when submitting their manuscript.