Interpretable Machine Learning and Reactomics Assisted Isotopically Labeled FT-ICR-MS for Exploring the Reactivity and Transformation of Natural Organic Matter during Ultraviolet Photolysis
{"title":"Interpretable Machine Learning and Reactomics Assisted Isotopically Labeled FT-ICR-MS for Exploring the Reactivity and Transformation of Natural Organic Matter during Ultraviolet Photolysis","authors":"Dhimas Dwinandha, Mohamed Elsamadony, Rongjun Gao, Qing-Long Fu, Jibao Liu* and Manabu Fujii*, ","doi":"10.1021/acs.est.3c05213","DOIUrl":null,"url":null,"abstract":"<p >Isotopically labeled FT-ICR-MS combined with multiple post-analyses, including interpretable machine learning (IML) and a paired mass distance (PMD) network, was employed to unravel the reactivity and transformation of natural organic matter (NOM) during ultraviolet (UV) irradiation. FT-ICR-MS analysis was used to assign formulas, which were classified on the basis of their molecular compositions and structural categories. Isotope (deuterium, D) labeling was utilized to unequivocally determine the photochemical products and examine the development of OD radical-mediated NOM transformation. With regard to the reactive molecular formulas, CHOS formulas exhibited the highest reactivity (86.5% of precursors disappeared) followed by CHON (53.4%) and CHO (24.6%) formulas. With regard to structural categories, the degree of reactivity decreased in the following order: tannins > condensed aromatics > lignin/CRAMs. The IML algorithm demonstrated that the crucial features governing the reactivity of formulas were the molecular weight, DBE-O, NOSC, and the presence of heteroatoms (i.e., N and S), suggesting that the large and unsaturated compounds containing S and N are more prone to photodegradation. The reactomics approach using the PMD network further indicated that 11 specific molecular formulas in the CHOS and CHO class served as hubs, implying a higher photoreactivity and participation in a range of transformations. The isotope labeling analyses also found that, among the reactions observed, hydroxylation (i.e., +OD) is dominant for lignin/CRAMs and condensed aromatics, and formulas containing ≤10 D atoms were developed. Overall, this study, by adopting rigorous and interpretable techniques, could provide in-depth insights into the molecular-level dynamics of NOM under UV irradiation.</p>","PeriodicalId":36,"journal":{"name":"环境科学与技术","volume":"58 1","pages":"816–825"},"PeriodicalIF":11.3000,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"环境科学与技术","FirstCategoryId":"1","ListUrlMain":"https://pubs.acs.org/doi/10.1021/acs.est.3c05213","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ENVIRONMENTAL","Score":null,"Total":0}
引用次数: 0
Abstract
Isotopically labeled FT-ICR-MS combined with multiple post-analyses, including interpretable machine learning (IML) and a paired mass distance (PMD) network, was employed to unravel the reactivity and transformation of natural organic matter (NOM) during ultraviolet (UV) irradiation. FT-ICR-MS analysis was used to assign formulas, which were classified on the basis of their molecular compositions and structural categories. Isotope (deuterium, D) labeling was utilized to unequivocally determine the photochemical products and examine the development of OD radical-mediated NOM transformation. With regard to the reactive molecular formulas, CHOS formulas exhibited the highest reactivity (86.5% of precursors disappeared) followed by CHON (53.4%) and CHO (24.6%) formulas. With regard to structural categories, the degree of reactivity decreased in the following order: tannins > condensed aromatics > lignin/CRAMs. The IML algorithm demonstrated that the crucial features governing the reactivity of formulas were the molecular weight, DBE-O, NOSC, and the presence of heteroatoms (i.e., N and S), suggesting that the large and unsaturated compounds containing S and N are more prone to photodegradation. The reactomics approach using the PMD network further indicated that 11 specific molecular formulas in the CHOS and CHO class served as hubs, implying a higher photoreactivity and participation in a range of transformations. The isotope labeling analyses also found that, among the reactions observed, hydroxylation (i.e., +OD) is dominant for lignin/CRAMs and condensed aromatics, and formulas containing ≤10 D atoms were developed. Overall, this study, by adopting rigorous and interpretable techniques, could provide in-depth insights into the molecular-level dynamics of NOM under UV irradiation.
采用同位素标记的傅立叶变换化学还原质谱(FT-ICR-MS)结合多种后分析,包括可解释的机器学习(IML)和成对质量距离(PMD)网络,来揭示天然有机物(NOM)在紫外线(UV)照射下的反应性和转化。利用 FT-ICR-MS 分析来确定分子式,并根据分子组成和结构类别对分子式进行分类。利用同位素(氘,D)标记来明确确定光化学产物,并研究 OD 自由基介导的 NOM 转化过程。在反应分子式方面,CHOS 式的反应活性最高(86.5% 的前体消失),其次是 CHON 式(53.4%)和 CHO 式(24.6%)。就结构类别而言,反应程度依次降低:单宁酸;缩合芳香族化合物;木质素/CRAMs。IML 算法表明,影响配方反应性的关键特征是分子量、DBE-O、NOSC 和杂原子(即 N 和 S)的存在,这表明含有 S 和 N 的大型不饱和化合物更容易发生光降解。利用 PMD 网络的反应组学方法进一步表明,CHOS 和 CHO 类中有 11 个特定分子式是枢纽,这意味着它们具有较高的光活性,并参与了一系列转化过程。同位素标记分析还发现,在观察到的反应中,羟基化(即 +OD)在木质素/CRAMs 和缩合芳烃中占主导地位,并形成了含有≤10 个 D 原子的分子式。总之,本研究通过采用严谨和可解释的技术,可以深入了解紫外辐照下 NOM 的分子水平动力学。
期刊介绍:
Environmental Science & Technology (ES&T) is a co-sponsored academic and technical magazine by the Hubei Provincial Environmental Protection Bureau and the Hubei Provincial Academy of Environmental Sciences.
Environmental Science & Technology (ES&T) holds the status of Chinese core journals, scientific papers source journals of China, Chinese Science Citation Database source journals, and Chinese Academic Journal Comprehensive Evaluation Database source journals. This publication focuses on the academic field of environmental protection, featuring articles related to environmental protection and technical advancements.