Development of prediction models on the degradation kinetics parameters of antibiotics in aquatic environments with machine learning methods.

IF 3.9 3区 环境科学与生态学 Q1 CHEMISTRY, ANALYTICAL
Meijuan Zhang, Tong Xu, Yueli Lan, Jiansheng Cui, Bo Yao, Mengzhen Hao, Shuangjiang Li
{"title":"Development of prediction models on the degradation kinetics parameters of antibiotics in aquatic environments with machine learning methods.","authors":"Meijuan Zhang, Tong Xu, Yueli Lan, Jiansheng Cui, Bo Yao, Mengzhen Hao, Shuangjiang Li","doi":"10.1039/d5em00337g","DOIUrl":null,"url":null,"abstract":"<p><p>Antibiotics, as emerging contaminants, are increasingly detected in aquatic environments, raising significant concerns about their ecological risks. However, the lack of hydrolysis rate constants (<i>k</i><sub>H</sub>) and aqueous hydroxyl radical degradation rate constants (<i>k</i><sub>OH</sub>) limits the environmental persistent assessment of antibiotics. The present study addresses this gap by developing prediction models using multiple linear regression and three machine learning algorithms (<i>i.e.</i>, random forest, support vector machine, and extreme gradient boosting (XGBoost)), based on a dataset of 69 <i>k</i><sub>H</sub> and 80 <i>k</i><sub>OH</sub> values. The XGBoost models, identified as optimal, were employed to fill in missing data in the original dataset. Subsequently, a multi-task model capable of simultaneously predicting <i>k</i><sub>H</sub> and <i>k</i><sub>OH</sub> values was developed with good performance. The application domain was characterized by Williams plots. Furthermore, Shapley Additive exPlanations analysis was employed to identify key molecular descriptors influencing degradation rates, which provides insights into the underlying degradation mechanisms. This approach not only facilitates the simultaneous prediction of <i>k</i><sub>H</sub> and <i>k</i><sub>OH</sub> values for various new pollutants, but also enhances the understanding of how molecular structure affects their synergistic degradation kinetics in aquatic environments, thereby significantly contributing to the assessment of environmental persistence of emerging contaminants.</p>","PeriodicalId":74,"journal":{"name":"Environmental Science: Processes & Impacts","volume":" ","pages":""},"PeriodicalIF":3.9000,"publicationDate":"2025-09-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Science: Processes & Impacts","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1039/d5em00337g","RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, ANALYTICAL","Score":null,"Total":0}
引用次数: 0

Abstract

Antibiotics, as emerging contaminants, are increasingly detected in aquatic environments, raising significant concerns about their ecological risks. However, the lack of hydrolysis rate constants (kH) and aqueous hydroxyl radical degradation rate constants (kOH) limits the environmental persistent assessment of antibiotics. The present study addresses this gap by developing prediction models using multiple linear regression and three machine learning algorithms (i.e., random forest, support vector machine, and extreme gradient boosting (XGBoost)), based on a dataset of 69 kH and 80 kOH values. The XGBoost models, identified as optimal, were employed to fill in missing data in the original dataset. Subsequently, a multi-task model capable of simultaneously predicting kH and kOH values was developed with good performance. The application domain was characterized by Williams plots. Furthermore, Shapley Additive exPlanations analysis was employed to identify key molecular descriptors influencing degradation rates, which provides insights into the underlying degradation mechanisms. This approach not only facilitates the simultaneous prediction of kH and kOH values for various new pollutants, but also enhances the understanding of how molecular structure affects their synergistic degradation kinetics in aquatic environments, thereby significantly contributing to the assessment of environmental persistence of emerging contaminants.

基于机器学习方法的抗生素在水生环境降解动力学参数预测模型的建立
抗生素作为新兴污染物,越来越多地出现在水生环境中,引起了人们对其生态风险的严重关注。然而,缺乏水解速率常数(kH)和水羟基自由基降解速率常数(kOH)限制了抗生素的环境持久性评估。本研究基于69 kH和80 kOH值的数据集,通过使用多元线性回归和三种机器学习算法(即随机森林,支持向量机和极端梯度增强(XGBoost))开发预测模型来解决这一差距。XGBoost模型被认为是最优的,被用来填补原始数据集中缺失的数据。随后,建立了能够同时预测kH和kOH值的多任务模型,并取得了较好的效果。应用领域用威廉姆斯图来表征。此外,采用Shapley加性解释分析来确定影响降解速率的关键分子描述符,从而深入了解潜在的降解机制。该方法不仅有助于同时预测各种新污染物的kH和kOH值,而且增强了对分子结构如何影响其在水生环境中的协同降解动力学的理解,从而对新污染物的环境持久性评估有重要贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmental Science: Processes & Impacts
Environmental Science: Processes & Impacts CHEMISTRY, ANALYTICAL-ENVIRONMENTAL SCIENCES
CiteScore
9.50
自引率
3.60%
发文量
202
审稿时长
1 months
期刊介绍: Environmental Science: Processes & Impacts publishes high quality papers in all areas of the environmental chemical sciences, including chemistry of the air, water, soil and sediment. We welcome studies on the environmental fate and effects of anthropogenic and naturally occurring contaminants, both chemical and microbiological, as well as related natural element cycling processes.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信