A comprehensive machine learning-based models for predicting mixture toxicity of azole fungicides toward algae (Auxenochlorella pyrenoidosa)

IF 10.3 1区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Li-Tang Qin , Xue-Fang Tian , Jun-Yao Zhang , Yan-Peng Liang , Hong-Hu Zeng , Ling-Yun Mo
{"title":"A comprehensive machine learning-based models for predicting mixture toxicity of azole fungicides toward algae (Auxenochlorella pyrenoidosa)","authors":"Li-Tang Qin ,&nbsp;Xue-Fang Tian ,&nbsp;Jun-Yao Zhang ,&nbsp;Yan-Peng Liang ,&nbsp;Hong-Hu Zeng ,&nbsp;Ling-Yun Mo","doi":"10.1016/j.envint.2024.109162","DOIUrl":null,"url":null,"abstract":"<div><div>Quantitative structure–activity relationships (QSARs) have been used to predict mixture toxicity. However, current research faces gaps in achieving accurate predictions of the mixture toxicity of azole fungicides. To address this gap, the application of machine learning (ML) algorithms has emerged as an effective strategy. In this study, we applied 12 algorithms, namely, k-nearest neighbor (KNN), kernel k-nearest neighbors (KKNN), support vector machine (SVM), random forest (RF), stochastic gradient boosting (GBM), cubist, bagged multivariate adaptive regression splines (Bagged MARS), eXtreme gradient boosting (XGBoost), boosted generalized linear model (GLMBoost), boosted generalized additive model (GAMBoost), bayesian regularized neural networks (BRNN), and recursive partitioning and regression trees (CART) to build ML models for 225 mixture toxicity of azole fungicides towards <em>Auxenochlorella pyrenoidosa</em>. A total of 36 single ML models and 12 consensus models were developed. The results indicated that models employing concentration addition (CA), independent action (IA), and molecular descriptors (MD) as variables demonstrated superior predictive abilities. The consensus model combining SVM and RF algorithms (labeled as CM0) demonstrated the highest level of accuracy in fitting the data, with a coefficient of determination of 0.980. Additionally, it showed strong predictive abilities when tested with external data, achieving an external R<sup>2</sup> value of 0.945 and a Concordance Correlation Coefficient of 0.967. This study provides a positive contribution to the ecological risk assessment of a mixture of azole fungicides.</div></div>","PeriodicalId":308,"journal":{"name":"Environment International","volume":"194 ","pages":"Article 109162"},"PeriodicalIF":10.3000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environment International","FirstCategoryId":"93","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0160412024007487","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Quantitative structure–activity relationships (QSARs) have been used to predict mixture toxicity. However, current research faces gaps in achieving accurate predictions of the mixture toxicity of azole fungicides. To address this gap, the application of machine learning (ML) algorithms has emerged as an effective strategy. In this study, we applied 12 algorithms, namely, k-nearest neighbor (KNN), kernel k-nearest neighbors (KKNN), support vector machine (SVM), random forest (RF), stochastic gradient boosting (GBM), cubist, bagged multivariate adaptive regression splines (Bagged MARS), eXtreme gradient boosting (XGBoost), boosted generalized linear model (GLMBoost), boosted generalized additive model (GAMBoost), bayesian regularized neural networks (BRNN), and recursive partitioning and regression trees (CART) to build ML models for 225 mixture toxicity of azole fungicides towards Auxenochlorella pyrenoidosa. A total of 36 single ML models and 12 consensus models were developed. The results indicated that models employing concentration addition (CA), independent action (IA), and molecular descriptors (MD) as variables demonstrated superior predictive abilities. The consensus model combining SVM and RF algorithms (labeled as CM0) demonstrated the highest level of accuracy in fitting the data, with a coefficient of determination of 0.980. Additionally, it showed strong predictive abilities when tested with external data, achieving an external R2 value of 0.945 and a Concordance Correlation Coefficient of 0.967. This study provides a positive contribution to the ecological risk assessment of a mixture of azole fungicides.

Abstract Image

Abstract Image

基于机器学习的唑类杀菌剂对藻类(焦藻)混合毒性综合预测模型
定量结构-活性关系(QSAR)已被用于预测混合物的毒性。然而,目前的研究在准确预测唑类杀菌剂混合物毒性方面还存在差距。为了弥补这一差距,应用机器学习(ML)算法已成为一种有效的策略。在本研究中,我们应用了 12 种算法,即 k 近邻(KNN)、核 k 近邻(KKNN)、支持向量机(SVM)、随机森林(RF)、随机梯度提升(GBM)、立方体、袋装多元自适应回归样条(Bagged MARS)、极端梯度提升(XGBoost)、(GLMBoost)、提升广义加性模型(GAMBoost)、贝叶斯正则化神经网络(BRNN)和递归分区与回归树(CART)等方法建立了多模型,用于分析唑类杀菌剂对焦磷酸小球藻的 225 种混合物毒性。共建立了 36 个单一 ML 模型和 12 个共识模型。结果表明,采用浓度加成(CA)、独立作用(IA)和分子描述符(MD)作为变量的模型表现出更优越的预测能力。结合 SVM 和 RF 算法的共识模型(标记为 CM0)在拟合数据方面表现出最高的准确度,决定系数为 0.980。此外,在使用外部数据进行测试时,它也表现出很强的预测能力,外部 R2 值为 0.945,一致性相关系数为 0.967。这项研究为唑类杀菌剂混合物的生态风险评估做出了积极贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environment International
Environment International 环境科学-环境科学
CiteScore
21.90
自引率
3.40%
发文量
734
审稿时长
2.8 months
期刊介绍: Environmental Health publishes manuscripts focusing on critical aspects of environmental and occupational medicine, including studies in toxicology and epidemiology, to illuminate the human health implications of exposure to environmental hazards. The journal adopts an open-access model and practices open peer review. It caters to scientists and practitioners across all environmental science domains, directly or indirectly impacting human health and well-being. With a commitment to enhancing the prevention of environmentally-related health risks, Environmental Health serves as a public health journal for the community and scientists engaged in matters of public health significance concerning the environment.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信