Using machine learning to classify the immunosuppressive activity of per- and polyfluoroalkyl substances.

IF 3.2 4区 医学 Q1 Pharmacology, Toxicology and Pharmaceutics
Yuxin Xuan, Yulu Wang, Rui Li, Yuyan Zhong, Na Wang, Lingyin Zhang, Qian Chen, Shuling Yu, Jintao Yuan
{"title":"Using machine learning to classify the immunosuppressive activity of per- and polyfluoroalkyl substances.","authors":"Yuxin Xuan, Yulu Wang, Rui Li, Yuyan Zhong, Na Wang, Lingyin Zhang, Qian Chen, Shuling Yu, Jintao Yuan","doi":"10.1080/15376516.2024.2387733","DOIUrl":null,"url":null,"abstract":"<p><p>Per- and polyfluoroalkyl substances (PFASs), one of the persistent organic pollutants, have immunosuppressive effects. The evaluation of this effect has been the focus of regulatory toxicology. In this investigation, 146 PFASs (immunosuppressive or nonimmunosuppressive) and corresponding concentration gradients were collected from literature, and their structures were characterized by using Dragon descriptors. Feature importance analysis and stepwise feature elimination are used for feature selection. Three machine learning (ML) methods, namely Random Forest (RF), Extreme Gradient Boosting Machine (XGB), and Categorical Boosting Machine (CB), were utilized for model development. The model interpretability was explored by feature importance analysis and correlation analysis. The findings indicated that the three models developed have exhibited excellent performance. Among them, the best-performing RF model has an average AUC score of 0.9720 for the testing set. The results of the feature importance analysis demonstrated that concentration, SpPosA_X, IVDE, R2s, and SIC2 were the crucial molecular features. Applicability domain analysis was also performed to determine reliable prediction boundaries for the model. In conclusion, this study is the first application of ML models to investigate the immunosuppressive activity of PFASs. The variables used in the models can help understand the mechanism of the immunosuppressive activity of PFASs, allow researchers to more effectively assess the immunosuppressive potential of a large number of PFASs, and thus better guide environmental and health risk assessment efforts.</p>","PeriodicalId":23177,"journal":{"name":"Toxicology Mechanisms and Methods","volume":" ","pages":"1-9"},"PeriodicalIF":3.2000,"publicationDate":"2024-08-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Toxicology Mechanisms and Methods","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/15376516.2024.2387733","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Pharmacology, Toxicology and Pharmaceutics","Score":null,"Total":0}
引用次数: 0

Abstract

Per- and polyfluoroalkyl substances (PFASs), one of the persistent organic pollutants, have immunosuppressive effects. The evaluation of this effect has been the focus of regulatory toxicology. In this investigation, 146 PFASs (immunosuppressive or nonimmunosuppressive) and corresponding concentration gradients were collected from literature, and their structures were characterized by using Dragon descriptors. Feature importance analysis and stepwise feature elimination are used for feature selection. Three machine learning (ML) methods, namely Random Forest (RF), Extreme Gradient Boosting Machine (XGB), and Categorical Boosting Machine (CB), were utilized for model development. The model interpretability was explored by feature importance analysis and correlation analysis. The findings indicated that the three models developed have exhibited excellent performance. Among them, the best-performing RF model has an average AUC score of 0.9720 for the testing set. The results of the feature importance analysis demonstrated that concentration, SpPosA_X, IVDE, R2s, and SIC2 were the crucial molecular features. Applicability domain analysis was also performed to determine reliable prediction boundaries for the model. In conclusion, this study is the first application of ML models to investigate the immunosuppressive activity of PFASs. The variables used in the models can help understand the mechanism of the immunosuppressive activity of PFASs, allow researchers to more effectively assess the immunosuppressive potential of a large number of PFASs, and thus better guide environmental and health risk assessment efforts.

利用机器学习对全氟化烷基和多氟化烷基物质的免疫抑制活性进行分类。
全氟和多氟烷基物质(PFAS)是持久性有机污染物之一,具有免疫抑制作用。对这种效应的评估一直是监管毒理学的重点。本研究从文献中收集了 146 种 PFAS(免疫抑制或非免疫抑制)及其相应的浓度梯度,并使用龙描述符对其结构进行了表征。特征重要性分析和逐步特征消除用于特征选择。模型开发采用了三种机器学习(ML)方法,即随机森林(RF)、极梯度提升机(XGB)和分类提升机(CB)。通过特征重要性分析和相关性分析探讨了模型的可解释性。研究结果表明,所开发的三个模型都表现出了卓越的性能。其中,表现最好的 RF 模型在测试集中的平均 AUC 得分为 0.9720。特征重要性分析结果表明,浓度、SpPosA_X、IVDE、R2s 和 SIC2 是关键的分子特征。此外,还进行了适用域分析,以确定模型的可靠预测边界。总之,本研究是首次应用 ML 模型研究全氟辛烷磺酸的免疫抑制活性。模型中使用的变量有助于理解全氟辛烷磺酸的免疫抑制活性机制,使研究人员能够更有效地评估大量全氟辛烷磺酸的免疫抑制潜力,从而更好地指导环境和健康风险评估工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
6.60
自引率
3.10%
发文量
66
审稿时长
6-12 weeks
期刊介绍: Toxicology Mechanisms and Methods is a peer-reviewed journal whose aim is twofold. Firstly, the journal contains original research on subjects dealing with the mechanisms by which foreign chemicals cause toxic tissue injury. Chemical substances of interest include industrial compounds, environmental pollutants, hazardous wastes, drugs, pesticides, and chemical warfare agents. The scope of the journal spans from molecular and cellular mechanisms of action to the consideration of mechanistic evidence in establishing regulatory policy. Secondly, the journal addresses aspects of the development, validation, and application of new and existing laboratory methods, techniques, and equipment. A variety of research methods are discussed, including: In vivo studies with standard and alternative species In vitro studies and alternative methodologies Molecular, biochemical, and cellular techniques Pharmacokinetics and pharmacodynamics Mathematical modeling and computer programs Forensic analyses Risk assessment Data collection and analysis.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信