Descriptor-Based Explainable QSAR Modeling Approaches for the Prediction of Respiratory Toxicity of Volatile and Gas-Phase Chemicals.

IF 2.8 4区 医学 Q3 TOXICOLOGY
Gul Karaduman, Hasan Yildirim
{"title":"Descriptor-Based Explainable QSAR Modeling Approaches for the Prediction of Respiratory Toxicity of Volatile and Gas-Phase Chemicals.","authors":"Gul Karaduman, Hasan Yildirim","doi":"10.1002/jat.4859","DOIUrl":null,"url":null,"abstract":"<p><p>Toxic gases pose significant risks to human health and ecosystems because of their capacity to deplete oxygen levels and disrupt critical physiological functions. Effective management of these hazards is crucial, given their widespread use in various industrial sectors, including agriculture, metallurgy, pharmaceuticals, and plastics. A dataset of 229 gaseous-phase chemicals from authoritative toxicological sources was used to develop robust binary quantitative structure-activity relationship (QSAR) models for predicting toxicity. Employing molecular descriptor selection, correlation analyses, and outlier detection, random forest and XGBoost emerged as the most accurate models, each achieving a high accuracy of 0.915, with ROC areas of 0.992 and 0.986, respectively. SHapley Additive exPlanations (SHAP) were applied to enhance the explainability of the best model, providing global and local insights into the importance and direction of each descriptor. Key features influencing toxicity were identified globally, while locally, descriptors were ranked on a molecule-by-molecule basis, offering guidance for mitigating toxic effects through structural modifications. Although numerous studies have focused on predicting the toxicity of chemical gases, few have systematically analyzed the specific molecular descriptors driving toxicity. This study addresses this gap by providing a descriptor-based analysis, offering deeper mechanistic insights, and enabling more accurate predictions. The QSAR approach aligns with the 4R principles, promoting ethical, cost-effective, and efficient alternatives to animal testing, thus contributing to safer chemical design and environmental monitoring.</p>","PeriodicalId":15242,"journal":{"name":"Journal of Applied Toxicology","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-07-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Applied Toxicology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/jat.4859","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"TOXICOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Toxic gases pose significant risks to human health and ecosystems because of their capacity to deplete oxygen levels and disrupt critical physiological functions. Effective management of these hazards is crucial, given their widespread use in various industrial sectors, including agriculture, metallurgy, pharmaceuticals, and plastics. A dataset of 229 gaseous-phase chemicals from authoritative toxicological sources was used to develop robust binary quantitative structure-activity relationship (QSAR) models for predicting toxicity. Employing molecular descriptor selection, correlation analyses, and outlier detection, random forest and XGBoost emerged as the most accurate models, each achieving a high accuracy of 0.915, with ROC areas of 0.992 and 0.986, respectively. SHapley Additive exPlanations (SHAP) were applied to enhance the explainability of the best model, providing global and local insights into the importance and direction of each descriptor. Key features influencing toxicity were identified globally, while locally, descriptors were ranked on a molecule-by-molecule basis, offering guidance for mitigating toxic effects through structural modifications. Although numerous studies have focused on predicting the toxicity of chemical gases, few have systematically analyzed the specific molecular descriptors driving toxicity. This study addresses this gap by providing a descriptor-based analysis, offering deeper mechanistic insights, and enabling more accurate predictions. The QSAR approach aligns with the 4R principles, promoting ethical, cost-effective, and efficient alternatives to animal testing, thus contributing to safer chemical design and environmental monitoring.

基于描述符的可解释QSAR建模方法用于挥发性和气相化学品的呼吸毒性预测。
有毒气体具有耗尽氧气水平和破坏关键生理功能的能力,因此对人类健康和生态系统构成重大风险。鉴于这些危害在包括农业、冶金、制药和塑料在内的各个工业部门广泛使用,对这些危害进行有效管理至关重要。利用来自权威毒理学来源的229种气相化学物质的数据集,建立了可靠的二元定量构效关系(QSAR)模型,用于预测毒性。通过分子描述符选择、相关分析和离群值检测,随机森林和XGBoost是最准确的模型,准确率均达到0.915,ROC面积分别为0.992和0.986。SHapley加性解释(SHAP)被用于增强最佳模型的可解释性,提供对每个描述符的重要性和方向的全局和局部见解。在全球范围内确定了影响毒性的关键特征,而在局部,描述符按分子对分子进行排序,为通过结构修饰减轻毒性作用提供指导。虽然许多研究都集中在预测化学气体的毒性,但很少有系统地分析驱动毒性的特定分子描述符。本研究通过提供基于描述符的分析,提供更深入的机制见解,并实现更准确的预测,解决了这一差距。QSAR方法符合4R原则,提倡道德、成本效益和高效的动物试验替代方案,从而有助于更安全的化学品设计和环境监测。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
7.00
自引率
6.10%
发文量
145
审稿时长
1 months
期刊介绍: Journal of Applied Toxicology publishes peer-reviewed original reviews and hypothesis-driven research articles on mechanistic, fundamental and applied research relating to the toxicity of drugs and chemicals at the molecular, cellular, tissue, target organ and whole body level in vivo (by all relevant routes of exposure) and in vitro / ex vivo. All aspects of toxicology are covered (including but not limited to nanotoxicology, genomics and proteomics, teratogenesis, carcinogenesis, mutagenesis, reproductive and endocrine toxicology, toxicopathology, target organ toxicity, systems toxicity (eg immunotoxicity), neurobehavioral toxicology, mechanistic studies, biochemical and molecular toxicology, novel biomarkers, pharmacokinetics/PBPK, risk assessment and environmental health studies) and emphasis is given to papers of clear application to human health, and/or advance mechanistic understanding and/or provide significant contributions and impact to their field.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信