Impact of halogenation on scaffold toxicity assessed using HD-GEM machine learning model.

IF 6.8 2区 生物学 Q1 BIOCHEMICAL RESEARCH METHODS
Bharath Reddy Boya, Jin-Hyung Lee, Jae-Mun Choi, Jintae Lee
{"title":"Impact of halogenation on scaffold toxicity assessed using HD-GEM machine learning model.","authors":"Bharath Reddy Boya, Jin-Hyung Lee, Jae-Mun Choi, Jintae Lee","doi":"10.1093/bib/bbaf347","DOIUrl":null,"url":null,"abstract":"<p><p>Halogens play a fundamental role in drug design, influencing bioactivity, stability, and selectivity. However, their impact on toxicity, particularly genotoxicity, cardiotoxicity, and hepatotoxicity, remains a critical challenge in drug discovery. This study presents HD-GEM (Hybrid Dynamic Graph-based Ensemble Model), a novel machine learning framework integrating graph neural networks, descriptor-based molecular fingerprints, and ensemble meta-learning to predict the toxicity of halogenated aromatic compounds and drug scaffolds. HD-GEM demonstrates superior predictive power compared to conventional machine learning (ML) models and popular toxicity web applications like ProTox, ADMETlab, and admetSAR, achieving high accuracy and Receiver Operating Characteristic-Area Under Curve scores across diverse datasets. Importantly, a node perturbation analysis revealed that carbon, nitrogen, and oxygen atoms within the scaffold dominate toxicity predictions, whereas halogen contributions were minimal, challenging the conventional assumption that halogenation inherently increases toxicity in many pharmacological contexts. Among halogens, iodine-substituted compounds exhibit the lowest toxicity, a trend corroborated across single-, double-, and triple-ring scaffolds. Notably, polyhalogenated scaffolds show reduced toxicity, suggesting a stabilizing effect that mitigates reactive metabolite formation. This study presents an interpretable artificial intelligence-driven framework for toxicity prediction in the context of computational toxicology and cheminformatics. Atom-level and descriptor-based analyses reveal scaffold- and feature-specific contributions to toxicity.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 4","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbaf347","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

Abstract

Halogens play a fundamental role in drug design, influencing bioactivity, stability, and selectivity. However, their impact on toxicity, particularly genotoxicity, cardiotoxicity, and hepatotoxicity, remains a critical challenge in drug discovery. This study presents HD-GEM (Hybrid Dynamic Graph-based Ensemble Model), a novel machine learning framework integrating graph neural networks, descriptor-based molecular fingerprints, and ensemble meta-learning to predict the toxicity of halogenated aromatic compounds and drug scaffolds. HD-GEM demonstrates superior predictive power compared to conventional machine learning (ML) models and popular toxicity web applications like ProTox, ADMETlab, and admetSAR, achieving high accuracy and Receiver Operating Characteristic-Area Under Curve scores across diverse datasets. Importantly, a node perturbation analysis revealed that carbon, nitrogen, and oxygen atoms within the scaffold dominate toxicity predictions, whereas halogen contributions were minimal, challenging the conventional assumption that halogenation inherently increases toxicity in many pharmacological contexts. Among halogens, iodine-substituted compounds exhibit the lowest toxicity, a trend corroborated across single-, double-, and triple-ring scaffolds. Notably, polyhalogenated scaffolds show reduced toxicity, suggesting a stabilizing effect that mitigates reactive metabolite formation. This study presents an interpretable artificial intelligence-driven framework for toxicity prediction in the context of computational toxicology and cheminformatics. Atom-level and descriptor-based analyses reveal scaffold- and feature-specific contributions to toxicity.

使用HD-GEM机器学习模型评估卤化对支架毒性的影响。
卤素在药物设计中起着至关重要的作用,影响着药物的生物活性、稳定性和选择性。然而,它们对毒性的影响,特别是遗传毒性、心脏毒性和肝毒性,仍然是药物发现的一个关键挑战。本研究提出了HD-GEM (Hybrid Dynamic graph -based Ensemble Model),这是一种新的机器学习框架,将图神经网络、基于描述符的分子指纹和集成元学习集成在一起,用于预测卤化芳香族化合物和药物支架的毒性。与传统的机器学习(ML)模型和流行的毒性网络应用程序(如ProTox, ADMETlab和admetSAR)相比,HD-GEM显示出卓越的预测能力,在不同的数据集上实现了高精度和Receiver Operating characteristic area Under Curve得分。重要的是,节点扰动分析显示,支架内的碳、氮和氧原子在毒性预测中占主导地位,而卤素的贡献很小,这挑战了卤化在许多药理学背景下固有地增加毒性的传统假设。在卤素中,碘取代的化合物表现出最低的毒性,这一趋势在单环、双环和三环支架中得到证实。值得注意的是,多卤化支架显示出较低的毒性,表明其稳定作用可以减轻反应性代谢物的形成。本研究提出了一个可解释的人工智能驱动的框架,用于计算毒理学和化学信息学背景下的毒性预测。原子水平和基于描述符的分析揭示了支架和特征特异性对毒性的贡献。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Briefings in bioinformatics
Briefings in bioinformatics 生物-生化研究方法
CiteScore
13.20
自引率
13.70%
发文量
549
审稿时长
6 months
期刊介绍: Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data. The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信