Bharath Reddy Boya, Jin-Hyung Lee, Jae-Mun Choi, Jintae Lee
{"title":"Impact of halogenation on scaffold toxicity assessed using HD-GEM machine learning model.","authors":"Bharath Reddy Boya, Jin-Hyung Lee, Jae-Mun Choi, Jintae Lee","doi":"10.1093/bib/bbaf347","DOIUrl":null,"url":null,"abstract":"<p><p>Halogens play a fundamental role in drug design, influencing bioactivity, stability, and selectivity. However, their impact on toxicity, particularly genotoxicity, cardiotoxicity, and hepatotoxicity, remains a critical challenge in drug discovery. This study presents HD-GEM (Hybrid Dynamic Graph-based Ensemble Model), a novel machine learning framework integrating graph neural networks, descriptor-based molecular fingerprints, and ensemble meta-learning to predict the toxicity of halogenated aromatic compounds and drug scaffolds. HD-GEM demonstrates superior predictive power compared to conventional machine learning (ML) models and popular toxicity web applications like ProTox, ADMETlab, and admetSAR, achieving high accuracy and Receiver Operating Characteristic-Area Under Curve scores across diverse datasets. Importantly, a node perturbation analysis revealed that carbon, nitrogen, and oxygen atoms within the scaffold dominate toxicity predictions, whereas halogen contributions were minimal, challenging the conventional assumption that halogenation inherently increases toxicity in many pharmacological contexts. Among halogens, iodine-substituted compounds exhibit the lowest toxicity, a trend corroborated across single-, double-, and triple-ring scaffolds. Notably, polyhalogenated scaffolds show reduced toxicity, suggesting a stabilizing effect that mitigates reactive metabolite formation. This study presents an interpretable artificial intelligence-driven framework for toxicity prediction in the context of computational toxicology and cheminformatics. Atom-level and descriptor-based analyses reveal scaffold- and feature-specific contributions to toxicity.</p>","PeriodicalId":9209,"journal":{"name":"Briefings in bioinformatics","volume":"26 4","pages":""},"PeriodicalIF":6.8000,"publicationDate":"2025-07-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Briefings in bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1093/bib/bbaf347","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0
Abstract
Halogens play a fundamental role in drug design, influencing bioactivity, stability, and selectivity. However, their impact on toxicity, particularly genotoxicity, cardiotoxicity, and hepatotoxicity, remains a critical challenge in drug discovery. This study presents HD-GEM (Hybrid Dynamic Graph-based Ensemble Model), a novel machine learning framework integrating graph neural networks, descriptor-based molecular fingerprints, and ensemble meta-learning to predict the toxicity of halogenated aromatic compounds and drug scaffolds. HD-GEM demonstrates superior predictive power compared to conventional machine learning (ML) models and popular toxicity web applications like ProTox, ADMETlab, and admetSAR, achieving high accuracy and Receiver Operating Characteristic-Area Under Curve scores across diverse datasets. Importantly, a node perturbation analysis revealed that carbon, nitrogen, and oxygen atoms within the scaffold dominate toxicity predictions, whereas halogen contributions were minimal, challenging the conventional assumption that halogenation inherently increases toxicity in many pharmacological contexts. Among halogens, iodine-substituted compounds exhibit the lowest toxicity, a trend corroborated across single-, double-, and triple-ring scaffolds. Notably, polyhalogenated scaffolds show reduced toxicity, suggesting a stabilizing effect that mitigates reactive metabolite formation. This study presents an interpretable artificial intelligence-driven framework for toxicity prediction in the context of computational toxicology and cheminformatics. Atom-level and descriptor-based analyses reveal scaffold- and feature-specific contributions to toxicity.
期刊介绍:
Briefings in Bioinformatics is an international journal serving as a platform for researchers and educators in the life sciences. It also appeals to mathematicians, statisticians, and computer scientists applying their expertise to biological challenges. The journal focuses on reviews tailored for users of databases and analytical tools in contemporary genetics, molecular and systems biology. It stands out by offering practical assistance and guidance to non-specialists in computerized methodologies. Covering a wide range from introductory concepts to specific protocols and analyses, the papers address bacterial, plant, fungal, animal, and human data.
The journal's detailed subject areas include genetic studies of phenotypes and genotypes, mapping, DNA sequencing, expression profiling, gene expression studies, microarrays, alignment methods, protein profiles and HMMs, lipids, metabolic and signaling pathways, structure determination and function prediction, phylogenetic studies, and education and training.