{"title":"Accelerating drug discovery targeting dihydroorotate dehydrogenase using machine learning and generative AI approaches","authors":"Gayathri Krishnamurthy Ganga","doi":"10.1016/j.compbiolchem.2025.108443","DOIUrl":null,"url":null,"abstract":"<div><div>Dihydroorotate dehydrogenase (DHODH) is a key enzyme in pyrimidine biosynthesis, making it an attractive drug target for cancer, autoimmune diseases, and infections. Traditional DHODH inhibitor discovery is slow and costly. Our study integrated machine learning (ML) and generative artificial intelligence (AI) to accelerate this process, enhancing efficiency and reducing costs. We employed Random Forest (RF), XGBoost (XGB), and Logistic Regression (LR) to predict pIC50 values, with RF achieving the highest accuracy (93 % test accuracy, 81 % on unseen molecules), demonstrating superior generalization. Using a Graph Convolutional Network-based Variational Autoencoder (GCN-VAE), we generated 59 unique drug-like molecules, five with pIC50 > 7, expanding the chemical space beyond conventional screening.</div><div>Docking studies confirmed strong binding affinities, with the most promising newly generated molecule showing a binding energy of –11.1 kcal/mol and an inhibition constant (Ki) of 269.8 nM. Key interactions with residues such as ALA59, PHE36, TYR38, GLN47, and ARG36 further validated stability and inhibitory potential. This AI-driven workflow accelerates DHODH inhibitor discovery by significantly reducing screening time, enhancing molecular diversity, and improving predictive accuracy. Our approach presents a scalable, cost-effective strategy for developing novel therapeutics, offering a transformative shift in drug discovery.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"118 ","pages":"Article 108443"},"PeriodicalIF":2.6000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Biology and Chemistry","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1476927125001033","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Dihydroorotate dehydrogenase (DHODH) is a key enzyme in pyrimidine biosynthesis, making it an attractive drug target for cancer, autoimmune diseases, and infections. Traditional DHODH inhibitor discovery is slow and costly. Our study integrated machine learning (ML) and generative artificial intelligence (AI) to accelerate this process, enhancing efficiency and reducing costs. We employed Random Forest (RF), XGBoost (XGB), and Logistic Regression (LR) to predict pIC50 values, with RF achieving the highest accuracy (93 % test accuracy, 81 % on unseen molecules), demonstrating superior generalization. Using a Graph Convolutional Network-based Variational Autoencoder (GCN-VAE), we generated 59 unique drug-like molecules, five with pIC50 > 7, expanding the chemical space beyond conventional screening.
Docking studies confirmed strong binding affinities, with the most promising newly generated molecule showing a binding energy of –11.1 kcal/mol and an inhibition constant (Ki) of 269.8 nM. Key interactions with residues such as ALA59, PHE36, TYR38, GLN47, and ARG36 further validated stability and inhibitory potential. This AI-driven workflow accelerates DHODH inhibitor discovery by significantly reducing screening time, enhancing molecular diversity, and improving predictive accuracy. Our approach presents a scalable, cost-effective strategy for developing novel therapeutics, offering a transformative shift in drug discovery.
期刊介绍:
Computational Biology and Chemistry publishes original research papers and review articles in all areas of computational life sciences. High quality research contributions with a major computational component in the areas of nucleic acid and protein sequence research, molecular evolution, molecular genetics (functional genomics and proteomics), theory and practice of either biology-specific or chemical-biology-specific modeling, and structural biology of nucleic acids and proteins are particularly welcome. Exceptionally high quality research work in bioinformatics, systems biology, ecology, computational pharmacology, metabolism, biomedical engineering, epidemiology, and statistical genetics will also be considered.
Given their inherent uncertainty, protein modeling and molecular docking studies should be thoroughly validated. In the absence of experimental results for validation, the use of molecular dynamics simulations along with detailed free energy calculations, for example, should be used as complementary techniques to support the major conclusions. Submissions of premature modeling exercises without additional biological insights will not be considered.
Review articles will generally be commissioned by the editors and should not be submitted to the journal without explicit invitation. However prospective authors are welcome to send a brief (one to three pages) synopsis, which will be evaluated by the editors.