{"title":"揭示精确药物设计的分子特征:来自锥虫硫酮还原酶、PKC-θ和CB1的机器学习见解。","authors":"Sunil Sahu, Adarsh Anmol, Tushar Nishad, Satya Eswari Jujjavarapu","doi":"10.1007/s11030-025-11287-3","DOIUrl":null,"url":null,"abstract":"<p><p>Traditional drug discovery methods like high-throughput screening and molecular docking are slow and costly. This study introduces a machine learning framework to predict bioactivity (pIC₅₀) and identify key molecular properties and structural features for targeting Trypanothione reductase (TR), Protein kinase C theta (PKC-θ), and Cannabinoid receptor 1 (CB1) using data from the ChEMBL database. Molecular fingerprints, generated via PaDEL-Descriptor and RDKit, encoded structural features as binary vectors. Three models-Random Forest (RF), Gradient Boosting (GB), and a stacking ensemble with Ridge Regression-predicted pIC₅₀, with the ensemble achieving the lowest RMSE. Results highlight heteroatom-containing rings for TR, multiple ring systems for PKC-θ, and aromatic rings for CB1 as critical for high bioactivity. This adaptable framework accelerates drug design by pinpointing optimizable structures, enhancing efficiency in therapeutic development.</p>","PeriodicalId":708,"journal":{"name":"Molecular Diversity","volume":" ","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2025-09-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Unveiling molecular signatures for precision drug design: machine learning insights from trypanothione reductase, PKC-θ, and CB1.\",\"authors\":\"Sunil Sahu, Adarsh Anmol, Tushar Nishad, Satya Eswari Jujjavarapu\",\"doi\":\"10.1007/s11030-025-11287-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Traditional drug discovery methods like high-throughput screening and molecular docking are slow and costly. This study introduces a machine learning framework to predict bioactivity (pIC₅₀) and identify key molecular properties and structural features for targeting Trypanothione reductase (TR), Protein kinase C theta (PKC-θ), and Cannabinoid receptor 1 (CB1) using data from the ChEMBL database. Molecular fingerprints, generated via PaDEL-Descriptor and RDKit, encoded structural features as binary vectors. Three models-Random Forest (RF), Gradient Boosting (GB), and a stacking ensemble with Ridge Regression-predicted pIC₅₀, with the ensemble achieving the lowest RMSE. Results highlight heteroatom-containing rings for TR, multiple ring systems for PKC-θ, and aromatic rings for CB1 as critical for high bioactivity. This adaptable framework accelerates drug design by pinpointing optimizable structures, enhancing efficiency in therapeutic development.</p>\",\"PeriodicalId\":708,\"journal\":{\"name\":\"Molecular Diversity\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-09-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Molecular Diversity\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://doi.org/10.1007/s11030-025-11287-3\",\"RegionNum\":2,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Diversity","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s11030-025-11287-3","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
Unveiling molecular signatures for precision drug design: machine learning insights from trypanothione reductase, PKC-θ, and CB1.
Traditional drug discovery methods like high-throughput screening and molecular docking are slow and costly. This study introduces a machine learning framework to predict bioactivity (pIC₅₀) and identify key molecular properties and structural features for targeting Trypanothione reductase (TR), Protein kinase C theta (PKC-θ), and Cannabinoid receptor 1 (CB1) using data from the ChEMBL database. Molecular fingerprints, generated via PaDEL-Descriptor and RDKit, encoded structural features as binary vectors. Three models-Random Forest (RF), Gradient Boosting (GB), and a stacking ensemble with Ridge Regression-predicted pIC₅₀, with the ensemble achieving the lowest RMSE. Results highlight heteroatom-containing rings for TR, multiple ring systems for PKC-θ, and aromatic rings for CB1 as critical for high bioactivity. This adaptable framework accelerates drug design by pinpointing optimizable structures, enhancing efficiency in therapeutic development.
期刊介绍:
Molecular Diversity is a new publication forum for the rapid publication of refereed papers dedicated to describing the development, application and theory of molecular diversity and combinatorial chemistry in basic and applied research and drug discovery. The journal publishes both short and full papers, perspectives, news and reviews dealing with all aspects of the generation of molecular diversity, application of diversity for screening against alternative targets of all types (biological, biophysical, technological), analysis of results obtained and their application in various scientific disciplines/approaches including:
combinatorial chemistry and parallel synthesis;
small molecule libraries;
microwave synthesis;
flow synthesis;
fluorous synthesis;
diversity oriented synthesis (DOS);
nanoreactors;
click chemistry;
multiplex technologies;
fragment- and ligand-based design;
structure/function/SAR;
computational chemistry and molecular design;
chemoinformatics;
screening techniques and screening interfaces;
analytical and purification methods;
robotics, automation and miniaturization;
targeted libraries;
display libraries;
peptides and peptoids;
proteins;
oligonucleotides;
carbohydrates;
natural diversity;
new methods of library formulation and deconvolution;
directed evolution, origin of life and recombination;
search techniques, landscapes, random chemistry and more;