e-QSAR (Explainable AI-QSAR), molecular docking, and ADMET analysis of structurally diverse GSK3-beta modulators to identify concealed modulatory features vindicated by X-ray
Vijay H. Masand , Sami Al-Hussain , Gaurav S. Masand , Abdul Samad , Rakhi Gawali , Shravan Jadhav , Magdi E.A. Zaki
{"title":"e-QSAR (Explainable AI-QSAR), molecular docking, and ADMET analysis of structurally diverse GSK3-beta modulators to identify concealed modulatory features vindicated by X-ray","authors":"Vijay H. Masand , Sami Al-Hussain , Gaurav S. Masand , Abdul Samad , Rakhi Gawali , Shravan Jadhav , Magdi E.A. Zaki","doi":"10.1016/j.compbiolchem.2024.108324","DOIUrl":null,"url":null,"abstract":"<div><div>Glycogen Synthase Kinase-3 beta (GSK-3β) is a crucial enzyme linked to various cellular processes, including neurodegeneration, autophagy, and diabetes. A structurally diverse set of 1293 molecules having GSK-3β modulatory activity has been used. Molecular docking and eXplainable Artificial Intelligence (XAI) have been used concomitantly. The approach involves using GA for feature selection and XGBoost for in-depth analysis, yielding strong statistical validation with R2tr = 0.9075, R2L10 %O = 0.9116, and Q2F3 = 0.7841. Molecular docking provided complementary and similar results. Machine learning model interpretation using SHapley Additive exPlanations (SHAP) revealed that specific structural features like aromatic carbon with specific partial charges, non-ring nitrogen atoms, sp<sup>3</sup>-hybrid carbon atoms, and the topological distance between carbon and nitrogen atoms, among others, significantly influence the modulatory profile. The results are also supported by reported X-ray resolved structures. In addition, <em>in-silico</em> ADMET analysis is also accomplished. This research underscores the value of advanced machine learning techniques in understanding complex biological phenomena and supporting rational drug design.</div></div>","PeriodicalId":10616,"journal":{"name":"Computational Biology and Chemistry","volume":"115 ","pages":"Article 108324"},"PeriodicalIF":2.6000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational Biology and Chemistry","FirstCategoryId":"99","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1476927124003128","RegionNum":4,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Glycogen Synthase Kinase-3 beta (GSK-3β) is a crucial enzyme linked to various cellular processes, including neurodegeneration, autophagy, and diabetes. A structurally diverse set of 1293 molecules having GSK-3β modulatory activity has been used. Molecular docking and eXplainable Artificial Intelligence (XAI) have been used concomitantly. The approach involves using GA for feature selection and XGBoost for in-depth analysis, yielding strong statistical validation with R2tr = 0.9075, R2L10 %O = 0.9116, and Q2F3 = 0.7841. Molecular docking provided complementary and similar results. Machine learning model interpretation using SHapley Additive exPlanations (SHAP) revealed that specific structural features like aromatic carbon with specific partial charges, non-ring nitrogen atoms, sp3-hybrid carbon atoms, and the topological distance between carbon and nitrogen atoms, among others, significantly influence the modulatory profile. The results are also supported by reported X-ray resolved structures. In addition, in-silico ADMET analysis is also accomplished. This research underscores the value of advanced machine learning techniques in understanding complex biological phenomena and supporting rational drug design.
期刊介绍:
Computational Biology and Chemistry publishes original research papers and review articles in all areas of computational life sciences. High quality research contributions with a major computational component in the areas of nucleic acid and protein sequence research, molecular evolution, molecular genetics (functional genomics and proteomics), theory and practice of either biology-specific or chemical-biology-specific modeling, and structural biology of nucleic acids and proteins are particularly welcome. Exceptionally high quality research work in bioinformatics, systems biology, ecology, computational pharmacology, metabolism, biomedical engineering, epidemiology, and statistical genetics will also be considered.
Given their inherent uncertainty, protein modeling and molecular docking studies should be thoroughly validated. In the absence of experimental results for validation, the use of molecular dynamics simulations along with detailed free energy calculations, for example, should be used as complementary techniques to support the major conclusions. Submissions of premature modeling exercises without additional biological insights will not be considered.
Review articles will generally be commissioned by the editors and should not be submitted to the journal without explicit invitation. However prospective authors are welcome to send a brief (one to three pages) synopsis, which will be evaluated by the editors.