SAR and QSAR in Environmental Research最新文献

筛选
英文 中文
QSPR models for water solubility of organic compounds using correlation intensity index and Las Vegas algorithm. 利用相关强度指数和Las Vegas算法建立有机化合物水溶性的QSPR模型。
IF 2.3 3区 环境科学与生态学
SAR and QSAR in Environmental Research Pub Date : 2026-04-24 DOI: 10.1080/1062936X.2026.2659325
A P Toropova, A A Toropov, G Selvestrel, N Iovine, A Roncaglioni, E Benfenati
{"title":"QSPR models for water solubility of organic compounds using correlation intensity index and Las Vegas algorithm.","authors":"A P Toropova, A A Toropov, G Selvestrel, N Iovine, A Roncaglioni, E Benfenati","doi":"10.1080/1062936X.2026.2659325","DOIUrl":"https://doi.org/10.1080/1062936X.2026.2659325","url":null,"abstract":"<p><p>Water solubility is an important factor in environmental and toxicological science because it determines the mobility, bioavailability, and potential for absorption by living organisms. Higher solubility often correlates with greater environmental mobility, and toxicity is often inversely related to solubility. In this work, the solubility of 4453 compounds in water was examined using quantitative structure-property relationships (QSPR). The effectiveness of the Monte Carlo method, the correlation intensity index (CII), and the Las Vegas algorithm for developing organic compound solubility models is assessed using CORAL software. Both factors (CII and the Las Vegas algorithm) have been shown to improve the statistical quality of the model set for the calibration and validation sets. The average coefficient of determination for the validation set is 0.94 ± 0.01.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"1-12"},"PeriodicalIF":2.3,"publicationDate":"2026-04-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147779771","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Network toxicology and molecular docking elucidation of oligoasthenospermia induced by bisphenol analogues: a comparative study of BPA, BPS, BPF, and BPAF. 双酚类似物诱导少弱精子症的网络毒理学和分子对接研究:双酚a、BPS、BPF和BPAF的比较研究。
IF 2.3 3区 环境科学与生态学
SAR and QSAR in Environmental Research Pub Date : 2026-04-20 DOI: 10.1080/1062936X.2026.2659333
H He, J Li, J Liu, J Zong, X Jiang, M Xiao, J Liu, Y Lyu
{"title":"Network toxicology and molecular docking elucidation of oligoasthenospermia induced by bisphenol analogues: a comparative study of BPA, BPS, BPF, and BPAF.","authors":"H He, J Li, J Liu, J Zong, X Jiang, M Xiao, J Liu, Y Lyu","doi":"10.1080/1062936X.2026.2659333","DOIUrl":"https://doi.org/10.1080/1062936X.2026.2659333","url":null,"abstract":"<p><p>Bisphenol A (BPA) substitutes are increasingly prevalent in consumer products, yet their potential to induce oligoasthenospermia (OAS) remains poorly understood. This study combined network toxicology, molecular docking, and 500-ns molecular dynamics (MD) simulations to systematically compare the toxicity mechanisms of BPA, BPS, BPF, and BPAF in the context of OAS. Intersection analysis identified 25 shared targets, with topological clustering highlighting ESR1, AR, CYP19A1, TNF, and IL6 as central hubs linking endocrine disruption with inflammatory pathways. Molecular docking revealed broad receptor engagement (-5.4 to -8.7 kcal/mol), with BPAF exhibiting the strongest affinities, particularly for ESR1 (-8.7 kcal/mol) and CYP19A1 (-8.4 kcal/mol). Molecular Dynamics (MD) simulations confirmed the dynamic stability of the ESR1-BPAF complex, demonstrating a compact fold (SASA ~124 nm<sup>2</sup>), minimal structural drift (ligand RMSD ~0.16 nm; protein RMSD ~0.20 nm), and persistent hydrogen bonding anchored by residues GLU353 and LEU428. These computational findings predict that BPA alternatives - especially BPAF - may significantly perturb steroidogenic and inflammatory axes to drive spermatogenic impairment. This study provides a predictive hazard prioritization framework that challenges the safety assumption of 'BPA-free' substitutes and warrants urgent experimental validation.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"1-15"},"PeriodicalIF":2.3,"publicationDate":"2026-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147723596","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Mitigating silent failures in toxicity prediction: a conformalized heteroscedastic Bayesian framework. 减轻毒性预测中的沉默失败:一个符合规范的异方差贝叶斯框架。
IF 2.3 3区 环境科学与生态学
SAR and QSAR in Environmental Research Pub Date : 2026-04-20 DOI: 10.1080/1062936X.2026.2658500
B Özlüer Başer
{"title":"Mitigating silent failures in toxicity prediction: a conformalized heteroscedastic Bayesian framework.","authors":"B Özlüer Başer","doi":"10.1080/1062936X.2026.2658500","DOIUrl":"https://doi.org/10.1080/1062936X.2026.2658500","url":null,"abstract":"<p><p>Accurate uncertainty quantification is a prerequisite for reliable toxicity assessments in drug discovery. Traditional QSAR models provide point estimates but fail to communicate prediction reliability, particularly for structurally complex compounds. We propose Conformalized Two-Stage Heteroscedastic BART (C2S-HBART), a novel framework addressing homoscedastic modelling assumptions. A first-stage BART model estimates the mean toxicity response; a second stage explicitly estimates local predictive variance from SMILES-derived structural descriptors. Split Conformal Prediction is then integrated to provide distribution-free validity guarantees. Evaluated on the Tox21 dataset, C2S-HBART achieves near-nominal coverage (0.952 vs. 0.880 for the uncalibrated baseline) and reduces the rate of silent failures - confident but incorrect predictions - from 12.02% to 1.8%. Compared to a standard conformalized baseline requiring wide intervals (Avg Width: 1.63), the heteroscedastic approach achieves equivalent safety with sharper predictions (Avg Width: 1.20), representing a 26% improvement in information efficiency. Variable importance analysis further reveals that molecular size and topological complexity are primary drivers of predictive uncertainty. C2S-HBART provides a statistically rigorous, transparent decision-support tool for preclinical screening, enabling toxicologists to prioritize safer compounds and flag structurally complex molecules for experimental validation.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"1-15"},"PeriodicalIF":2.3,"publicationDate":"2026-04-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147723624","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Prediction of acetylcholinesterase inhibition associated with Alzheimer's disease using hybrid descriptor and graph-based machine learning models. 使用混合描述符和基于图的机器学习模型预测与阿尔茨海默病相关的乙酰胆碱酯酶抑制
IF 2.3 3区 环境科学与生态学
SAR and QSAR in Environmental Research Pub Date : 2026-03-01 Epub Date: 2026-04-15 DOI: 10.1080/1062936X.2026.2647201
R U Laskar, B S Meena, A Roy, A Borah
{"title":"Prediction of acetylcholinesterase inhibition associated with Alzheimer's disease using hybrid descriptor and graph-based machine learning models.","authors":"R U Laskar, B S Meena, A Roy, A Borah","doi":"10.1080/1062936X.2026.2647201","DOIUrl":"10.1080/1062936X.2026.2647201","url":null,"abstract":"<p><p>Accurate prediction of acetylcholinesterase (AChE) inhibitory activity is important in drug discovery and environmental toxicology because AChE inhibition represents a key mechanism underlying neurotoxicity associated with pharmaceuticals and environmental contaminants. In this study, machine learning approaches were used to develop predictive models for AChE inhibitory activity using experimentally measured bioactivity data for small molecules targeting human AChE. A curated dataset containing 5795 molecules was compiled from BindingDB to support reliable model development. Fifteen predictive models were evaluated, including twelve individual machine learning and deep learning models and three hybrid fusion models, using multiple molecular representations such as physicochemical descriptors derived from RDKit and PaDEL and graph-based molecular structures. Among the individual models, tree-based ensemble methods demonstrated strong baseline performance, indicating that physicochemical descriptors capture important chemical features associated with AChE inhibition. Graph neural networks, particularly Graph Isomorphism Network effectively learn structural patterns related to inhibitory activity. To integrate complementary molecular information, a late-fusion hybrid framework combining descriptor-based predictions and graph-based representations was implemented using leakage-safe stacking with a Ridge regression meta-learner. Across ten independent train-test splits, the best-performing hybrid model integrating PaDEL-based XGBoost and GIN achieved <i>r</i><sup>2</sup> = 0.7400 ± 0.0138, demonstrating improved and stable predictive performance over individual models.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"223-240"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147692051","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Linear models, quantum molecular descriptors, and DSSC efficiency: an approach for evaluating potential new sensitizing dyes. 线性模型、量子分子描述符和DSSC效率:一种评估潜在的新增敏染料的方法。
IF 2.3 3区 环境科学与生态学
SAR and QSAR in Environmental Research Pub Date : 2026-03-01 Epub Date: 2026-04-27 DOI: 10.1080/1062936X.2026.2647200
E F S Mattos, I F Vieira, G S Mendonça, N B da Costa
{"title":"Linear models, quantum molecular descriptors, and DSSC efficiency: an approach for evaluating potential new sensitizing dyes.","authors":"E F S Mattos, I F Vieira, G S Mendonça, N B da Costa","doi":"10.1080/1062936X.2026.2647200","DOIUrl":"https://doi.org/10.1080/1062936X.2026.2647200","url":null,"abstract":"<p><p>The growing energy demand has accelerated the search for renewable energy sources, with dye-sensitized solar cells (DSSCs) emerging as promising candidates. To streamline the experimental process and reduce associated costs, we developed predictive models based on linear regression to predict the efficiency of DSSCs. These models, use quantum molecular descriptors (QMDs) derived from molecular electronics structure and excited states of the dyes. Our study focused on evaluating organic dyes based in imidazole, BODIPY, and squaraine for DSSC applications. The resulting linear models were simple, robust, and predictive, satisfying all standard validation metrics. Furthermore, each model's descriptors reveal key electronic/spectroscopic characteristics for efficiency enhancement, including: (i) increased molecular mass through branching, (ii) HOMO-LUMO gap control, and (iii) planar π-bridge optimization.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":"37 3","pages":"205-222"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147779803","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Integrated chemometric modelling of histamine H3 receptor for the identification of ligands from a natural product repository. 用于从天然产物库中识别配体的组胺H3受体的集成化学计量学建模。
IF 2.3 3区 环境科学与生态学
SAR and QSAR in Environmental Research Pub Date : 2026-03-01 Epub Date: 2026-04-27 DOI: 10.1080/1062936X.2026.2651284
N S Jandali, L A Samuor, K Tawaha, M A Khanfar
{"title":"Integrated chemometric modelling of histamine H<sub>3</sub> receptor for the identification of ligands from a natural product repository.","authors":"N S Jandali, L A Samuor, K Tawaha, M A Khanfar","doi":"10.1080/1062936X.2026.2651284","DOIUrl":"https://doi.org/10.1080/1062936X.2026.2651284","url":null,"abstract":"<p><p>The histamine H<sub>3</sub> receptor (H<sub>3</sub>R) is a GPCR that regulates the release of multiple neurotransmitters and has emerged as an attractive target for CNS disorders. An integrated computational workflow was applied to identify H<sub>3</sub>R ligands from natural products by combining pharmacophore modelling, QSAR, docking and molecular dynamics (MD) algorithms. Structure-based and ligand-based pharmacophore models were developed and validated using ROC analysis against a set of active H<sub>3</sub>R inhibitors and DUD-E decoys. The resulting models (PHARM-1 and PHARM-2) achieved AUC values of 0.716 and 0.959, respectively, supporting their ability to discriminate active from inactive compounds. Both models were used to screen AnalytiCon Discovery database of natural products, yielding 23 hits that were subsequently prioritized by docking and QSAR model generated using genetic function approximation. Six candidates docked successfully and preserved the expected anchoring interaction with the conserved Asp114; four compounds showed favourable QSAR-predicted affinities (pKi ≈ 5.01-7.06) and high consensus docking scores. Finally, MD simulations of H<sub>3</sub>R complexed with the two promising hits, physostigmine and catharanthine, indicate stable complexes, and receptor fluctuations reflecting ligand-dependent conformational sampling. These results highlight several natural products as promising H<sub>3</sub>R ligand candidates and provide a practical multi-filter pipeline for prioritizing hits for experimental validation.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":"37 3","pages":"259-282"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147779816","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Optima: a GUI-based toolkit for developing and validating interpretable machine learning based supervised classification QSAR models. Optima:一个基于gui的工具包,用于开发和验证基于监督分类QSAR模型的可解释机器学习。
IF 2.3 3区 环境科学与生态学
SAR and QSAR in Environmental Research Pub Date : 2026-03-01 Epub Date: 2026-04-16 DOI: 10.1080/1062936X.2026.2649822
I Dasgupta, R Roy, S Gayen
{"title":"Optima: a GUI-based toolkit for developing and validating interpretable machine learning based supervised classification QSAR models.","authors":"I Dasgupta, R Roy, S Gayen","doi":"10.1080/1062936X.2026.2649822","DOIUrl":"10.1080/1062936X.2026.2649822","url":null,"abstract":"<p><p>The rapid advancement of ML algorithms and the increasing availability of large datasets have significantly transformed the landscape of predictive modelling in scientific research. In this context, we introduce Optima (OPTimized Interpretable Model Building & Analysis Toolkit), a user-friendly, Python-based GUI designed to simplify and accelerate the development of interpretable ML-based classification QSAR/QSPR/QSTR models. This toolkit offers an intuitive graphical user interface (GUI), enabling users with domain knowledge but limited coding experience to efficiently optimize, construct, and interpret various ML-based classification models. A most highlighting feature of this GUI is its fully customizable settings panel, allowing users to modify colour schemes, font sizes, axis labels, and plot dimensions to suit publication or presentation needs. By combining robust optimization with an explainable approach, the Optima toolkit improves classification QSAR model performance while ensuring transparency and reproducibility. This platform addresses a critical need by providing an intuitive GUI for rational dataset splitting, efficient feature selection, and the development of seven different ML-based classification QSAR models, covering the entire workflow from optimization to interpretation. The toolkit is presently available for Windows and can be downloaded from the provided link (https://github.com/Rahul-Roy-21/OPTIMA).</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"241-257"},"PeriodicalIF":2.3,"publicationDate":"2026-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147692100","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Antiepileptic potential of Jatropha integerrima Jacq. extracts: an exploratory study integrating in vivo seizure models and computational analysis. 麻疯树的抗癫痫作用。摘要:一项结合体内癫痫模型和计算分析的探索性研究。
IF 2.3 3区 环境科学与生态学
SAR and QSAR in Environmental Research Pub Date : 2026-02-01 Epub Date: 2026-04-07 DOI: 10.1080/1062936X.2026.2640387
S Fatima, F Nasir, A Ahmed
{"title":"Antiepileptic potential of <i>Jatropha integerrima</i> Jacq. extracts: an exploratory study integrating in vivo seizure models and computational analysis.","authors":"S Fatima, F Nasir, A Ahmed","doi":"10.1080/1062936X.2026.2640387","DOIUrl":"10.1080/1062936X.2026.2640387","url":null,"abstract":"<p><p>Epilepsy is a chronic neurological disorder characterized by recurrent seizures resulting from abnormal neuronal excitability and ion channel dysfunction. This study explored the antiepileptic potential of <i>Jatropha integerrima</i> using integrated in vivo and in silico approaches to identify safer alternatives to conventional therapies. Ethanolic extracts of the plant were fractionated into petroleum ether (PE) and ethyl acetate (EA) fractions and analyzed via LC-MS for tentative phytoconstituent identification. Anticonvulsant activity of both fractions was evaluated in Swiss albino mice using pentylenetetrazol (PTZ) and maximal electroshock seizure (MES) models. The PE fraction significantly reduced seizure frequency, duration, and mortality (<i>p</i> < 0.05), demonstrating effects comparable to diazepam. Molecular docking and MM/GBSA analyses revealed strong binding affinities of major compounds toward GABAA and Nav1.2 receptors. Notably, (-)-jatrointelignan A (C₃₁H₃₆O₁₁) exhibited the highest docking scores (-10.20 kcal/mol for GABAA and -11.47 kcal/mol for Nav1.2) and favorable binding free energies (-25.38 and -70.63 kcal/mol, respectively). ADME and toxicity predictions suggested good pharmacokinetic properties with low toxicity, while molecular dynamics simulations confirmed stable receptor ligand interactions. Overall, <i>J. integerrima</i> and its lead compound demonstrate promising antiepileptic potential via dual modulation of GABAergic and sodium channel pathways.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"131-165"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147628324","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
QSAR-based design of antitubercular pyrrolidine carboxamides. 基于qsar的抗结核吡咯烷胺类药物设计。
IF 2.3 3区 环境科学与生态学
SAR and QSAR in Environmental Research Pub Date : 2026-02-01 Epub Date: 2026-03-27 DOI: 10.1080/1062936X.2026.2637963
J G Hernández-Lima, I Mercado-Sánchez, R Chávez-Rocha, I Vargas-Rodríguez, M A Vázquez, A Bazán-Jiménez, M Del Refugio Cuevas-Flores, J Robles, M A García-Revilla
{"title":"QSAR-based design of antitubercular pyrrolidine carboxamides.","authors":"J G Hernández-Lima, I Mercado-Sánchez, R Chávez-Rocha, I Vargas-Rodríguez, M A Vázquez, A Bazán-Jiménez, M Del Refugio Cuevas-Flores, J Robles, M A García-Revilla","doi":"10.1080/1062936X.2026.2637963","DOIUrl":"10.1080/1062936X.2026.2637963","url":null,"abstract":"<p><p>Tuberculosis affects 10 million people around the world, for this reason, the development of new drugs against <i>M. tuberculosis (</i>Mtb) is an urgent issue. Enoyl ACP reductase (InhA) is a relevant biological target because it's the role in the synthesis of mycolic acid, the building block of the cell wall. Here, we propose new anti-tuberculosis candidates based on a family of tested InhA pyrrolidine-carboxamides. The strategy starts building QSAR models using, for the first time, a set of topological-descriptors from QTAIM (LDMtrace, Ndeloc, Ntotal, and Max_eigenv), hydrogen bond energy (HB) of Docking simulations, and standard QSAR molecular-descriptors (MATS4m, VE1_Dzp, and RDF70m). The model with the best performance displays robust external and internal validation parameters. Finally, based on the most reliable QSAR model, a set of new molecules were proposed; six of them (p69, p71, p73, p75, p77, and p78) show an efficient calculated IC<sub>50</sub> and adequate ADMET properties, suggesting an enhanced biological activity. Molecular Dynamics and MM/PBSA results show that p71 is the most reliable candidate for the inhibition of InhA, displaying a stable complex p71-InhA along 200 ns of simulation and a negative free binding energy, ∆G<sub>binding</sub> < 0.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"105-130"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147521928","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
Systematic evaluation of data preprocessing and model selection strategies for reliable pIC50 prediction of acetylcholinesterase inhibitors. 对乙酰胆碱酯酶抑制剂pIC50可靠预测的数据预处理和模型选择策略进行系统评估。
IF 2.3 3区 环境科学与生态学
SAR and QSAR in Environmental Research Pub Date : 2026-02-01 Epub Date: 2026-04-10 DOI: 10.1080/1062936X.2026.2647204
E Delibaş, H I Güler
{"title":"Systematic evaluation of data preprocessing and model selection strategies for reliable pIC<sub>50</sub> prediction of acetylcholinesterase inhibitors.","authors":"E Delibaş, H I Güler","doi":"10.1080/1062936X.2026.2647204","DOIUrl":"10.1080/1062936X.2026.2647204","url":null,"abstract":"<p><p>Predicting acetylcholinesterase (AChE) inhibitory activity is important in drug discovery. This study evaluates molecular descriptor - based machine learning models to predict AChE activity as pIC<sub>50</sub> values. The primary objective was to comparatively investigate the impact of different data preprocessing strategies on prediction performance and model selection under challenging chemical datasets exhibiting low correlation structures. Tree based gradient boosting algorithms, namely CatBoost and XGBoost, together with sensitive regression models including Support Vector Regression and Multilayer Perceptron, were examined, and model specific data preparation pipelines were applied according to their structural assumptions. The target variable was stabilized through logarithmic transformation and winsorization of IC<sub>50</sub> values. Model performance was assessed using both a 70-15-15 train-validation-test split and a 10-fold cross validation protocol. Furthermore, stacking based ensemble learning strategies were explored to enhance generalization capability. The results demonstrate that predictive performance is predominantly constrained by intrinsic dataset characteristics rather than algorithmic selection. Optimized tree-based models achieved the highest accuracy, while stacking provided only marginal improvements over the best individual learners. To improve interpretability, SHAP based explainable artificial intelligence analysis was conducted, highlighting the contributions of biologically meaningful molecular descriptors, and offers guidance for future studies addressing comparable biochemical modelling challenges.</p>","PeriodicalId":21446,"journal":{"name":"SAR and QSAR in Environmental Research","volume":" ","pages":"185-204"},"PeriodicalIF":2.3,"publicationDate":"2026-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":null,"resultStr":null,"platform":"Semanticscholar","paperid":"147646362","PeriodicalName":null,"FirstCategoryId":null,"ListUrlMain":null,"RegionNum":3,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":"","EPubDate":null,"PubModel":null,"JCR":null,"JCRName":null,"Score":null,"Total":0}
引用次数: 0
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
相关产品
×
本文献相关产品
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书