{"title":"Integrating ensemble machine-learning and fibril docking to discover potent, novel triazole-naphthalene tau-aggregation inhibitors.","authors":"Poulami Saha, Anuja Chouhan","doi":"10.1007/s11030-026-11507-4","DOIUrl":null,"url":null,"abstract":"<p><p>Tau-protein aggregation is a central pathological feature of Alzheimer's disease, so blocking fibril growth is an attractive therapeutic goal. We curated a high-quality set of 289 literature IC<sub>50 </sub> measurements for human-tau aggregation and trained a stacked-ensemble QSAR model (SVR + RF + XGB) that achieves fivefold CV Q<sup>2</sup> = 0.63, external R<sup>2</sup> = 0.57 and RMSE = 0.73 log-units. Applicability-domain analysis revealed no high-influence outliers in the calibration set, and a 5-nearest-neighbour density test confirmed that each of sixteen previously unreported 1,2,4-triazole-naphthalene derivatives (TND, TND-1…TND-15) lies in locally populated chemical space, albeit at the edge of the global domain. The model predicts pIC<sub>50 </sub> = 6.75-7.53 (IC<sub>50 </sub> ≈ 30-177 nM), nominating TND-9, TND-15 and TND-5 as top-ranked candidates based on predicted potency. Nearly all TNDs fall within the BBB window (MW ≈ 350-450 Da, TPSA < 90 Å<sup>2</sup>); most obey cLogP ≤ 5, and the few slightly above still map to the BOILED-Egg CNS-positive zone. Retrospective docking against phosphorylated-tau fibrils (PDB ID 6HRF) highlighted TND, TND-5 and TND-14 with sub-micromolar predicted affinity, forming key contacts in the microtubule-binding cleft. These docking results support binding plausibility rather than quantitative aggregation inhibition. TND-8, although highly ranked by docking, was deprioritised owing to low predicted GI absorption. Physicochemical and CNS-oriented ADMET filters further support developability of the top leads. The integrated workflow-combining rigorously validated QSAR, structure-based docking on the 6HRF polymorph and developability profiling-provides an open-source blueprint for tau-aggregation inhibitor discovery. Consensus ranking prioritises TND-5 for immediate in-silico follow-up, with TND, TND-14, TND-9 and TND-15 as secondary leads.</p>","PeriodicalId":708,"journal":{"name":"Molecular Diversity","volume":" ","pages":"2853-2863"},"PeriodicalIF":3.8000,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Diversity","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s11030-026-11507-4","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/3/4 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0
Abstract
Tau-protein aggregation is a central pathological feature of Alzheimer's disease, so blocking fibril growth is an attractive therapeutic goal. We curated a high-quality set of 289 literature IC50 measurements for human-tau aggregation and trained a stacked-ensemble QSAR model (SVR + RF + XGB) that achieves fivefold CV Q2 = 0.63, external R2 = 0.57 and RMSE = 0.73 log-units. Applicability-domain analysis revealed no high-influence outliers in the calibration set, and a 5-nearest-neighbour density test confirmed that each of sixteen previously unreported 1,2,4-triazole-naphthalene derivatives (TND, TND-1…TND-15) lies in locally populated chemical space, albeit at the edge of the global domain. The model predicts pIC50 = 6.75-7.53 (IC50 ≈ 30-177 nM), nominating TND-9, TND-15 and TND-5 as top-ranked candidates based on predicted potency. Nearly all TNDs fall within the BBB window (MW ≈ 350-450 Da, TPSA < 90 Å2); most obey cLogP ≤ 5, and the few slightly above still map to the BOILED-Egg CNS-positive zone. Retrospective docking against phosphorylated-tau fibrils (PDB ID 6HRF) highlighted TND, TND-5 and TND-14 with sub-micromolar predicted affinity, forming key contacts in the microtubule-binding cleft. These docking results support binding plausibility rather than quantitative aggregation inhibition. TND-8, although highly ranked by docking, was deprioritised owing to low predicted GI absorption. Physicochemical and CNS-oriented ADMET filters further support developability of the top leads. The integrated workflow-combining rigorously validated QSAR, structure-based docking on the 6HRF polymorph and developability profiling-provides an open-source blueprint for tau-aggregation inhibitor discovery. Consensus ranking prioritises TND-5 for immediate in-silico follow-up, with TND, TND-14, TND-9 and TND-15 as secondary leads.
期刊介绍:
Molecular Diversity is a new publication forum for the rapid publication of refereed papers dedicated to describing the development, application and theory of molecular diversity and combinatorial chemistry in basic and applied research and drug discovery. The journal publishes both short and full papers, perspectives, news and reviews dealing with all aspects of the generation of molecular diversity, application of diversity for screening against alternative targets of all types (biological, biophysical, technological), analysis of results obtained and their application in various scientific disciplines/approaches including:
combinatorial chemistry and parallel synthesis;
small molecule libraries;
microwave synthesis;
flow synthesis;
fluorous synthesis;
diversity oriented synthesis (DOS);
nanoreactors;
click chemistry;
multiplex technologies;
fragment- and ligand-based design;
structure/function/SAR;
computational chemistry and molecular design;
chemoinformatics;
screening techniques and screening interfaces;
analytical and purification methods;
robotics, automation and miniaturization;
targeted libraries;
display libraries;
peptides and peptoids;
proteins;
oligonucleotides;
carbohydrates;
natural diversity;
new methods of library formulation and deconvolution;
directed evolution, origin of life and recombination;
search techniques, landscapes, random chemistry and more;