Integrating ensemble machine-learning and fibril docking to discover potent, novel triazole-naphthalene tau-aggregation inhibitors.

IF 3.8 2区 化学 Q2 CHEMISTRY, APPLIED
Molecular Diversity Pub Date : 2026-04-01 Epub Date: 2026-03-04 DOI:10.1007/s11030-026-11507-4
Poulami Saha, Anuja Chouhan
{"title":"Integrating ensemble machine-learning and fibril docking to discover potent, novel triazole-naphthalene tau-aggregation inhibitors.","authors":"Poulami Saha, Anuja Chouhan","doi":"10.1007/s11030-026-11507-4","DOIUrl":null,"url":null,"abstract":"<p><p>Tau-protein aggregation is a central pathological feature of Alzheimer's disease, so blocking fibril growth is an attractive therapeutic goal. We curated a high-quality set of 289 literature IC<sub>50 </sub> measurements for human-tau aggregation and trained a stacked-ensemble QSAR model (SVR + RF + XGB) that achieves fivefold CV Q<sup>2</sup> = 0.63, external R<sup>2</sup> = 0.57 and RMSE = 0.73 log-units. Applicability-domain analysis revealed no high-influence outliers in the calibration set, and a 5-nearest-neighbour density test confirmed that each of sixteen previously unreported 1,2,4-triazole-naphthalene derivatives (TND, TND-1…TND-15) lies in locally populated chemical space, albeit at the edge of the global domain. The model predicts pIC<sub>50 </sub> = 6.75-7.53 (IC<sub>50 </sub> ≈ 30-177 nM), nominating TND-9, TND-15 and TND-5 as top-ranked candidates based on predicted potency. Nearly all TNDs fall within the BBB window (MW ≈ 350-450 Da, TPSA < 90 Å<sup>2</sup>); most obey cLogP ≤ 5, and the few slightly above still map to the BOILED-Egg CNS-positive zone. Retrospective docking against phosphorylated-tau fibrils (PDB ID 6HRF) highlighted TND, TND-5 and TND-14 with sub-micromolar predicted affinity, forming key contacts in the microtubule-binding cleft. These docking results support binding plausibility rather than quantitative aggregation inhibition. TND-8, although highly ranked by docking, was deprioritised owing to low predicted GI absorption. Physicochemical and CNS-oriented ADMET filters further support developability of the top leads. The integrated workflow-combining rigorously validated QSAR, structure-based docking on the 6HRF polymorph and developability profiling-provides an open-source blueprint for tau-aggregation inhibitor discovery. Consensus ranking prioritises TND-5 for immediate in-silico follow-up, with TND, TND-14, TND-9 and TND-15 as secondary leads.</p>","PeriodicalId":708,"journal":{"name":"Molecular Diversity","volume":" ","pages":"2853-2863"},"PeriodicalIF":3.8000,"publicationDate":"2026-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Diversity","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s11030-026-11507-4","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2026/3/4 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0

Abstract

Tau-protein aggregation is a central pathological feature of Alzheimer's disease, so blocking fibril growth is an attractive therapeutic goal. We curated a high-quality set of 289 literature IC50  measurements for human-tau aggregation and trained a stacked-ensemble QSAR model (SVR + RF + XGB) that achieves fivefold CV Q2 = 0.63, external R2 = 0.57 and RMSE = 0.73 log-units. Applicability-domain analysis revealed no high-influence outliers in the calibration set, and a 5-nearest-neighbour density test confirmed that each of sixteen previously unreported 1,2,4-triazole-naphthalene derivatives (TND, TND-1…TND-15) lies in locally populated chemical space, albeit at the edge of the global domain. The model predicts pIC50  = 6.75-7.53 (IC50  ≈ 30-177 nM), nominating TND-9, TND-15 and TND-5 as top-ranked candidates based on predicted potency. Nearly all TNDs fall within the BBB window (MW ≈ 350-450 Da, TPSA < 90 Å2); most obey cLogP ≤ 5, and the few slightly above still map to the BOILED-Egg CNS-positive zone. Retrospective docking against phosphorylated-tau fibrils (PDB ID 6HRF) highlighted TND, TND-5 and TND-14 with sub-micromolar predicted affinity, forming key contacts in the microtubule-binding cleft. These docking results support binding plausibility rather than quantitative aggregation inhibition. TND-8, although highly ranked by docking, was deprioritised owing to low predicted GI absorption. Physicochemical and CNS-oriented ADMET filters further support developability of the top leads. The integrated workflow-combining rigorously validated QSAR, structure-based docking on the 6HRF polymorph and developability profiling-provides an open-source blueprint for tau-aggregation inhibitor discovery. Consensus ranking prioritises TND-5 for immediate in-silico follow-up, with TND, TND-14, TND-9 and TND-15 as secondary leads.

集成集成机器学习和纤维对接,以发现有效的新型三唑-萘tau聚集抑制剂。
tau蛋白聚集是阿尔茨海默病的核心病理特征,因此阻断原纤维生长是一个有吸引力的治疗目标。我们整理了一组高质量的289篇关于人类tau聚集的文献IC50测量数据,并训练了一个堆叠集成QSAR模型(SVR + RF + XGB),该模型实现了5倍CV Q2 = 0.63,外部R2 = 0.57, RMSE = 0.73对数单位。适用性域分析显示,校准集中没有高影响异常值,5近邻密度测试证实,16种以前未报道的1,2,4-三唑-萘衍生物(TND, TND-1…TND-15)中的每一种都位于当地人口稠密的化学空间,尽管处于全球域的边缘。该模型预测pIC50 = 6.75-7.53 (IC50≈30-177 nM),根据预测效力,将TND-9、TND-15和TND-5列为首选候选药物。几乎所有TNDs都落在BBB窗口内(MW≈350-450 Da, TPSA 2);大多数符合cLogP≤5,少数略高于5的仍映射到煮蛋cns阳性区。与磷酸化tau原纤维(PDB ID 6HRF)的回顾性对接显示,TND、TND-5和TND-14具有亚微摩尔预测的亲和力,在微管结合间隙中形成关键接触。这些对接结果支持结合的合理性,而不是定量聚集抑制。TND-8虽然在对接中排名很高,但由于预测GI吸收较低,因此排名较低。物理化学和面向cns的ADMET过滤器进一步支持顶部引线的可发展性。集成的工作流程——结合严格验证的QSAR、基于6HRF多态性的结构对接和可发育性分析——为发现tau聚集抑制剂提供了一个开源的蓝图。共识排名优先考虑TND-5进行即时计算机随访,TND, TND-14, TND-9和TND-15作为次要线索。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Molecular Diversity
Molecular Diversity 化学-化学综合
CiteScore
7.30
自引率
7.90%
发文量
219
审稿时长
2.7 months
期刊介绍: Molecular Diversity is a new publication forum for the rapid publication of refereed papers dedicated to describing the development, application and theory of molecular diversity and combinatorial chemistry in basic and applied research and drug discovery. The journal publishes both short and full papers, perspectives, news and reviews dealing with all aspects of the generation of molecular diversity, application of diversity for screening against alternative targets of all types (biological, biophysical, technological), analysis of results obtained and their application in various scientific disciplines/approaches including: combinatorial chemistry and parallel synthesis; small molecule libraries; microwave synthesis; flow synthesis; fluorous synthesis; diversity oriented synthesis (DOS); nanoreactors; click chemistry; multiplex technologies; fragment- and ligand-based design; structure/function/SAR; computational chemistry and molecular design; chemoinformatics; screening techniques and screening interfaces; analytical and purification methods; robotics, automation and miniaturization; targeted libraries; display libraries; peptides and peptoids; proteins; oligonucleotides; carbohydrates; natural diversity; new methods of library formulation and deconvolution; directed evolution, origin of life and recombination; search techniques, landscapes, random chemistry and more;
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书