Molecular basis of BACE1 modulation revealed by machine learning, molecular simulations, and experimental validation.

IF 8.5 1区 化学 Q1 BIOCHEMISTRY & MOLECULAR BIOLOGY
Muhammad Shahab, Jiazhuo Xiao, Jiaojiao Wang, Zunnan Huang
{"title":"Molecular basis of BACE1 modulation revealed by machine learning, molecular simulations, and experimental validation.","authors":"Muhammad Shahab, Jiazhuo Xiao, Jiaojiao Wang, Zunnan Huang","doi":"10.1016/j.ijbiomac.2026.152409","DOIUrl":null,"url":null,"abstract":"<p><p>Alzheimer's disease (AD) remains one of the most prevalent and debilitating neurodegenerative disorders worldwide, with no currently available disease-modifying treatments. β-site amyloid precursor protein cleaving enzyme 1 (BACE1) catalyzes the rate-limiting step in amyloid-β (Aβ) production and represents a validated therapeutic target for AD intervention. In this study, we developed an integrated computational framework combining machine learning-based virtual screening, molecular docking, and molecular dynamics simulations with experimental validation using CCK-8 assays and Western blot analysis to identify novel BACE1 inhibitors from a natural product library. A curated dataset of experimentally validated BACE1 inhibitors retrieved from the ChEMBL database was used to construct 36 classification models based on three molecular fingerprint representations MACCS Keys, ECFP4, and Topological Torsion in combination with four machine learning algorithms such as Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), and Extreme Gradient Boosting (XGBoost). Among all developed models, the SVM model using ECFP4 fingerprints achieved the best predictive performance, with an external test set accuracy of 0.91 and a Matthews Correlation Coefficient (MCC) of 0.78. The optimized models were subsequently applied to screen 4779 natural product compounds from the MedChemExpress library using a consensus prediction strategy. Promising hits were evaluated by molecular docking against the BACE1 crystal structure (PDB: 6PZ4), and top-ranked candidates were subjected to 200 ns molecular dynamics simulations followed by MM/GBSA binding free energy calculations. Among the identified candidates, HY-N7141 exhibited the most favorable docking score (-9.04 kcal/mol) and binding free energy (ΔG = -67.30 kcal/mol), driven predominantly by strong van der Waals interactions with key catalytic residues. Structural stability analysis confirmed that the majority of protein-ligand complexes maintained stable conformations throughout the simulations. Experimental validation in SH-SY5Y human neuroblastoma cells further assessed the effects of selected compounds on BACE1 protein expression. Collectively, these findings demonstrate the utility of integrating machine learning with structure-based approaches for accelerating the discovery of potent BACE1 inhibitors, and present several promising candidates warranting further preclinical investigation.</p>","PeriodicalId":333,"journal":{"name":"International Journal of Biological Macromolecules","volume":" ","pages":"152409"},"PeriodicalIF":8.5000,"publicationDate":"2026-05-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Biological Macromolecules","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1016/j.ijbiomac.2026.152409","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Alzheimer's disease (AD) remains one of the most prevalent and debilitating neurodegenerative disorders worldwide, with no currently available disease-modifying treatments. β-site amyloid precursor protein cleaving enzyme 1 (BACE1) catalyzes the rate-limiting step in amyloid-β (Aβ) production and represents a validated therapeutic target for AD intervention. In this study, we developed an integrated computational framework combining machine learning-based virtual screening, molecular docking, and molecular dynamics simulations with experimental validation using CCK-8 assays and Western blot analysis to identify novel BACE1 inhibitors from a natural product library. A curated dataset of experimentally validated BACE1 inhibitors retrieved from the ChEMBL database was used to construct 36 classification models based on three molecular fingerprint representations MACCS Keys, ECFP4, and Topological Torsion in combination with four machine learning algorithms such as Support Vector Machine (SVM), Random Forest (RF), Decision Tree (DT), and Extreme Gradient Boosting (XGBoost). Among all developed models, the SVM model using ECFP4 fingerprints achieved the best predictive performance, with an external test set accuracy of 0.91 and a Matthews Correlation Coefficient (MCC) of 0.78. The optimized models were subsequently applied to screen 4779 natural product compounds from the MedChemExpress library using a consensus prediction strategy. Promising hits were evaluated by molecular docking against the BACE1 crystal structure (PDB: 6PZ4), and top-ranked candidates were subjected to 200 ns molecular dynamics simulations followed by MM/GBSA binding free energy calculations. Among the identified candidates, HY-N7141 exhibited the most favorable docking score (-9.04 kcal/mol) and binding free energy (ΔG = -67.30 kcal/mol), driven predominantly by strong van der Waals interactions with key catalytic residues. Structural stability analysis confirmed that the majority of protein-ligand complexes maintained stable conformations throughout the simulations. Experimental validation in SH-SY5Y human neuroblastoma cells further assessed the effects of selected compounds on BACE1 protein expression. Collectively, these findings demonstrate the utility of integrating machine learning with structure-based approaches for accelerating the discovery of potent BACE1 inhibitors, and present several promising candidates warranting further preclinical investigation.

通过机器学习、分子模拟和实验验证揭示BACE1调制的分子基础。
阿尔茨海默病(AD)仍然是世界上最普遍和最令人衰弱的神经退行性疾病之一,目前没有可用的疾病改善治疗方法。β位点淀粉样蛋白前体蛋白切割酶1 (BACE1)催化淀粉样蛋白-β (a β)产生的限速步骤,是AD干预的有效治疗靶点。在这项研究中,我们开发了一个集成的计算框架,将基于机器学习的虚拟筛选、分子对接和分子动力学模拟与CCK-8测定和Western blot分析的实验验证相结合,从天然产物库中鉴定出新的BACE1抑制剂。利用从ChEMBL数据库中检索到的经过实验验证的BACE1抑制剂的精选数据集,基于三种分子指纹表示MACCS Keys、ECFP4和Topological Torsion,结合支持向量机(SVM)、随机森林(RF)、决策树(DT)和极端梯度增强(XGBoost)等四种机器学习算法,构建了36个分类模型。在所有已开发的模型中,使用ECFP4指纹的SVM模型的预测性能最好,其外部测试集准确率为0.91,Matthews相关系数(MCC)为0.78。优化后的模型随后使用共识预测策略从MedChemExpress文库中筛选4779种天然产物化合物。通过对BACE1晶体结构(PDB: 6PZ4)的分子对接来评估有希望的候选分子,并对排名靠前的候选分子进行200 ns的分子动力学模拟,然后计算MM/GBSA结合自由能。在确定的候选化合物中,HY-N7141表现出最有利的对接分数(-9.04 kcal/mol)和结合自由能(ΔG = -67.30 kcal/mol),主要是由与关键催化残基的强范德华相互作用驱动的。结构稳定性分析证实,大多数蛋白质配体复合物在整个模拟过程中保持稳定的构象。SH-SY5Y人神经母细胞瘤细胞的实验验证进一步评估了所选化合物对BACE1蛋白表达的影响。总的来说,这些发现证明了将机器学习与基于结构的方法相结合的效用,可以加速发现有效的BACE1抑制剂,并提出了几个有希望的候选药物,值得进一步的临床前研究。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Biological Macromolecules
International Journal of Biological Macromolecules 生物-生化与分子生物学
CiteScore
13.70
自引率
9.80%
发文量
2728
审稿时长
64 days
期刊介绍: The International Journal of Biological Macromolecules is a well-established international journal dedicated to research on the chemical and biological aspects of natural macromolecules. Focusing on proteins, macromolecular carbohydrates, glycoproteins, proteoglycans, lignins, biological poly-acids, and nucleic acids, the journal presents the latest findings in molecular structure, properties, biological activities, interactions, modifications, and functional properties. Papers must offer new and novel insights, encompassing related model systems, structural conformational studies, theoretical developments, and analytical techniques. Each paper is required to primarily focus on at least one named biological macromolecule, reflected in the title, abstract, and text.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书