使用多步机器学习辅助混合虚拟筛选方法发现针对非小细胞肺癌的新型PDGFR抑制剂

IF 4.6 3区 化学 Q2 CHEMISTRY, MULTIDISCIPLINARY
RSC Advances Pub Date : 2025-01-10 DOI:10.1039/D4RA06975G
Sandhi Kranthi Reddy, S. V. G. Reddy and Syed Hussain Basha
{"title":"使用多步机器学习辅助混合虚拟筛选方法发现针对非小细胞肺癌的新型PDGFR抑制剂","authors":"Sandhi Kranthi Reddy, S. V. G. Reddy and Syed Hussain Basha","doi":"10.1039/D4RA06975G","DOIUrl":null,"url":null,"abstract":"<p >Non-Small Cell Lung Cancer (NSCLC) is a formidable global health challenge, responsible for the majority of cancer-related deaths worldwide. The Platelet-Derived Growth Factor Receptor (PDGFR) has emerged as a promising therapeutic target in NSCLC, given its crucial involvement in cell growth, proliferation, angiogenesis, and tumor progression. Among PDGFR inhibitors, avapritinib has garnered attention due to its selective activity against mutant forms of PDGFR, particularly PDGFRA D842V and KIT exon 17 D816V, linked to resistance against conventional tyrosine kinase inhibitors. In recent years, Machine Learning has emerged as a powerful tool in pharmaceutical research, offering data-driven insights and accelerating lead identification for drug discovery. In this research article, we focus on the application of Machine Learning, alongside the RDKit toolkit, to identify potential anti-cancer drug candidates targeting PDGFR in NSCLC. Our study demonstrates how smart algorithms efficiently narrow down large screening collections to target-specific sets of just a few hundred small molecules, streamlining the hit discovery process. Employing a Machine Learning-assisted virtual screening strategy, we successfully preselected 220 compounds with potential PDGFRA inhibitory activity from a vast library of 1.048 million compounds, representing a mere 0.013% of the original library. To validate these candidates, we employed traditional genetic algorithm-based virtual screening and docking methods. Remarkably, we found that ZINC000002931631 exhibited comparable or even superior inhibitory potential against PDGFRA compared to Avapritinib, which highlights the value of our Machine Learning approach. Moreover, as part of our lead validation studies, we conducted molecular dynamic simulations, revealing critical molecular–level interactions responsible for the conformational changes in PDGFRA necessary for substrate binding. Our study exemplifies the potential of Machine Learning in the drug discovery process, providing a more efficient and cost-effective means of identifying promising drug candidates for NSCLC treatment. The success of this approach in preselecting compounds with potent PDGFRA inhibitory potential highlights its significance in advancing personalized and targeted therapies for cancer treatment.</p>","PeriodicalId":102,"journal":{"name":"RSC Advances","volume":" 2","pages":" 851-869"},"PeriodicalIF":4.6000,"publicationDate":"2025-01-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://pubs.rsc.org/en/content/articlepdf/2025/ra/d4ra06975g?page=search","citationCount":"0","resultStr":"{\"title\":\"Discovery of novel PDGFR inhibitors targeting non-small cell lung cancer using a multistep machine learning assisted hybrid virtual screening approach\",\"authors\":\"Sandhi Kranthi Reddy, S. V. G. Reddy and Syed Hussain Basha\",\"doi\":\"10.1039/D4RA06975G\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p >Non-Small Cell Lung Cancer (NSCLC) is a formidable global health challenge, responsible for the majority of cancer-related deaths worldwide. The Platelet-Derived Growth Factor Receptor (PDGFR) has emerged as a promising therapeutic target in NSCLC, given its crucial involvement in cell growth, proliferation, angiogenesis, and tumor progression. Among PDGFR inhibitors, avapritinib has garnered attention due to its selective activity against mutant forms of PDGFR, particularly PDGFRA D842V and KIT exon 17 D816V, linked to resistance against conventional tyrosine kinase inhibitors. In recent years, Machine Learning has emerged as a powerful tool in pharmaceutical research, offering data-driven insights and accelerating lead identification for drug discovery. In this research article, we focus on the application of Machine Learning, alongside the RDKit toolkit, to identify potential anti-cancer drug candidates targeting PDGFR in NSCLC. Our study demonstrates how smart algorithms efficiently narrow down large screening collections to target-specific sets of just a few hundred small molecules, streamlining the hit discovery process. Employing a Machine Learning-assisted virtual screening strategy, we successfully preselected 220 compounds with potential PDGFRA inhibitory activity from a vast library of 1.048 million compounds, representing a mere 0.013% of the original library. To validate these candidates, we employed traditional genetic algorithm-based virtual screening and docking methods. Remarkably, we found that ZINC000002931631 exhibited comparable or even superior inhibitory potential against PDGFRA compared to Avapritinib, which highlights the value of our Machine Learning approach. Moreover, as part of our lead validation studies, we conducted molecular dynamic simulations, revealing critical molecular–level interactions responsible for the conformational changes in PDGFRA necessary for substrate binding. Our study exemplifies the potential of Machine Learning in the drug discovery process, providing a more efficient and cost-effective means of identifying promising drug candidates for NSCLC treatment. The success of this approach in preselecting compounds with potent PDGFRA inhibitory potential highlights its significance in advancing personalized and targeted therapies for cancer treatment.</p>\",\"PeriodicalId\":102,\"journal\":{\"name\":\"RSC Advances\",\"volume\":\" 2\",\"pages\":\" 851-869\"},\"PeriodicalIF\":4.6000,\"publicationDate\":\"2025-01-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://pubs.rsc.org/en/content/articlepdf/2025/ra/d4ra06975g?page=search\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"RSC Advances\",\"FirstCategoryId\":\"92\",\"ListUrlMain\":\"https://pubs.rsc.org/en/content/articlelanding/2025/ra/d4ra06975g\",\"RegionNum\":3,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"RSC Advances","FirstCategoryId":"92","ListUrlMain":"https://pubs.rsc.org/en/content/articlelanding/2025/ra/d4ra06975g","RegionNum":3,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

摘要

非小细胞肺癌(NSCLC)是一个巨大的全球健康挑战,负责全球大多数癌症相关死亡。鉴于血小板衍生生长因子受体(PDGFR)在细胞生长、增殖、血管生成和肿瘤进展中的重要作用,PDGFR已成为NSCLC的一个有希望的治疗靶点。在PDGFR抑制剂中,avapritinib因其对PDGFR突变型的选择性活性而受到关注,特别是PDGFRA D842V和KIT外显子17 D816V,与传统酪氨酸激酶抑制剂的抗性有关。近年来,机器学习已成为制药研究中的强大工具,提供数据驱动的见解,并加速药物发现的先导物识别。在这篇研究文章中,我们专注于机器学习和RDKit工具包的应用,以识别针对非小细胞肺癌PDGFR的潜在抗癌候选药物。我们的研究展示了智能算法如何有效地将大型筛选集合缩小到只有几百个小分子的目标特定集合,从而简化了热门发现过程。采用机器学习辅助的虚拟筛选策略,我们成功地从104.8万个化合物库中预选出220个具有潜在PDGFRA抑制活性的化合物,仅占原始文库的0.013%。为了验证这些候选者,我们采用了传统的基于遗传算法的虚拟筛选和对接方法。值得注意的是,我们发现ZINC000002931631对PDGFRA的抑制潜力与Avapritinib相当甚至更好,这凸显了我们机器学习方法的价值。此外,作为先导验证研究的一部分,我们进行了分子动力学模拟,揭示了导致PDGFRA构象变化的关键分子水平相互作用,这是底物结合所必需的。我们的研究证明了机器学习在药物发现过程中的潜力,提供了一种更有效和更具成本效益的方法来识别有前途的NSCLC治疗候选药物。这种方法在预选具有有效PDGFRA抑制潜力的化合物方面的成功突出了其在推进癌症治疗的个性化和靶向治疗方面的重要性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Discovery of novel PDGFR inhibitors targeting non-small cell lung cancer using a multistep machine learning assisted hybrid virtual screening approach

Discovery of novel PDGFR inhibitors targeting non-small cell lung cancer using a multistep machine learning assisted hybrid virtual screening approach

Non-Small Cell Lung Cancer (NSCLC) is a formidable global health challenge, responsible for the majority of cancer-related deaths worldwide. The Platelet-Derived Growth Factor Receptor (PDGFR) has emerged as a promising therapeutic target in NSCLC, given its crucial involvement in cell growth, proliferation, angiogenesis, and tumor progression. Among PDGFR inhibitors, avapritinib has garnered attention due to its selective activity against mutant forms of PDGFR, particularly PDGFRA D842V and KIT exon 17 D816V, linked to resistance against conventional tyrosine kinase inhibitors. In recent years, Machine Learning has emerged as a powerful tool in pharmaceutical research, offering data-driven insights and accelerating lead identification for drug discovery. In this research article, we focus on the application of Machine Learning, alongside the RDKit toolkit, to identify potential anti-cancer drug candidates targeting PDGFR in NSCLC. Our study demonstrates how smart algorithms efficiently narrow down large screening collections to target-specific sets of just a few hundred small molecules, streamlining the hit discovery process. Employing a Machine Learning-assisted virtual screening strategy, we successfully preselected 220 compounds with potential PDGFRA inhibitory activity from a vast library of 1.048 million compounds, representing a mere 0.013% of the original library. To validate these candidates, we employed traditional genetic algorithm-based virtual screening and docking methods. Remarkably, we found that ZINC000002931631 exhibited comparable or even superior inhibitory potential against PDGFRA compared to Avapritinib, which highlights the value of our Machine Learning approach. Moreover, as part of our lead validation studies, we conducted molecular dynamic simulations, revealing critical molecular–level interactions responsible for the conformational changes in PDGFRA necessary for substrate binding. Our study exemplifies the potential of Machine Learning in the drug discovery process, providing a more efficient and cost-effective means of identifying promising drug candidates for NSCLC treatment. The success of this approach in preselecting compounds with potent PDGFRA inhibitory potential highlights its significance in advancing personalized and targeted therapies for cancer treatment.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
RSC Advances
RSC Advances chemical sciences-
CiteScore
7.50
自引率
2.60%
发文量
3116
审稿时长
1.6 months
期刊介绍: An international, peer-reviewed journal covering all of the chemical sciences, including multidisciplinary and emerging areas. RSC Advances is a gold open access journal allowing researchers free access to research articles, and offering an affordable open access publishing option for authors around the world.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信