Machine learning-based design, screening, and activity validation of topoisomerase I inhibitors.

IF 3.8 2区 化学 Q2 CHEMISTRY, APPLIED
Ya-Kun Zhang, Jian-Bo Tong, Jia-Le Li, Rong Wang, Yan-Rong Zeng
{"title":"Machine learning-based design, screening, and activity validation of topoisomerase I inhibitors.","authors":"Ya-Kun Zhang, Jian-Bo Tong, Jia-Le Li, Rong Wang, Yan-Rong Zeng","doi":"10.1007/s11030-025-11295-3","DOIUrl":null,"url":null,"abstract":"<p><p>Topoisomerase I (TOP I) plays a vital role in maintaining genomic stability and regulating cellular proliferation. Its overexpression in aggressive cancers such as lung, pancreatic, and breast malignancies highlights its value as a therapeutic target. However, the current TOP I inhibitors face limitations including poor hydrolytic stability, significant toxicity, and the emergence of drug resistance. To address these issues, this study developed a comprehensive QSAR framework that goes beyond traditional methods restricted by limited descriptors or single algorithms. A dataset of 550 high-activity compounds from ChEMBL, BindingDB, and Topscience was systematically screened to build thirty QSAR models combining five molecular fingerprint types with six advanced machine learning algorithms. An optimized artificial neural network model was then employed to rationally design 5938 candidate inhibitors using the sequential attachment-based fragment embedding (SAFE) methodology. These candidates underwent rigorous evaluation through activity prediction, drug-likeness assessment, and ADMET profiling, resulting in seven promising compounds. Among them, three were experimentally validated by MTT cytotoxicity assays, while four novel compounds were further characterized by molecular docking and molecular dynamics simulations. This integrative approach provides a robust theoretical foundation for the rational design and optimization of TOP I inhibitors, facilitating the development of targeted therapies against TOP I-associated cancers.</p>","PeriodicalId":708,"journal":{"name":"Molecular Diversity","volume":" ","pages":""},"PeriodicalIF":3.8000,"publicationDate":"2025-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Molecular Diversity","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1007/s11030-025-11295-3","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0

Abstract

Topoisomerase I (TOP I) plays a vital role in maintaining genomic stability and regulating cellular proliferation. Its overexpression in aggressive cancers such as lung, pancreatic, and breast malignancies highlights its value as a therapeutic target. However, the current TOP I inhibitors face limitations including poor hydrolytic stability, significant toxicity, and the emergence of drug resistance. To address these issues, this study developed a comprehensive QSAR framework that goes beyond traditional methods restricted by limited descriptors or single algorithms. A dataset of 550 high-activity compounds from ChEMBL, BindingDB, and Topscience was systematically screened to build thirty QSAR models combining five molecular fingerprint types with six advanced machine learning algorithms. An optimized artificial neural network model was then employed to rationally design 5938 candidate inhibitors using the sequential attachment-based fragment embedding (SAFE) methodology. These candidates underwent rigorous evaluation through activity prediction, drug-likeness assessment, and ADMET profiling, resulting in seven promising compounds. Among them, three were experimentally validated by MTT cytotoxicity assays, while four novel compounds were further characterized by molecular docking and molecular dynamics simulations. This integrative approach provides a robust theoretical foundation for the rational design and optimization of TOP I inhibitors, facilitating the development of targeted therapies against TOP I-associated cancers.

拓扑异构酶I抑制剂基于机器学习的设计、筛选和活性验证。
拓扑异构酶I (topi)在维持基因组稳定性和调节细胞增殖中起着至关重要的作用。它在侵袭性癌症如肺癌、胰腺癌和乳腺癌中的过度表达突出了它作为治疗靶点的价值。然而,目前的TOP I抑制剂面临水解稳定性差、毒性大、出现耐药等局限性。为了解决这些问题,本研究开发了一个全面的QSAR框架,超越了受有限描述符或单一算法限制的传统方法。系统筛选来自ChEMBL、BindingDB和Topscience的550个高活性化合物的数据集,构建30个QSAR模型,结合5种分子指纹类型和6种先进的机器学习算法。利用优化后的人工神经网络模型,采用基于序列附着的片段嵌入(SAFE)方法,合理设计5938种候选抑制剂。这些候选药物通过活性预测、药物相似性评估和ADMET分析进行了严格的评估,最终产生了7种有前景的化合物。其中3个化合物通过MTT细胞毒性实验验证,4个新化合物通过分子对接和分子动力学模拟进一步表征。这种综合方法为合理设计和优化TOP I抑制剂提供了坚实的理论基础,促进了针对TOP I相关癌症的靶向治疗的发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Molecular Diversity
Molecular Diversity 化学-化学综合
CiteScore
7.30
自引率
7.90%
发文量
219
审稿时长
2.7 months
期刊介绍: Molecular Diversity is a new publication forum for the rapid publication of refereed papers dedicated to describing the development, application and theory of molecular diversity and combinatorial chemistry in basic and applied research and drug discovery. The journal publishes both short and full papers, perspectives, news and reviews dealing with all aspects of the generation of molecular diversity, application of diversity for screening against alternative targets of all types (biological, biophysical, technological), analysis of results obtained and their application in various scientific disciplines/approaches including: combinatorial chemistry and parallel synthesis; small molecule libraries; microwave synthesis; flow synthesis; fluorous synthesis; diversity oriented synthesis (DOS); nanoreactors; click chemistry; multiplex technologies; fragment- and ligand-based design; structure/function/SAR; computational chemistry and molecular design; chemoinformatics; screening techniques and screening interfaces; analytical and purification methods; robotics, automation and miniaturization; targeted libraries; display libraries; peptides and peptoids; proteins; oligonucleotides; carbohydrates; natural diversity; new methods of library formulation and deconvolution; directed evolution, origin of life and recombination; search techniques, landscapes, random chemistry and more;
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信