Design and discovery of POLQ helicase domain inhibitors by virtual screening and machine learning

IF 2.6 4区 医学 Q3 CHEMISTRY, MEDICINAL
Wei Feng, Lei Liu, Lingjun Li, Peng Du, Zhichen Yuan, Jing Yuan, Changjiang Huang, Zijian Qin
{"title":"Design and discovery of POLQ helicase domain inhibitors by virtual screening and machine learning","authors":"Wei Feng,&nbsp;Lei Liu,&nbsp;Lingjun Li,&nbsp;Peng Du,&nbsp;Zhichen Yuan,&nbsp;Jing Yuan,&nbsp;Changjiang Huang,&nbsp;Zijian Qin","doi":"10.1007/s00044-025-03423-3","DOIUrl":null,"url":null,"abstract":"<div><p>DNA polymerase theta (Polθ or POLQ) is an attractive target for treating BRCA-deficient cancers. In the present work, several computational approaches were employed for the design and discovery of novel POLQ helicase domain inhibitors. A dataset was constructed by curating a total of 781 known inhibitors, which were used to develop binary classification models using random forests to distinguish between highly and weakly active inhibitors. The Matthews correlation coefficient of the consensus model reached 0.771 for the test set. A virtual screening procedure of 3.4 million molecules was conducted based on shape similarity and predictions from the consensus model to identify four hits and a favorable benzothiazole moiety. A molecular generation model was trained using molecules from both the curated dataset and the identified hits to generate potential inhibitors, which were subsequently predicted by the consensus model. Finally, eight compounds were selected and synthesized for biochemical testing, leading to the identification of compound <b>19</b>, which had a novel scaffold and acceptable potency: inhibition rates of 80.7% at a concentration of 100 nM and 39.5% at a concentration of 10 nM. Compound <b>19</b> could serve as a suitable starting point for further optimization efforts in medicinal chemistry.</p><div><figure><div><div><picture><source><img></source></picture></div><div><p>Machine Learning, Virtual Screening, Molecular Generation, Compound Synthesis, and Biochemical Testing in the Discovery of POLQ Helicase Domain Inhibitors.</p></div></div></figure></div></div>","PeriodicalId":699,"journal":{"name":"Medicinal Chemistry Research","volume":"34 6","pages":"1377 - 1391"},"PeriodicalIF":2.6000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medicinal Chemistry Research","FirstCategoryId":"3","ListUrlMain":"https://link.springer.com/article/10.1007/s00044-025-03423-3","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0

Abstract

DNA polymerase theta (Polθ or POLQ) is an attractive target for treating BRCA-deficient cancers. In the present work, several computational approaches were employed for the design and discovery of novel POLQ helicase domain inhibitors. A dataset was constructed by curating a total of 781 known inhibitors, which were used to develop binary classification models using random forests to distinguish between highly and weakly active inhibitors. The Matthews correlation coefficient of the consensus model reached 0.771 for the test set. A virtual screening procedure of 3.4 million molecules was conducted based on shape similarity and predictions from the consensus model to identify four hits and a favorable benzothiazole moiety. A molecular generation model was trained using molecules from both the curated dataset and the identified hits to generate potential inhibitors, which were subsequently predicted by the consensus model. Finally, eight compounds were selected and synthesized for biochemical testing, leading to the identification of compound 19, which had a novel scaffold and acceptable potency: inhibition rates of 80.7% at a concentration of 100 nM and 39.5% at a concentration of 10 nM. Compound 19 could serve as a suitable starting point for further optimization efforts in medicinal chemistry.

Machine Learning, Virtual Screening, Molecular Generation, Compound Synthesis, and Biochemical Testing in the Discovery of POLQ Helicase Domain Inhibitors.

通过虚拟筛选和机器学习设计和发现POLQ解旋酶结构域抑制剂
DNA聚合酶theta (Polθ或POLQ)是治疗brca缺陷癌症的一个有吸引力的靶点。在目前的工作中,几种计算方法被用于设计和发现新的POLQ解旋酶结构域抑制剂。通过筛选总共781种已知抑制剂构建了一个数据集,并使用随机森林开发二元分类模型来区分高活性和弱活性抑制剂。测试集共识模型的马修斯相关系数达到0.771。基于形状相似性和共识模型的预测,进行了340万个分子的虚拟筛选程序,以确定四个命中和一个有利的苯并噻唑片段。分子生成模型使用来自精选数据集和已识别命中的分子进行训练,以生成潜在的抑制剂,随后通过共识模型进行预测。最后,选择8个化合物进行生化实验,鉴定出化合物19,该化合物具有新颖的支架和可接受的效价:在100 nM浓度下抑制率为80.7%,在10 nM浓度下抑制率为39.5%。化合物19可以作为药物化学进一步优化的合适起点。机器学习,虚拟筛选,分子生成,化合物合成,以及发现POLQ解旋酶结构域抑制剂的生化测试。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Medicinal Chemistry Research
Medicinal Chemistry Research 医学-医药化学
CiteScore
4.70
自引率
3.80%
发文量
162
审稿时长
5.0 months
期刊介绍: Medicinal Chemistry Research (MCRE) publishes papers on a wide range of topics, favoring research with significant, new, and up-to-date information. Although the journal has a demanding peer review process, MCRE still boasts rapid publication, due in part, to the length of the submissions. The journal publishes significant research on various topics, many of which emphasize the structure-activity relationships of molecular biology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信