Wei Feng, Lei Liu, Lingjun Li, Peng Du, Zhichen Yuan, Jing Yuan, Changjiang Huang, Zijian Qin
{"title":"通过虚拟筛选和机器学习设计和发现POLQ解旋酶结构域抑制剂","authors":"Wei Feng, Lei Liu, Lingjun Li, Peng Du, Zhichen Yuan, Jing Yuan, Changjiang Huang, Zijian Qin","doi":"10.1007/s00044-025-03423-3","DOIUrl":null,"url":null,"abstract":"<div><p>DNA polymerase theta (Polθ or POLQ) is an attractive target for treating BRCA-deficient cancers. In the present work, several computational approaches were employed for the design and discovery of novel POLQ helicase domain inhibitors. A dataset was constructed by curating a total of 781 known inhibitors, which were used to develop binary classification models using random forests to distinguish between highly and weakly active inhibitors. The Matthews correlation coefficient of the consensus model reached 0.771 for the test set. A virtual screening procedure of 3.4 million molecules was conducted based on shape similarity and predictions from the consensus model to identify four hits and a favorable benzothiazole moiety. A molecular generation model was trained using molecules from both the curated dataset and the identified hits to generate potential inhibitors, which were subsequently predicted by the consensus model. Finally, eight compounds were selected and synthesized for biochemical testing, leading to the identification of compound <b>19</b>, which had a novel scaffold and acceptable potency: inhibition rates of 80.7% at a concentration of 100 nM and 39.5% at a concentration of 10 nM. Compound <b>19</b> could serve as a suitable starting point for further optimization efforts in medicinal chemistry.</p><div><figure><div><div><picture><source><img></source></picture></div><div><p>Machine Learning, Virtual Screening, Molecular Generation, Compound Synthesis, and Biochemical Testing in the Discovery of POLQ Helicase Domain Inhibitors.</p></div></div></figure></div></div>","PeriodicalId":699,"journal":{"name":"Medicinal Chemistry Research","volume":"34 6","pages":"1377 - 1391"},"PeriodicalIF":2.6000,"publicationDate":"2025-05-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Design and discovery of POLQ helicase domain inhibitors by virtual screening and machine learning\",\"authors\":\"Wei Feng, Lei Liu, Lingjun Li, Peng Du, Zhichen Yuan, Jing Yuan, Changjiang Huang, Zijian Qin\",\"doi\":\"10.1007/s00044-025-03423-3\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>DNA polymerase theta (Polθ or POLQ) is an attractive target for treating BRCA-deficient cancers. In the present work, several computational approaches were employed for the design and discovery of novel POLQ helicase domain inhibitors. A dataset was constructed by curating a total of 781 known inhibitors, which were used to develop binary classification models using random forests to distinguish between highly and weakly active inhibitors. The Matthews correlation coefficient of the consensus model reached 0.771 for the test set. A virtual screening procedure of 3.4 million molecules was conducted based on shape similarity and predictions from the consensus model to identify four hits and a favorable benzothiazole moiety. A molecular generation model was trained using molecules from both the curated dataset and the identified hits to generate potential inhibitors, which were subsequently predicted by the consensus model. Finally, eight compounds were selected and synthesized for biochemical testing, leading to the identification of compound <b>19</b>, which had a novel scaffold and acceptable potency: inhibition rates of 80.7% at a concentration of 100 nM and 39.5% at a concentration of 10 nM. Compound <b>19</b> could serve as a suitable starting point for further optimization efforts in medicinal chemistry.</p><div><figure><div><div><picture><source><img></source></picture></div><div><p>Machine Learning, Virtual Screening, Molecular Generation, Compound Synthesis, and Biochemical Testing in the Discovery of POLQ Helicase Domain Inhibitors.</p></div></div></figure></div></div>\",\"PeriodicalId\":699,\"journal\":{\"name\":\"Medicinal Chemistry Research\",\"volume\":\"34 6\",\"pages\":\"1377 - 1391\"},\"PeriodicalIF\":2.6000,\"publicationDate\":\"2025-05-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medicinal Chemistry Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s00044-025-03423-3\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medicinal Chemistry Research","FirstCategoryId":"3","ListUrlMain":"https://link.springer.com/article/10.1007/s00044-025-03423-3","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
Design and discovery of POLQ helicase domain inhibitors by virtual screening and machine learning
DNA polymerase theta (Polθ or POLQ) is an attractive target for treating BRCA-deficient cancers. In the present work, several computational approaches were employed for the design and discovery of novel POLQ helicase domain inhibitors. A dataset was constructed by curating a total of 781 known inhibitors, which were used to develop binary classification models using random forests to distinguish between highly and weakly active inhibitors. The Matthews correlation coefficient of the consensus model reached 0.771 for the test set. A virtual screening procedure of 3.4 million molecules was conducted based on shape similarity and predictions from the consensus model to identify four hits and a favorable benzothiazole moiety. A molecular generation model was trained using molecules from both the curated dataset and the identified hits to generate potential inhibitors, which were subsequently predicted by the consensus model. Finally, eight compounds were selected and synthesized for biochemical testing, leading to the identification of compound 19, which had a novel scaffold and acceptable potency: inhibition rates of 80.7% at a concentration of 100 nM and 39.5% at a concentration of 10 nM. Compound 19 could serve as a suitable starting point for further optimization efforts in medicinal chemistry.
Machine Learning, Virtual Screening, Molecular Generation, Compound Synthesis, and Biochemical Testing in the Discovery of POLQ Helicase Domain Inhibitors.
期刊介绍:
Medicinal Chemistry Research (MCRE) publishes papers on a wide range of topics, favoring research with significant, new, and up-to-date information. Although the journal has a demanding peer review process, MCRE still boasts rapid publication, due in part, to the length of the submissions. The journal publishes significant research on various topics, many of which emphasize the structure-activity relationships of molecular biology.