Mukuo Wang , Bojian Qu , Lihong Yang , Lin Wang , Kaili Jiang , Jianping Lin
{"title":"PyaiVS统一人工智能工作流程,加速配体发现并产生ABCG2抑制剂","authors":"Mukuo Wang , Bojian Qu , Lihong Yang , Lin Wang , Kaili Jiang , Jianping Lin","doi":"10.1016/j.ejmech.2025.118176","DOIUrl":null,"url":null,"abstract":"<div><div>Developing optimized AI models for virtual screening requires coordinated selection of algorithms, molecular representations, and data splitting strategies, yet lacks integrated tools. We present PyaiVS, a Python package that integrates nine machine learning algorithms, five molecular representations, and three data splitting strategies. This study demonstrates that constructing efficient AI-driven virtual screening models for small molecules requires coordinated optimization of algorithm architectures (e.g., prioritizing deep learning models such as GCN, GAT, and Attentive FP), molecular representations (ECFP4/MACCS fingerprints for small datasets and molecular graph-based representations for large-scale data), and data splitting strategies (clustering-based splitting achieving 68.5 % optimal AUC-ROC performance). To demonstrate utility, we combined PyaiVS with pharmacophore modeling and docking to screen 4,188,623 compounds for ABCG2 inhibitors. Experimental validation identified four compounds (C1/C6/C7/C9) binding ABCG2 with sub-100 μM kd values (5.31–51.35 μM) that potentiate topotecan cytotoxicity. PyaiVS streamlines virtual screening by unifying critical components into an accessible platform, freely available at <span><span>https://github.com/danqingmk/OpenVS_PyaiVS</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":314,"journal":{"name":"European Journal of Medicinal Chemistry","volume":"300 ","pages":"Article 118176"},"PeriodicalIF":5.9000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"PyaiVS unifies AI workflows to accelerate ligand discovery and yields ABCG2 inhibitors\",\"authors\":\"Mukuo Wang , Bojian Qu , Lihong Yang , Lin Wang , Kaili Jiang , Jianping Lin\",\"doi\":\"10.1016/j.ejmech.2025.118176\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Developing optimized AI models for virtual screening requires coordinated selection of algorithms, molecular representations, and data splitting strategies, yet lacks integrated tools. We present PyaiVS, a Python package that integrates nine machine learning algorithms, five molecular representations, and three data splitting strategies. This study demonstrates that constructing efficient AI-driven virtual screening models for small molecules requires coordinated optimization of algorithm architectures (e.g., prioritizing deep learning models such as GCN, GAT, and Attentive FP), molecular representations (ECFP4/MACCS fingerprints for small datasets and molecular graph-based representations for large-scale data), and data splitting strategies (clustering-based splitting achieving 68.5 % optimal AUC-ROC performance). To demonstrate utility, we combined PyaiVS with pharmacophore modeling and docking to screen 4,188,623 compounds for ABCG2 inhibitors. Experimental validation identified four compounds (C1/C6/C7/C9) binding ABCG2 with sub-100 μM kd values (5.31–51.35 μM) that potentiate topotecan cytotoxicity. PyaiVS streamlines virtual screening by unifying critical components into an accessible platform, freely available at <span><span>https://github.com/danqingmk/OpenVS_PyaiVS</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":314,\"journal\":{\"name\":\"European Journal of Medicinal Chemistry\",\"volume\":\"300 \",\"pages\":\"Article 118176\"},\"PeriodicalIF\":5.9000,\"publicationDate\":\"2025-09-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European Journal of Medicinal Chemistry\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0223523425009419\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MEDICINAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Medicinal Chemistry","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0223523425009419","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
PyaiVS unifies AI workflows to accelerate ligand discovery and yields ABCG2 inhibitors
Developing optimized AI models for virtual screening requires coordinated selection of algorithms, molecular representations, and data splitting strategies, yet lacks integrated tools. We present PyaiVS, a Python package that integrates nine machine learning algorithms, five molecular representations, and three data splitting strategies. This study demonstrates that constructing efficient AI-driven virtual screening models for small molecules requires coordinated optimization of algorithm architectures (e.g., prioritizing deep learning models such as GCN, GAT, and Attentive FP), molecular representations (ECFP4/MACCS fingerprints for small datasets and molecular graph-based representations for large-scale data), and data splitting strategies (clustering-based splitting achieving 68.5 % optimal AUC-ROC performance). To demonstrate utility, we combined PyaiVS with pharmacophore modeling and docking to screen 4,188,623 compounds for ABCG2 inhibitors. Experimental validation identified four compounds (C1/C6/C7/C9) binding ABCG2 with sub-100 μM kd values (5.31–51.35 μM) that potentiate topotecan cytotoxicity. PyaiVS streamlines virtual screening by unifying critical components into an accessible platform, freely available at https://github.com/danqingmk/OpenVS_PyaiVS.
期刊介绍:
The European Journal of Medicinal Chemistry is a global journal that publishes studies on all aspects of medicinal chemistry. It provides a medium for publication of original papers and also welcomes critical review papers.
A typical paper would report on the organic synthesis, characterization and pharmacological evaluation of compounds. Other topics of interest are drug design, QSAR, molecular modeling, drug-receptor interactions, molecular aspects of drug metabolism, prodrug synthesis and drug targeting. The journal expects manuscripts to present the rational for a study, provide insight into the design of compounds or understanding of mechanism, or clarify the targets.