Mukuo Wang , Bojian Qu , Lihong Yang , Lin Wang , Kaili Jiang , Jianping Lin
{"title":"PyaiVS unifies AI workflows to accelerate ligand discovery and yields ABCG2 inhibitors","authors":"Mukuo Wang , Bojian Qu , Lihong Yang , Lin Wang , Kaili Jiang , Jianping Lin","doi":"10.1016/j.ejmech.2025.118176","DOIUrl":null,"url":null,"abstract":"<div><div>Developing optimized AI models for virtual screening requires coordinated selection of algorithms, molecular representations, and data splitting strategies, yet lacks integrated tools. We present PyaiVS, a Python package that integrates nine machine learning algorithms, five molecular representations, and three data splitting strategies. This study demonstrates that constructing efficient AI-driven virtual screening models for small molecules requires coordinated optimization of algorithm architectures (e.g., prioritizing deep learning models such as GCN, GAT, and Attentive FP), molecular representations (ECFP4/MACCS fingerprints for small datasets and molecular graph-based representations for large-scale data), and data splitting strategies (clustering-based splitting achieving 68.5 % optimal AUC-ROC performance). To demonstrate utility, we combined PyaiVS with pharmacophore modeling and docking to screen 4,188,623 compounds for ABCG2 inhibitors. Experimental validation identified four compounds (C1/C6/C7/C9) binding ABCG2 with sub-100 μM kd values (5.31–51.35 μM) that potentiate topotecan cytotoxicity. PyaiVS streamlines virtual screening by unifying critical components into an accessible platform, freely available at <span><span>https://github.com/danqingmk/OpenVS_PyaiVS</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":314,"journal":{"name":"European Journal of Medicinal Chemistry","volume":"300 ","pages":"Article 118176"},"PeriodicalIF":5.9000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Medicinal Chemistry","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0223523425009419","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MEDICINAL","Score":null,"Total":0}
引用次数: 0
Abstract
Developing optimized AI models for virtual screening requires coordinated selection of algorithms, molecular representations, and data splitting strategies, yet lacks integrated tools. We present PyaiVS, a Python package that integrates nine machine learning algorithms, five molecular representations, and three data splitting strategies. This study demonstrates that constructing efficient AI-driven virtual screening models for small molecules requires coordinated optimization of algorithm architectures (e.g., prioritizing deep learning models such as GCN, GAT, and Attentive FP), molecular representations (ECFP4/MACCS fingerprints for small datasets and molecular graph-based representations for large-scale data), and data splitting strategies (clustering-based splitting achieving 68.5 % optimal AUC-ROC performance). To demonstrate utility, we combined PyaiVS with pharmacophore modeling and docking to screen 4,188,623 compounds for ABCG2 inhibitors. Experimental validation identified four compounds (C1/C6/C7/C9) binding ABCG2 with sub-100 μM kd values (5.31–51.35 μM) that potentiate topotecan cytotoxicity. PyaiVS streamlines virtual screening by unifying critical components into an accessible platform, freely available at https://github.com/danqingmk/OpenVS_PyaiVS.
期刊介绍:
The European Journal of Medicinal Chemistry is a global journal that publishes studies on all aspects of medicinal chemistry. It provides a medium for publication of original papers and also welcomes critical review papers.
A typical paper would report on the organic synthesis, characterization and pharmacological evaluation of compounds. Other topics of interest are drug design, QSAR, molecular modeling, drug-receptor interactions, molecular aspects of drug metabolism, prodrug synthesis and drug targeting. The journal expects manuscripts to present the rational for a study, provide insight into the design of compounds or understanding of mechanism, or clarify the targets.