PyaiVS unifies AI workflows to accelerate ligand discovery and yields ABCG2 inhibitors

IF 5.9 | Zone 2 (Medicine) | JCR Q1 (Chemistry, Medicinal)
Mukuo Wang, Bojian Qu, Lihong Yang, Lin Wang, Kaili Jiang, Jianping Lin
European Journal of Medicinal Chemistry, Volume 300, Article 118176. Published 2025-09-17. DOI: 10.1016/j.ejmech.2025.118176. Full text: https://www.sciencedirect.com/science/article/pii/S0223523425009419

Abstract

Developing optimized AI models for virtual screening requires the coordinated selection of algorithms, molecular representations, and data-splitting strategies, yet integrated tools for doing so are lacking. We present PyaiVS, a Python package that integrates nine machine learning algorithms, five molecular representations, and three data-splitting strategies. This study demonstrates that constructing efficient AI-driven virtual screening models for small molecules requires coordinated optimization of algorithm architectures (e.g., prioritizing deep learning models such as GCN, GAT, and Attentive FP), molecular representations (ECFP4/MACCS fingerprints for small datasets and molecular graph-based representations for large-scale data), and data-splitting strategies (clustering-based splitting achieving 68.5% optimal AUC-ROC performance). To demonstrate utility, we combined PyaiVS with pharmacophore modeling and docking to screen 4,188,623 compounds for ABCG2 inhibitors. Experimental validation identified four compounds (C1, C6, C7, and C9) that bind ABCG2 with sub-100 μM Kd values (5.31–51.35 μM) and potentiate topotecan cytotoxicity. PyaiVS streamlines virtual screening by unifying critical components into an accessible platform, freely available at https://github.com/danqingmk/OpenVS_PyaiVS.
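
To make the idea of clustering-based splitting concrete, the following is a minimal, standard-library-only sketch. It is not PyaiVS's implementation, which the abstract does not describe: the function names (`tanimoto`, `leader_clusters`, `cluster_split`), the greedy leader clustering, the 0.5 similarity threshold, and the toy fingerprints are all assumptions chosen for illustration. The point it shows is that whole clusters of similar molecules are assigned to either the training or the test set, so close analogues never appear on both sides of the split.

```python
from typing import List, Sequence, Set, Tuple


def tanimoto(a: Set[int], b: Set[int]) -> float:
    """Tanimoto similarity between two fingerprints given as sets of on-bits."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def leader_clusters(fps: Sequence[Set[int]], threshold: float = 0.5) -> List[List[int]]:
    """Greedy leader clustering (an illustrative choice, not PyaiVS's method).

    Each molecule joins the first cluster whose leader is at least
    `threshold` similar to it; otherwise it starts a new cluster.
    """
    leaders: List[int] = []
    clusters: List[List[int]] = []
    for i, fp in enumerate(fps):
        for c, leader in enumerate(leaders):
            if tanimoto(fp, fps[leader]) >= threshold:
                clusters[c].append(i)
                break
        else:
            leaders.append(i)
            clusters.append([i])
    return clusters


def cluster_split(
    fps: Sequence[Set[int]], threshold: float = 0.5, test_fraction: float = 0.2
) -> Tuple[List[int], List[int]]:
    """Assign whole clusters to train or test so similar molecules stay together."""
    clusters = leader_clusters(fps, threshold)
    n_test_target = round(test_fraction * len(fps))
    train: List[int] = []
    test: List[int] = []
    # Fill the test set with the smallest clusters first; the rest go to training.
    for members in sorted(clusters, key=len):
        if len(test) + len(members) <= n_test_target:
            test.extend(members)
        else:
            train.extend(members)
    return sorted(train), sorted(test)


# Toy fingerprints (sets of on-bit indices) standing in for ECFP4 or MACCS bits.
toy_fps = [
    {1, 2, 3, 4},      # molecule 0
    {1, 2, 3, 5},      # molecule 1, similar to molecule 0
    {10, 11, 12},      # molecule 2
    {10, 11, 12, 13},  # molecule 3, similar to molecule 2
    {20, 21},          # molecule 4, a singleton
]
train_idx, test_idx = cluster_split(toy_fps, threshold=0.5, test_fraction=0.4)
print("train:", train_idx, "test:", test_idx)
```

In practice the fingerprints would come from a cheminformatics toolkit and the clustering from an established algorithm; the sketch only illustrates why a cluster-aware split gives a harder, more realistic test set than a random one.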
