GPCR-A17 MAAP: mapping modulators, agonists, and antagonists to predict the next bioactive target

IF 7.1 2区 化学 Q1 CHEMISTRY, MULTIDISCIPLINARY
Ana B. Caniceiro, Ana M. B. Amorim, Nícia Rosário-Ferreira, Irina S. Moreira
{"title":"GPCR-A17 MAAP: mapping modulators, agonists, and antagonists to predict the next bioactive target","authors":"Ana B. Caniceiro, Ana M. B. Amorim, Nícia Rosário-Ferreira, Irina S. Moreira","doi":"10.1186/s13321-025-01050-z","DOIUrl":null,"url":null,"abstract":"G Protein-Coupled Receptors (GPCRs) are vital players in cellular signalling and key targets for drug discovery, especially within the GPCR-A17 subfamily, which is linked to various diseases. To address the growing need for effective treatments, the GPCR-A17 Modulator, Agonist, Antagonist Predictor (MAAP) was introduced as an advanced ensemble machine learning model that combines XGBoost, Random Forest, and LightGBM to predict the functional roles of agonists, antagonists, and modulators in GPCR-A17 interactions. The model was trained on a dataset of over 3,000 ligands (agonists, antagonists, and modulators) and 6,900 protein–ligand interactions, comprising all three ligand types, sourced from the Guide to Pharmacology, Therapeutic Target Database, and ChEMBL. It demonstrated a strong predictive performance, achieving F1 scores of 0.9179 and 0.7151, AUCs of 0.9766 and 0.8591, and specificities of 0.9703 and 0.8789, respectively, reflecting the overall performance across all classes in the testing and independent ligand validation datasets. A Ki-filtered subset of 4,274 interactions (where Ki is the inhibition constant that quantifies the ligand-binding affinity) improved the F1 scores to 0.9330 and 0.8267 for the testing and independent ligand datasets, respectively. By guiding experimental validation, GPCR-A17 MAAP accelerates drug discovery for various therapeutic targets. The code and data are available on GitHub ( https://github.com/MoreiraLAB/GPCR-A17-MAAP ). This research on GPCRs, particularly the GPCR-A17 subfamily, is significant because of their crucial role in cellular signalling and relevance in developing targeted therapies for complex health conditions. By advancing ligand-receptor interaction predictions (agonists, antagonists, and modulators), this study enhances our understanding of drug-receptor dynamics. These insights can streamline drug discovery, reduce experimental trial-and-error, and accelerate the identification of bioactive compounds for therapeutic applications.","PeriodicalId":617,"journal":{"name":"Journal of Cheminformatics","volume":"28 1","pages":""},"PeriodicalIF":7.1000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cheminformatics","FirstCategoryId":"92","ListUrlMain":"https://doi.org/10.1186/s13321-025-01050-z","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

G Protein-Coupled Receptors (GPCRs) are vital players in cellular signalling and key targets for drug discovery, especially within the GPCR-A17 subfamily, which is linked to various diseases. To address the growing need for effective treatments, the GPCR-A17 Modulator, Agonist, Antagonist Predictor (MAAP) was introduced as an advanced ensemble machine learning model that combines XGBoost, Random Forest, and LightGBM to predict the functional roles of agonists, antagonists, and modulators in GPCR-A17 interactions. The model was trained on a dataset of over 3,000 ligands (agonists, antagonists, and modulators) and 6,900 protein–ligand interactions, comprising all three ligand types, sourced from the Guide to Pharmacology, Therapeutic Target Database, and ChEMBL. It demonstrated a strong predictive performance, achieving F1 scores of 0.9179 and 0.7151, AUCs of 0.9766 and 0.8591, and specificities of 0.9703 and 0.8789, respectively, reflecting the overall performance across all classes in the testing and independent ligand validation datasets. A Ki-filtered subset of 4,274 interactions (where Ki is the inhibition constant that quantifies the ligand-binding affinity) improved the F1 scores to 0.9330 and 0.8267 for the testing and independent ligand datasets, respectively. By guiding experimental validation, GPCR-A17 MAAP accelerates drug discovery for various therapeutic targets. The code and data are available on GitHub ( https://github.com/MoreiraLAB/GPCR-A17-MAAP ). This research on GPCRs, particularly the GPCR-A17 subfamily, is significant because of their crucial role in cellular signalling and relevance in developing targeted therapies for complex health conditions. By advancing ligand-receptor interaction predictions (agonists, antagonists, and modulators), this study enhances our understanding of drug-receptor dynamics. These insights can streamline drug discovery, reduce experimental trial-and-error, and accelerate the identification of bioactive compounds for therapeutic applications.
GPCR-A17 MAAP:定位调节剂、激动剂和拮抗剂,预测下一个生物活性靶点
G蛋白偶联受体(gpcr)在细胞信号传导中起着至关重要的作用,也是药物发现的关键靶点,特别是在与多种疾病相关的GPCR-A17亚家族中。为了满足对有效治疗方法日益增长的需求,GPCR-A17调节剂、激动剂、拮抗剂预测器(MAAP)作为一种先进的集成机器学习模型被引入,该模型结合了XGBoost、Random Forest和LightGBM来预测GPCR-A17相互作用中激动剂、拮抗剂和调节剂的功能作用。该模型是在超过3000种配体(激动剂、拮抗剂和调节剂)和6900种蛋白质-配体相互作用的数据集上训练的,包括所有三种配体类型,来自药理学指南、治疗靶点数据库和ChEMBL。该方法具有较强的预测性能,F1得分分别为0.9179和0.7151,auc分别为0.9766和0.8591,特异性分别为0.9703和0.8789,反映了测试和独立配体验证数据集中所有类别的整体性能。Ki过滤的4274个相互作用子集(其中Ki是量化配体结合亲和力的抑制常数)分别将测试和独立配体数据集的F1分数提高到0.9330和0.8267。通过指导实验验证,GPCR-A17 MAAP加速了各种治疗靶点的药物发现。代码和数据可在GitHub (https://github.com/MoreiraLAB/GPCR-A17-MAAP)上获得。这项关于gpcr,特别是GPCR-A17亚家族的研究具有重要意义,因为它们在细胞信号传导中起着至关重要的作用,并与开发针对复杂健康状况的靶向治疗相关。通过推进配体-受体相互作用预测(激动剂、拮抗剂和调节剂),本研究增强了我们对药物-受体动力学的理解。这些见解可以简化药物发现,减少实验试错,并加速识别用于治疗应用的生物活性化合物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of Cheminformatics
Journal of Cheminformatics CHEMISTRY, MULTIDISCIPLINARY-COMPUTER SCIENCE, INFORMATION SYSTEMS
CiteScore
14.10
自引率
7.00%
发文量
82
审稿时长
3 months
期刊介绍: Journal of Cheminformatics is an open access journal publishing original peer-reviewed research in all aspects of cheminformatics and molecular modelling. Coverage includes, but is not limited to: chemical information systems, software and databases, and molecular modelling, chemical structure representations and their use in structure, substructure, and similarity searching of chemical substance and chemical reaction databases, computer and molecular graphics, computer-aided molecular design, expert systems, QSAR, and data mining techniques.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信