GPCR-A17 MAAP: mapping modulators, agonists, and antagonists to predict the next bioactive target

IF 5.7 2区化学 Q1 CHEMISTRY, MULTIDISCIPLINARY

Journal of Cheminformatics Pub Date : 2025-07-11 DOI:10.1186/s13321-025-01050-z

Ana B. Caniceiro, Ana M. B. Amorim, Nícia Rosário-Ferreira, Irina S. Moreira

{"title":"GPCR-A17 MAAP: mapping modulators, agonists, and antagonists to predict the next bioactive target","authors":"Ana B. Caniceiro, Ana M. B. Amorim, Nícia Rosário-Ferreira, Irina S. Moreira","doi":"10.1186/s13321-025-01050-z","DOIUrl":null,"url":null,"abstract":"<div><p>G Protein-Coupled Receptors (GPCRs) are vital players in cellular signalling and key targets for drug discovery, especially within the GPCR-A17 subfamily, which is linked to various diseases. To address the growing need for effective treatments, the GPCR-A17 Modulator, Agonist, Antagonist Predictor (MAAP) was introduced as an advanced ensemble machine learning model that combines XGBoost, Random Forest, and LightGBM to predict the functional roles of agonists, antagonists, and modulators in GPCR-A17 interactions. The model was trained on a dataset of over 3,000 ligands (agonists, antagonists, and modulators) and 6,900 protein–ligand interactions, comprising all three ligand types, sourced from the Guide to Pharmacology, Therapeutic Target Database, and ChEMBL. It demonstrated a strong predictive performance, achieving F1 scores of 0.9179 and 0.7151, AUCs of 0.9766 and 0.8591, and specificities of 0.9703 and 0.8789, respectively, reflecting the overall performance across all classes in the testing and independent ligand validation datasets. A Ki-filtered subset of 4,274 interactions (where Ki is the inhibition constant that quantifies the ligand-binding affinity) improved the F1 scores to 0.9330 and 0.8267 for the testing and independent ligand datasets, respectively. By guiding experimental validation, GPCR-A17 MAAP accelerates drug discovery for various therapeutic targets. The code and data are available on GitHub (https://github.com/MoreiraLAB/GPCR-A17-MAAP).</p><h3>Graphical Abstract</h3>\n<div><figure><div><div><picture><source><img></source></picture></div></div></figure></div></div>","PeriodicalId":617,"journal":{"name":"Journal of Cheminformatics","volume":"17 1","pages":""},"PeriodicalIF":5.7000,"publicationDate":"2025-07-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://jcheminf.biomedcentral.com/counter/pdf/10.1186/s13321-025-01050-z","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Cheminformatics","FirstCategoryId":"92","ListUrlMain":"https://link.springer.com/article/10.1186/s13321-025-01050-z","RegionNum":2,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

G Protein-Coupled Receptors (GPCRs) are vital players in cellular signalling and key targets for drug discovery, especially within the GPCR-A17 subfamily, which is linked to various diseases. To address the growing need for effective treatments, the GPCR-A17 Modulator, Agonist, Antagonist Predictor (MAAP) was introduced as an advanced ensemble machine learning model that combines XGBoost, Random Forest, and LightGBM to predict the functional roles of agonists, antagonists, and modulators in GPCR-A17 interactions. The model was trained on a dataset of over 3,000 ligands (agonists, antagonists, and modulators) and 6,900 protein–ligand interactions, comprising all three ligand types, sourced from the Guide to Pharmacology, Therapeutic Target Database, and ChEMBL. It demonstrated a strong predictive performance, achieving F1 scores of 0.9179 and 0.7151, AUCs of 0.9766 and 0.8591, and specificities of 0.9703 and 0.8789, respectively, reflecting the overall performance across all classes in the testing and independent ligand validation datasets. A Ki-filtered subset of 4,274 interactions (where Ki is the inhibition constant that quantifies the ligand-binding affinity) improved the F1 scores to 0.9330 and 0.8267 for the testing and independent ligand datasets, respectively. By guiding experimental validation, GPCR-A17 MAAP accelerates drug discovery for various therapeutic targets. The code and data are available on GitHub (https://github.com/MoreiraLAB/GPCR-A17-MAAP).

Graphical Abstract

查看原文本刊更多论文

GPCR-A17 MAAP：定位调节剂、激动剂和拮抗剂，预测下一个生物活性靶点

G蛋白偶联受体（gpcr）在细胞信号传导中起着至关重要的作用，也是药物发现的关键靶点，特别是在与多种疾病相关的GPCR-A17亚家族中。为了满足对有效治疗方法日益增长的需求，GPCR-A17调节剂、激动剂、拮抗剂预测器（MAAP）作为一种先进的集成机器学习模型被引入，该模型结合了XGBoost、Random Forest和LightGBM来预测GPCR-A17相互作用中激动剂、拮抗剂和调节剂的功能作用。该模型是在超过3000种配体（激动剂、拮抗剂和调节剂）和6900种蛋白质-配体相互作用的数据集上训练的，包括所有三种配体类型，来自药理学指南、治疗靶点数据库和ChEMBL。该方法具有较强的预测性能，F1得分分别为0.9179和0.7151，auc分别为0.9766和0.8591，特异性分别为0.9703和0.8789，反映了测试和独立配体验证数据集中所有类别的整体性能。Ki过滤的4274个相互作用子集（其中Ki是量化配体结合亲和力的抑制常数）分别将测试和独立配体数据集的F1分数提高到0.9330和0.8267。通过指导实验验证，GPCR-A17 MAAP加速了各种治疗靶点的药物发现。代码和数据可在GitHub （https://github.com/MoreiraLAB/GPCR-A17-MAAP）上获得。这项关于gpcr，特别是GPCR-A17亚家族的研究具有重要意义，因为它们在细胞信号传导中起着至关重要的作用，并与开发针对复杂健康状况的靶向治疗相关。通过推进配体-受体相互作用预测（激动剂、拮抗剂和调节剂），本研究增强了我们对药物-受体动力学的理解。这些见解可以简化药物发现，减少实验试错，并加速识别用于治疗应用的生物活性化合物。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Cheminformatics CHEMISTRY, MULTIDISCIPLINARY-COMPUTER SCIENCE, INFORMATION SYSTEMS

CiteScore

14.10

自引率

7.00%

发文量

审稿时长

3 months

期刊介绍： Journal of Cheminformatics is an open access journal publishing original peer-reviewed research in all aspects of cheminformatics and molecular modelling. Coverage includes, but is not limited to: chemical information systems, software and databases, and molecular modelling, chemical structure representations and their use in structure, substructure, and similarity searching of chemical substance and chemical reaction databases, computer and molecular graphics, computer-aided molecular design, expert systems, QSAR, and data mining techniques.