GAIN-BRCA: a graph-based AI-net framework for breast cancer subtype classification using multiomics data.

IF 2.8 Q2 MATHEMATICAL & COMPUTATIONAL BIOLOGY
Bioinformatics advances Pub Date : 2025-05-14 eCollection Date: 2025-01-01 DOI:10.1093/bioadv/vbaf116
Jai Chand Patel, Sushil Kumar Shakyawar, Sahil Sethi, Chittibabu Guda
{"title":"GAIN-BRCA: a graph-based AI-net framework for breast cancer subtype classification using multiomics data.","authors":"Jai Chand Patel, Sushil Kumar Shakyawar, Sahil Sethi, Chittibabu Guda","doi":"10.1093/bioadv/vbaf116","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Contextual integration of multiomic datasets from the same patient could improve the accuracy of subtype prediction algorithms to help with better prognosis and management of breast cancer. Previous machine learning models have underexplored the graph-based integration, hence unable to leverage the biological associations among different omics modalities. Here, we developed a graph-based method, GAIN-BRCA, using the native features from mRNA, DNA methylation (CpG), and miRNA data as well as the synthesized features from their interactions. GAIN-BRCA computes weightage from miRNA-mRNA and CpG-mRNA interactions to derive a new transformed feature vector that captures the essential biological context.</p><p><strong>Results: </strong>GAIN-BRCA demonstrates superior performance with an AUROC of 0.98. GAIN-BRCA, with an accuracy of 0.92 also outperformed the existing methods like MOGONET and moBRCA-net with accuracies of 0.72 and 0.86, respectively. Kaplan-Meier survival analysis revealed subtype-specific prognostic genes, including KRAS in Luminal A (<i>P</i> value = 0.041), TOX in Luminal B (<i>P</i> value = 0.008), and MITF and TOB1 in HER2+ (<i>P</i> values = 0.029 and 0.025, respectively). However, no single gene demonstrated a significant survival correlation unique to the Basal subtype. GAIN-BRCA framework, in combination with SHAP, has identified several subtype-specific biomarkers to aid in the development of precision therapeutics for breast cancer subtypes.</p><p><strong>Availability and implementation: </strong>GAIN-BRCA code is publicly accessible on https://github.com/GudaLab/GAIN-BRCA.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf116"},"PeriodicalIF":2.8000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12151285/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Motivation: Contextual integration of multiomic datasets from the same patient could improve the accuracy of subtype prediction algorithms to help with better prognosis and management of breast cancer. Previous machine learning models have underexplored the graph-based integration, hence unable to leverage the biological associations among different omics modalities. Here, we developed a graph-based method, GAIN-BRCA, using the native features from mRNA, DNA methylation (CpG), and miRNA data as well as the synthesized features from their interactions. GAIN-BRCA computes weightage from miRNA-mRNA and CpG-mRNA interactions to derive a new transformed feature vector that captures the essential biological context.

Results: GAIN-BRCA demonstrates superior performance with an AUROC of 0.98. GAIN-BRCA, with an accuracy of 0.92 also outperformed the existing methods like MOGONET and moBRCA-net with accuracies of 0.72 and 0.86, respectively. Kaplan-Meier survival analysis revealed subtype-specific prognostic genes, including KRAS in Luminal A (P value = 0.041), TOX in Luminal B (P value = 0.008), and MITF and TOB1 in HER2+ (P values = 0.029 and 0.025, respectively). However, no single gene demonstrated a significant survival correlation unique to the Basal subtype. GAIN-BRCA framework, in combination with SHAP, has identified several subtype-specific biomarkers to aid in the development of precision therapeutics for breast cancer subtypes.

Availability and implementation: GAIN-BRCA code is publicly accessible on https://github.com/GudaLab/GAIN-BRCA.

GAIN-BRCA:基于图的AI-net框架,用于使用多组学数据进行乳腺癌亚型分类。
动机:来自同一患者的多组数据集的上下文整合可以提高亚型预测算法的准确性,从而有助于更好的乳腺癌预后和管理。以前的机器学习模型没有充分探索基于图的集成,因此无法利用不同组学模式之间的生物学关联。在这里,我们开发了一种基于图的方法,即GAIN-BRCA,利用mRNA、DNA甲基化(CpG)和miRNA数据的天然特征以及它们相互作用的合成特征。GAIN-BRCA计算miRNA-mRNA和CpG-mRNA相互作用的权重,以获得捕获基本生物学背景的新转换特征向量。结果:GAIN-BRCA表现出优异的性能,AUROC为0.98。GAIN-BRCA的准确率为0.92,也优于MOGONET和moBRCA-net等现有方法,前者的准确率分别为0.72和0.86。Kaplan-Meier生存分析揭示了亚型特异性预后基因,包括Luminal A中的KRAS (P值= 0.041),Luminal B中的TOX (P值= 0.008),HER2+中的MITF和TOB1 (P值分别为0.029和0.025)。然而,没有单一基因显示出与基底亚型特有的显著生存相关。GAIN-BRCA框架与SHAP结合,已经确定了几种亚型特异性生物标志物,以帮助开发针对乳腺癌亚型的精确治疗方法。可用性和实现:GAIN-BRCA代码可在https://github.com/GudaLab/GAIN-BRCA上公开访问。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.60
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信