Jai Chand Patel, Sushil Kumar Shakyawar, Sahil Sethi, Chittibabu Guda
{"title":"GAIN-BRCA: a graph-based AI-net framework for breast cancer subtype classification using multiomics data.","authors":"Jai Chand Patel, Sushil Kumar Shakyawar, Sahil Sethi, Chittibabu Guda","doi":"10.1093/bioadv/vbaf116","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>Contextual integration of multiomic datasets from the same patient could improve the accuracy of subtype prediction algorithms to help with better prognosis and management of breast cancer. Previous machine learning models have underexplored the graph-based integration, hence unable to leverage the biological associations among different omics modalities. Here, we developed a graph-based method, GAIN-BRCA, using the native features from mRNA, DNA methylation (CpG), and miRNA data as well as the synthesized features from their interactions. GAIN-BRCA computes weightage from miRNA-mRNA and CpG-mRNA interactions to derive a new transformed feature vector that captures the essential biological context.</p><p><strong>Results: </strong>GAIN-BRCA demonstrates superior performance with an AUROC of 0.98. GAIN-BRCA, with an accuracy of 0.92 also outperformed the existing methods like MOGONET and moBRCA-net with accuracies of 0.72 and 0.86, respectively. Kaplan-Meier survival analysis revealed subtype-specific prognostic genes, including KRAS in Luminal A (<i>P</i> value = 0.041), TOX in Luminal B (<i>P</i> value = 0.008), and MITF and TOB1 in HER2+ (<i>P</i> values = 0.029 and 0.025, respectively). However, no single gene demonstrated a significant survival correlation unique to the Basal subtype. GAIN-BRCA framework, in combination with SHAP, has identified several subtype-specific biomarkers to aid in the development of precision therapeutics for breast cancer subtypes.</p><p><strong>Availability and implementation: </strong>GAIN-BRCA code is publicly accessible on https://github.com/GudaLab/GAIN-BRCA.</p>","PeriodicalId":72368,"journal":{"name":"Bioinformatics advances","volume":"5 1","pages":"vbaf116"},"PeriodicalIF":2.8000,"publicationDate":"2025-05-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12151285/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics advances","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioadv/vbaf116","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Motivation: Contextual integration of multiomic datasets from the same patient could improve the accuracy of subtype prediction algorithms to help with better prognosis and management of breast cancer. Previous machine learning models have underexplored the graph-based integration, hence unable to leverage the biological associations among different omics modalities. Here, we developed a graph-based method, GAIN-BRCA, using the native features from mRNA, DNA methylation (CpG), and miRNA data as well as the synthesized features from their interactions. GAIN-BRCA computes weightage from miRNA-mRNA and CpG-mRNA interactions to derive a new transformed feature vector that captures the essential biological context.
Results: GAIN-BRCA demonstrates superior performance with an AUROC of 0.98. GAIN-BRCA, with an accuracy of 0.92 also outperformed the existing methods like MOGONET and moBRCA-net with accuracies of 0.72 and 0.86, respectively. Kaplan-Meier survival analysis revealed subtype-specific prognostic genes, including KRAS in Luminal A (P value = 0.041), TOX in Luminal B (P value = 0.008), and MITF and TOB1 in HER2+ (P values = 0.029 and 0.025, respectively). However, no single gene demonstrated a significant survival correlation unique to the Basal subtype. GAIN-BRCA framework, in combination with SHAP, has identified several subtype-specific biomarkers to aid in the development of precision therapeutics for breast cancer subtypes.
Availability and implementation: GAIN-BRCA code is publicly accessible on https://github.com/GudaLab/GAIN-BRCA.