MOLUNGN: a multi-omics graph neural network for biomarker discovery and accurate lung cancer classification.

IF 2.8 3区 生物学 Q2 GENETICS & HEREDITY
Frontiers in Genetics Pub Date : 2025-06-04 eCollection Date: 2025-01-01 DOI:10.3389/fgene.2025.1610284
Daifeng Zhang, Guoqiang Bian, Yuanbin Zhang, Jiadong Xie, Chenjun Hu
{"title":"MOLUNGN: a multi-omics graph neural network for biomarker discovery and accurate lung cancer classification.","authors":"Daifeng Zhang, Guoqiang Bian, Yuanbin Zhang, Jiadong Xie, Chenjun Hu","doi":"10.3389/fgene.2025.1610284","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Lung cancer continues to pose significant global health burdens due to its high morbidity and mortality. This study aimed to systematically integrate biomedical datasets, particularly incorporating traditional Chinese medicine (TCM)-associated multi-omics data, employing advanced deep-learning methods enhanced by graph attention mechanisms. We sought to investigate molecular mechanisms underlying stage-wise lung cancer progression and identify pivotal stage-specific biomarkers to support precise cancer staging classification.</p><p><strong>Methods: </strong>We developed a novel multi-omics integrative model, named the Multi-Omics Lung Cancer Graph Network (MOLUNGN), based on Graph Attention Networks (GAT). Clinical datasets of non-small cell lung cancer (NSCLC), including lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC), were analyzed to create omics-specific feature matrices comprising mRNA expression, miRNA mutation profiles, and DNA methylation data. MOLUNGN incorporated omics-specific GAT modules (OSGAT) combined with a Multi-Omics View Correlation Discovery Network (MOVCDN), effectively capturing intra- and inter-omics correlations. This framework enabled comprehensive classification of clinical cases into precise cancer stages, alongside the extraction of stage-specific biomarkers.</p><p><strong>Results: </strong>Evaluations utilizing publicly available datasets confirmed MOLUNGN's superior performance over existing methodologies. On the LUAD dataset, MOLUNGN achieved accuracy (ACC) of 0.84, Recall_weighted of 0.84, F1_weighted of 0.83, and F1_macro of 0.82. On the LUSC dataset, the model further improved, achieving ACC of 0.86, Recall_weighted of 0.86, F1_weighted of 0.85, and F1_macro of 0.84. Notably, critical stage-specific biomarkers with significant biological relevance to lung cancer progression were identified, facilitating robust gene-disease associations.</p><p><strong>Discussion: </strong>Our findings underscore the efficacy of MOLUNGN as an integrative framework in accurately classifying lung cancer stages and uncovering essential biomarkers. These biomarkers provide deep insights into lung cancer progression mechanisms and represent promising targets for future clinical validation. Integrating these biomarkers into the TCM-target-disease network enriches the understanding of TCM therapeutic potentials, laying a robust foundation for future precision medicine applications.</p>","PeriodicalId":12750,"journal":{"name":"Frontiers in Genetics","volume":"16 ","pages":"1610284"},"PeriodicalIF":2.8000,"publicationDate":"2025-06-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12174459/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Genetics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.3389/fgene.2025.1610284","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Lung cancer continues to pose significant global health burdens due to its high morbidity and mortality. This study aimed to systematically integrate biomedical datasets, particularly incorporating traditional Chinese medicine (TCM)-associated multi-omics data, employing advanced deep-learning methods enhanced by graph attention mechanisms. We sought to investigate molecular mechanisms underlying stage-wise lung cancer progression and identify pivotal stage-specific biomarkers to support precise cancer staging classification.

Methods: We developed a novel multi-omics integrative model, named the Multi-Omics Lung Cancer Graph Network (MOLUNGN), based on Graph Attention Networks (GAT). Clinical datasets of non-small cell lung cancer (NSCLC), including lung adenocarcinoma (LUAD) and lung squamous cell carcinoma (LUSC), were analyzed to create omics-specific feature matrices comprising mRNA expression, miRNA mutation profiles, and DNA methylation data. MOLUNGN incorporated omics-specific GAT modules (OSGAT) combined with a Multi-Omics View Correlation Discovery Network (MOVCDN), effectively capturing intra- and inter-omics correlations. This framework enabled comprehensive classification of clinical cases into precise cancer stages, alongside the extraction of stage-specific biomarkers.

Results: Evaluations utilizing publicly available datasets confirmed MOLUNGN's superior performance over existing methodologies. On the LUAD dataset, MOLUNGN achieved accuracy (ACC) of 0.84, Recall_weighted of 0.84, F1_weighted of 0.83, and F1_macro of 0.82. On the LUSC dataset, the model further improved, achieving ACC of 0.86, Recall_weighted of 0.86, F1_weighted of 0.85, and F1_macro of 0.84. Notably, critical stage-specific biomarkers with significant biological relevance to lung cancer progression were identified, facilitating robust gene-disease associations.

Discussion: Our findings underscore the efficacy of MOLUNGN as an integrative framework in accurately classifying lung cancer stages and uncovering essential biomarkers. These biomarkers provide deep insights into lung cancer progression mechanisms and represent promising targets for future clinical validation. Integrating these biomarkers into the TCM-target-disease network enriches the understanding of TCM therapeutic potentials, laying a robust foundation for future precision medicine applications.

MOLUNGN:用于生物标志物发现和肺癌准确分类的多组学图神经网络。
导言:肺癌由于其高发病率和死亡率,继续构成重大的全球健康负担。本研究旨在系统地整合生物医学数据集,特别是结合中医相关的多组学数据,采用先进的深度学习方法,通过图注意机制增强。我们试图研究肺癌分期进展的分子机制,并确定关键的分期特异性生物标志物,以支持精确的癌症分期分类。方法:基于图注意力网络(GAT),建立了一种新的多组学整合模型,命名为多组学肺癌图网络(MOLUNGN)。分析非小细胞肺癌(NSCLC)的临床数据集,包括肺腺癌(LUAD)和肺鳞状细胞癌(LUSC),以创建组学特异性特征矩阵,包括mRNA表达、miRNA突变谱和DNA甲基化数据。MOLUNGN将组学特异性GAT模块(OSGAT)与多组学视图相关性发现网络(MOVCDN)相结合,有效捕获组学内部和组学间的相关性。该框架能够将临床病例全面分类为精确的癌症阶段,同时提取特定阶段的生物标志物。结果:利用公开可用数据集的评估证实了MOLUNGN优于现有方法的性能。在LUAD数据集上,MOLUNGN的准确率(ACC)为0.84,Recall_weighted为0.84,F1_weighted为0.83,F1_macro为0.82。在LUSC数据集上,模型得到进一步改进,ACC为0.86,Recall_weighted为0.86,F1_weighted为0.85,F1_macro为0.84。值得注意的是,发现了与肺癌进展具有重要生物学相关性的关键阶段特异性生物标志物,促进了强大的基因与疾病的关联。讨论:我们的研究结果强调了MOLUNGN作为准确分类肺癌分期和揭示必要生物标志物的综合框架的有效性。这些生物标志物为肺癌进展机制提供了深入的见解,并代表了未来临床验证的有希望的靶点。将这些生物标志物整合到中医靶病网络中,丰富了对中医治疗潜力的认识,为未来精准医疗的应用奠定了坚实的基础。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Frontiers in Genetics
Frontiers in Genetics Biochemistry, Genetics and Molecular Biology-Molecular Medicine
CiteScore
5.50
自引率
8.10%
发文量
3491
审稿时长
14 weeks
期刊介绍: Frontiers in Genetics publishes rigorously peer-reviewed research on genes and genomes relating to all the domains of life, from humans to plants to livestock and other model organisms. Led by an outstanding Editorial Board of the world’s leading experts, this multidisciplinary, open-access journal is at the forefront of communicating cutting-edge research to researchers, academics, clinicians, policy makers and the public. The study of inheritance and the impact of the genome on various biological processes is well documented. However, the majority of discoveries are still to come. A new era is seeing major developments in the function and variability of the genome, the use of genetic and genomic tools and the analysis of the genetic basis of various biological phenomena.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信