MLGCN-Driver:一种基于多层图卷积神经网络的癌症驱动基因识别方法。

IF 3.3 3区 生物学 Q2 BIOCHEMICAL RESEARCH METHODS
Pi-Jing Wei, Jingxin Zhou, Rui-Fen Cao, Yun Ding, Zhenyu Yue, Chun-Hou Zheng
{"title":"MLGCN-Driver:一种基于多层图卷积神经网络的癌症驱动基因识别方法。","authors":"Pi-Jing Wei, Jingxin Zhou, Rui-Fen Cao, Yun Ding, Zhenyu Yue, Chun-Hou Zheng","doi":"10.1186/s12859-025-06260-8","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The progression of cancer is driven by the accumulation of mutations in driver genes. Many researches promote to identify cancer driver genes. However, most of them ignore the high-order features in the network.</p><p><strong>Result: </strong>In this study, we propose a novel method MLGCN-Driver based on multi-layer graph convolutional neural networks (GCN) to boost driver gene identification. MLGCN-Driver employs multi-layer GCN with initial residual connections and identity mappings to learn biological multi-omics features within biological networks. In addition, node2vec algorithm is used to extract the topological structure features of the biological network, and then the features are fed into another multi-layer GCN for feature learning. Meanwhile, the initial residual connections and identity mappings mitigate the over-smooth of features. Finally, the probability of each gene being a driver gene is calculated based on low-dimensional biological features and topological features.</p><p><strong>Conclusion: </strong>We applied the MLGCN-Driver on pan-cancer dataset and cancer type-specific datasets. Experimental results demonstrate the excellent performance of MLGCN-Driver in terms of the area under the ROC curve (AUC) and the area under the precision-recall curve (AUPRC) when compared with state-of-the-art approaches.</p>","PeriodicalId":8958,"journal":{"name":"BMC Bioinformatics","volume":"26 1","pages":"233"},"PeriodicalIF":3.3000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12482577/pdf/","citationCount":"0","resultStr":"{\"title\":\"MLGCN-Driver: a cancer driver gene identification method based on multi-layer graph convolutional neural network.\",\"authors\":\"Pi-Jing Wei, Jingxin Zhou, Rui-Fen Cao, Yun Ding, Zhenyu Yue, Chun-Hou Zheng\",\"doi\":\"10.1186/s12859-025-06260-8\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The progression of cancer is driven by the accumulation of mutations in driver genes. Many researches promote to identify cancer driver genes. However, most of them ignore the high-order features in the network.</p><p><strong>Result: </strong>In this study, we propose a novel method MLGCN-Driver based on multi-layer graph convolutional neural networks (GCN) to boost driver gene identification. MLGCN-Driver employs multi-layer GCN with initial residual connections and identity mappings to learn biological multi-omics features within biological networks. In addition, node2vec algorithm is used to extract the topological structure features of the biological network, and then the features are fed into another multi-layer GCN for feature learning. Meanwhile, the initial residual connections and identity mappings mitigate the over-smooth of features. Finally, the probability of each gene being a driver gene is calculated based on low-dimensional biological features and topological features.</p><p><strong>Conclusion: </strong>We applied the MLGCN-Driver on pan-cancer dataset and cancer type-specific datasets. Experimental results demonstrate the excellent performance of MLGCN-Driver in terms of the area under the ROC curve (AUC) and the area under the precision-recall curve (AUPRC) when compared with state-of-the-art approaches.</p>\",\"PeriodicalId\":8958,\"journal\":{\"name\":\"BMC Bioinformatics\",\"volume\":\"26 1\",\"pages\":\"233\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-09-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12482577/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Bioinformatics\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1186/s12859-025-06260-8\",\"RegionNum\":3,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMICAL RESEARCH METHODS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Bioinformatics","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1186/s12859-025-06260-8","RegionNum":3,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOCHEMICAL RESEARCH METHODS","Score":null,"Total":0}
引用次数: 0

摘要

背景:癌症的进展是由驱动基因突变的积累所驱动的。许多研究都在促进癌症驱动基因的识别。然而,它们大多忽略了网络中的高阶特征。结果:在本研究中,我们提出了一种基于多层图卷积神经网络(GCN)的新方法MLGCN-Driver来促进驱动基因的识别。MLGCN-Driver采用具有初始残差连接和身份映射的多层GCN来学习生物网络中的生物多组学特征。此外,使用node2vec算法提取生物网络的拓扑结构特征,然后将特征馈送到另一个多层GCN中进行特征学习。同时,利用初始残差连接和身份映射来缓解特征的过度平滑。最后,根据低维生物特征和拓扑特征计算每个基因为驱动基因的概率。结论:我们将MLGCN-Driver应用于泛癌症数据集和癌症类型特异性数据集。实验结果表明,与现有方法相比,MLGCN-Driver在ROC曲线下面积(AUC)和precision-recall曲线下面积(AUPRC)方面表现优异。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
MLGCN-Driver: a cancer driver gene identification method based on multi-layer graph convolutional neural network.

Background: The progression of cancer is driven by the accumulation of mutations in driver genes. Many researches promote to identify cancer driver genes. However, most of them ignore the high-order features in the network.

Result: In this study, we propose a novel method MLGCN-Driver based on multi-layer graph convolutional neural networks (GCN) to boost driver gene identification. MLGCN-Driver employs multi-layer GCN with initial residual connections and identity mappings to learn biological multi-omics features within biological networks. In addition, node2vec algorithm is used to extract the topological structure features of the biological network, and then the features are fed into another multi-layer GCN for feature learning. Meanwhile, the initial residual connections and identity mappings mitigate the over-smooth of features. Finally, the probability of each gene being a driver gene is calculated based on low-dimensional biological features and topological features.

Conclusion: We applied the MLGCN-Driver on pan-cancer dataset and cancer type-specific datasets. Experimental results demonstrate the excellent performance of MLGCN-Driver in terms of the area under the ROC curve (AUC) and the area under the precision-recall curve (AUPRC) when compared with state-of-the-art approaches.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Bioinformatics
BMC Bioinformatics 生物-生化研究方法
CiteScore
5.70
自引率
3.30%
发文量
506
审稿时长
4.3 months
期刊介绍: BMC Bioinformatics is an open access, peer-reviewed journal that considers articles on all aspects of the development, testing and novel application of computational and statistical methods for the modeling and analysis of all kinds of biological data, as well as other areas of computational biology. BMC Bioinformatics is part of the BMC series which publishes subject-specific journals focused on the needs of individual research communities across all areas of biology and medicine. We offer an efficient, fair and friendly peer review service, and are committed to publishing all sound science, provided that there is some advance in knowledge presented by the work.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信