机器学习和基因网络整合揭示胰腺癌预后子网络和生物标志物。

IF 4.1 2区 生物学 Q2 BIOCHEMISTRY & MOLECULAR BIOLOGY
Computational and structural biotechnology journal Pub Date : 2025-09-20 eCollection Date: 2025-01-01 DOI:10.1016/j.csbj.2025.09.028
Rana Salihoglu, Jesus Nieves, Gudrun Dandekar, Regina Ebert, Maximilian Rudert, Thomas Dandekar, Elena Bencurova
{"title":"机器学习和基因网络整合揭示胰腺癌预后子网络和生物标志物。","authors":"Rana Salihoglu, Jesus Nieves, Gudrun Dandekar, Regina Ebert, Maximilian Rudert, Thomas Dandekar, Elena Bencurova","doi":"10.1016/j.csbj.2025.09.028","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Pancreatic cancer has a high mortality rate and lacks early detection markers. Advanced methods, such as machine learning (ML) and network analysis, identify central cancer networks with potential diagnostic and prognostic biomarkers, leading to improved tumor targeting strategies<b>.</b></p><p><strong>Methods: </strong>We systematically collected pancreatic cancer transcriptome datasets from the databases TCGA, GTEx, and GEO. Weighted gene co-expression network analysis (WGCNA) identified gene modules associated with clinical traits. Multiple machine learning-based feature selection methods (Random Forest, Support Vector Machine, LASSO, ReliefF) and differential gene expression (DGE) analysis prioritized candidate genes. Functional enrichment (Gene Ontology and KEGG pathway database) examined biological processes involved in tumor progression and immune evasion. Survival analyses evaluated prognostic significance.</p><p><strong>Results: </strong>WGCNA identified pancreatic cancer networks from key gene modules strongly associated with cancer stage and survival. Common biomarkers, including transcripts from genes <i>ANLN</i>, <i>GPRC5A</i>, <i>KLF6</i>, <i>MUC1</i>, and <i>PHF20</i>, demonstrated significant diagnostic and prognostic potential as shown by ML, WGCNA, DGE, and survival analyzes. <i>In vitro</i> validation was performed for proteins mucin1 and CD44 in patient samples and tissue models.</p><p><strong>Conclusion: </strong>This study identified novel regulatory cancer networks and associated biomarkers for pancreatic cancer prognosis and diagnosis by integrating WGCNA with ML, DGE, pathway, and survival analyses. An interactive web portal to explore the full results and visualizations is available at pc-biomarkers.de. Future work will further validate these biomarkers to improve early detection, prognosis, and treatment strategies.</p>","PeriodicalId":10715,"journal":{"name":"Computational and structural biotechnology journal","volume":"27 ","pages":"4151-4162"},"PeriodicalIF":4.1000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12495057/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine learning and gene network integration reveal prognostic subnetworks and biomarkers in pancreatic cancer.\",\"authors\":\"Rana Salihoglu, Jesus Nieves, Gudrun Dandekar, Regina Ebert, Maximilian Rudert, Thomas Dandekar, Elena Bencurova\",\"doi\":\"10.1016/j.csbj.2025.09.028\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Pancreatic cancer has a high mortality rate and lacks early detection markers. Advanced methods, such as machine learning (ML) and network analysis, identify central cancer networks with potential diagnostic and prognostic biomarkers, leading to improved tumor targeting strategies<b>.</b></p><p><strong>Methods: </strong>We systematically collected pancreatic cancer transcriptome datasets from the databases TCGA, GTEx, and GEO. Weighted gene co-expression network analysis (WGCNA) identified gene modules associated with clinical traits. Multiple machine learning-based feature selection methods (Random Forest, Support Vector Machine, LASSO, ReliefF) and differential gene expression (DGE) analysis prioritized candidate genes. Functional enrichment (Gene Ontology and KEGG pathway database) examined biological processes involved in tumor progression and immune evasion. Survival analyses evaluated prognostic significance.</p><p><strong>Results: </strong>WGCNA identified pancreatic cancer networks from key gene modules strongly associated with cancer stage and survival. Common biomarkers, including transcripts from genes <i>ANLN</i>, <i>GPRC5A</i>, <i>KLF6</i>, <i>MUC1</i>, and <i>PHF20</i>, demonstrated significant diagnostic and prognostic potential as shown by ML, WGCNA, DGE, and survival analyzes. <i>In vitro</i> validation was performed for proteins mucin1 and CD44 in patient samples and tissue models.</p><p><strong>Conclusion: </strong>This study identified novel regulatory cancer networks and associated biomarkers for pancreatic cancer prognosis and diagnosis by integrating WGCNA with ML, DGE, pathway, and survival analyses. An interactive web portal to explore the full results and visualizations is available at pc-biomarkers.de. Future work will further validate these biomarkers to improve early detection, prognosis, and treatment strategies.</p>\",\"PeriodicalId\":10715,\"journal\":{\"name\":\"Computational and structural biotechnology journal\",\"volume\":\"27 \",\"pages\":\"4151-4162\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12495057/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational and structural biotechnology journal\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1016/j.csbj.2025.09.028\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational and structural biotechnology journal","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.csbj.2025.09.028","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
引用次数: 0

摘要

背景:胰腺癌死亡率高,缺乏早期检测标志物。先进的方法,如机器学习(ML)和网络分析,识别具有潜在诊断和预后生物标志物的中心癌症网络,从而改进肿瘤靶向策略。方法:系统地从TCGA、GTEx和GEO数据库中收集胰腺癌转录组数据集。加权基因共表达网络分析(WGCNA)确定了与临床特征相关的基因模块。基于机器学习的多种特征选择方法(随机森林、支持向量机、LASSO、ReliefF)和差异基因表达(DGE)分析对候选基因进行了优先排序。功能富集(基因本体和KEGG通路数据库)检测了肿瘤进展和免疫逃避的生物学过程。生存分析评估预后意义。结果:WGCNA从与癌症分期和生存密切相关的关键基因模块中鉴定出胰腺癌网络。常见的生物标志物,包括基因ANLN、GPRC5A、KLF6、MUC1和PHF20的转录本,在ML、WGCNA、DGE和生存分析中显示出显著的诊断和预后潜力。在患者样本和组织模型中对mucin1和CD44蛋白进行了体外验证。结论:本研究通过将WGCNA与ML、DGE、通路和生存分析相结合,确定了胰腺癌预后和诊断的新的调节肿瘤网络和相关生物标志物。在pc-biomarkers.de上有一个交互式门户网站,可以探索完整的结果和可视化。未来的工作将进一步验证这些生物标志物,以改善早期检测、预后和治疗策略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine learning and gene network integration reveal prognostic subnetworks and biomarkers in pancreatic cancer.

Background: Pancreatic cancer has a high mortality rate and lacks early detection markers. Advanced methods, such as machine learning (ML) and network analysis, identify central cancer networks with potential diagnostic and prognostic biomarkers, leading to improved tumor targeting strategies.

Methods: We systematically collected pancreatic cancer transcriptome datasets from the databases TCGA, GTEx, and GEO. Weighted gene co-expression network analysis (WGCNA) identified gene modules associated with clinical traits. Multiple machine learning-based feature selection methods (Random Forest, Support Vector Machine, LASSO, ReliefF) and differential gene expression (DGE) analysis prioritized candidate genes. Functional enrichment (Gene Ontology and KEGG pathway database) examined biological processes involved in tumor progression and immune evasion. Survival analyses evaluated prognostic significance.

Results: WGCNA identified pancreatic cancer networks from key gene modules strongly associated with cancer stage and survival. Common biomarkers, including transcripts from genes ANLN, GPRC5A, KLF6, MUC1, and PHF20, demonstrated significant diagnostic and prognostic potential as shown by ML, WGCNA, DGE, and survival analyzes. In vitro validation was performed for proteins mucin1 and CD44 in patient samples and tissue models.

Conclusion: This study identified novel regulatory cancer networks and associated biomarkers for pancreatic cancer prognosis and diagnosis by integrating WGCNA with ML, DGE, pathway, and survival analyses. An interactive web portal to explore the full results and visualizations is available at pc-biomarkers.de. Future work will further validate these biomarkers to improve early detection, prognosis, and treatment strategies.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Computational and structural biotechnology journal
Computational and structural biotechnology journal Biochemistry, Genetics and Molecular Biology-Biophysics
CiteScore
9.30
自引率
3.30%
发文量
540
审稿时长
6 weeks
期刊介绍: Computational and Structural Biotechnology Journal (CSBJ) is an online gold open access journal publishing research articles and reviews after full peer review. All articles are published, without barriers to access, immediately upon acceptance. The journal places a strong emphasis on functional and mechanistic understanding of how molecular components in a biological process work together through the application of computational methods. Structural data may provide such insights, but they are not a pre-requisite for publication in the journal. Specific areas of interest include, but are not limited to: Structure and function of proteins, nucleic acids and other macromolecules Structure and function of multi-component complexes Protein folding, processing and degradation Enzymology Computational and structural studies of plant systems Microbial Informatics Genomics Proteomics Metabolomics Algorithms and Hypothesis in Bioinformatics Mathematical and Theoretical Biology Computational Chemistry and Drug Discovery Microscopy and Molecular Imaging Nanotechnology Systems and Synthetic Biology
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信