Rana Salihoglu, Jesus Nieves, Gudrun Dandekar, Regina Ebert, Maximilian Rudert, Thomas Dandekar, Elena Bencurova
{"title":"机器学习和基因网络整合揭示胰腺癌预后子网络和生物标志物。","authors":"Rana Salihoglu, Jesus Nieves, Gudrun Dandekar, Regina Ebert, Maximilian Rudert, Thomas Dandekar, Elena Bencurova","doi":"10.1016/j.csbj.2025.09.028","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Pancreatic cancer has a high mortality rate and lacks early detection markers. Advanced methods, such as machine learning (ML) and network analysis, identify central cancer networks with potential diagnostic and prognostic biomarkers, leading to improved tumor targeting strategies<b>.</b></p><p><strong>Methods: </strong>We systematically collected pancreatic cancer transcriptome datasets from the databases TCGA, GTEx, and GEO. Weighted gene co-expression network analysis (WGCNA) identified gene modules associated with clinical traits. Multiple machine learning-based feature selection methods (Random Forest, Support Vector Machine, LASSO, ReliefF) and differential gene expression (DGE) analysis prioritized candidate genes. Functional enrichment (Gene Ontology and KEGG pathway database) examined biological processes involved in tumor progression and immune evasion. Survival analyses evaluated prognostic significance.</p><p><strong>Results: </strong>WGCNA identified pancreatic cancer networks from key gene modules strongly associated with cancer stage and survival. Common biomarkers, including transcripts from genes <i>ANLN</i>, <i>GPRC5A</i>, <i>KLF6</i>, <i>MUC1</i>, and <i>PHF20</i>, demonstrated significant diagnostic and prognostic potential as shown by ML, WGCNA, DGE, and survival analyzes. <i>In vitro</i> validation was performed for proteins mucin1 and CD44 in patient samples and tissue models.</p><p><strong>Conclusion: </strong>This study identified novel regulatory cancer networks and associated biomarkers for pancreatic cancer prognosis and diagnosis by integrating WGCNA with ML, DGE, pathway, and survival analyses. An interactive web portal to explore the full results and visualizations is available at pc-biomarkers.de. Future work will further validate these biomarkers to improve early detection, prognosis, and treatment strategies.</p>","PeriodicalId":10715,"journal":{"name":"Computational and structural biotechnology journal","volume":"27 ","pages":"4151-4162"},"PeriodicalIF":4.1000,"publicationDate":"2025-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12495057/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine learning and gene network integration reveal prognostic subnetworks and biomarkers in pancreatic cancer.\",\"authors\":\"Rana Salihoglu, Jesus Nieves, Gudrun Dandekar, Regina Ebert, Maximilian Rudert, Thomas Dandekar, Elena Bencurova\",\"doi\":\"10.1016/j.csbj.2025.09.028\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Pancreatic cancer has a high mortality rate and lacks early detection markers. Advanced methods, such as machine learning (ML) and network analysis, identify central cancer networks with potential diagnostic and prognostic biomarkers, leading to improved tumor targeting strategies<b>.</b></p><p><strong>Methods: </strong>We systematically collected pancreatic cancer transcriptome datasets from the databases TCGA, GTEx, and GEO. Weighted gene co-expression network analysis (WGCNA) identified gene modules associated with clinical traits. Multiple machine learning-based feature selection methods (Random Forest, Support Vector Machine, LASSO, ReliefF) and differential gene expression (DGE) analysis prioritized candidate genes. Functional enrichment (Gene Ontology and KEGG pathway database) examined biological processes involved in tumor progression and immune evasion. Survival analyses evaluated prognostic significance.</p><p><strong>Results: </strong>WGCNA identified pancreatic cancer networks from key gene modules strongly associated with cancer stage and survival. Common biomarkers, including transcripts from genes <i>ANLN</i>, <i>GPRC5A</i>, <i>KLF6</i>, <i>MUC1</i>, and <i>PHF20</i>, demonstrated significant diagnostic and prognostic potential as shown by ML, WGCNA, DGE, and survival analyzes. <i>In vitro</i> validation was performed for proteins mucin1 and CD44 in patient samples and tissue models.</p><p><strong>Conclusion: </strong>This study identified novel regulatory cancer networks and associated biomarkers for pancreatic cancer prognosis and diagnosis by integrating WGCNA with ML, DGE, pathway, and survival analyses. An interactive web portal to explore the full results and visualizations is available at pc-biomarkers.de. Future work will further validate these biomarkers to improve early detection, prognosis, and treatment strategies.</p>\",\"PeriodicalId\":10715,\"journal\":{\"name\":\"Computational and structural biotechnology journal\",\"volume\":\"27 \",\"pages\":\"4151-4162\"},\"PeriodicalIF\":4.1000,\"publicationDate\":\"2025-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12495057/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Computational and structural biotechnology journal\",\"FirstCategoryId\":\"99\",\"ListUrlMain\":\"https://doi.org/10.1016/j.csbj.2025.09.028\",\"RegionNum\":2,\"RegionCategory\":\"生物学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"BIOCHEMISTRY & MOLECULAR BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computational and structural biotechnology journal","FirstCategoryId":"99","ListUrlMain":"https://doi.org/10.1016/j.csbj.2025.09.028","RegionNum":2,"RegionCategory":"生物学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"BIOCHEMISTRY & MOLECULAR BIOLOGY","Score":null,"Total":0}
Machine learning and gene network integration reveal prognostic subnetworks and biomarkers in pancreatic cancer.
Background: Pancreatic cancer has a high mortality rate and lacks early detection markers. Advanced methods, such as machine learning (ML) and network analysis, identify central cancer networks with potential diagnostic and prognostic biomarkers, leading to improved tumor targeting strategies.
Methods: We systematically collected pancreatic cancer transcriptome datasets from the databases TCGA, GTEx, and GEO. Weighted gene co-expression network analysis (WGCNA) identified gene modules associated with clinical traits. Multiple machine learning-based feature selection methods (Random Forest, Support Vector Machine, LASSO, ReliefF) and differential gene expression (DGE) analysis prioritized candidate genes. Functional enrichment (Gene Ontology and KEGG pathway database) examined biological processes involved in tumor progression and immune evasion. Survival analyses evaluated prognostic significance.
Results: WGCNA identified pancreatic cancer networks from key gene modules strongly associated with cancer stage and survival. Common biomarkers, including transcripts from genes ANLN, GPRC5A, KLF6, MUC1, and PHF20, demonstrated significant diagnostic and prognostic potential as shown by ML, WGCNA, DGE, and survival analyzes. In vitro validation was performed for proteins mucin1 and CD44 in patient samples and tissue models.
Conclusion: This study identified novel regulatory cancer networks and associated biomarkers for pancreatic cancer prognosis and diagnosis by integrating WGCNA with ML, DGE, pathway, and survival analyses. An interactive web portal to explore the full results and visualizations is available at pc-biomarkers.de. Future work will further validate these biomarkers to improve early detection, prognosis, and treatment strategies.
期刊介绍:
Computational and Structural Biotechnology Journal (CSBJ) is an online gold open access journal publishing research articles and reviews after full peer review. All articles are published, without barriers to access, immediately upon acceptance. The journal places a strong emphasis on functional and mechanistic understanding of how molecular components in a biological process work together through the application of computational methods. Structural data may provide such insights, but they are not a pre-requisite for publication in the journal. Specific areas of interest include, but are not limited to:
Structure and function of proteins, nucleic acids and other macromolecules
Structure and function of multi-component complexes
Protein folding, processing and degradation
Enzymology
Computational and structural studies of plant systems
Microbial Informatics
Genomics
Proteomics
Metabolomics
Algorithms and Hypothesis in Bioinformatics
Mathematical and Theoretical Biology
Computational Chemistry and Drug Discovery
Microscopy and Molecular Imaging
Nanotechnology
Systems and Synthetic Biology