{"title":"Cancer Survival Prediction based on Soft-Label Guided Contrastive Learning and Global Feature Fusion.","authors":"Huiying Jiang, Wenlan Chen, Fei Guo, Cheng Liang","doi":"10.1093/bioinformatics/btaf552","DOIUrl":null,"url":null,"abstract":"<p><strong>Motivation: </strong>The high complexity and heterogeneity of cancer pose significant challenges to personalized treatment, making the improvement of cancer survival prediction accuracy crucial for clinical decision-making. The integration of multi-omics data enables a more comprehensive capture of multi-layered information in complex biological processes. However, existing survival analysis models still face limitations in accurately extracting and effectively integrating the unique and shared information from multi-omics data.</p><p><strong>Results: </strong>In this paper, we propose a novel prediction model for cancer survival based on soft-label guided contrastive learning and global feature fusion, namely SLCGF. Our model first extracts paired feature representations for each omics using Siamese encoders. We then perform intra-view and inter-view contrastive learning simultaneously, employing a neighborhood-based paradigm to enhance feature discrimination and alignment across omics. To ensure reliable neighbor retention and improve model robustness, we treat the affinities between samples and their high-order neighbors as soft labels to guide the contrastive learning process at both levels. In addition, we adopt a global self-attention mechanism to obtain the unified representation for cancer survival prediction, where the cross-omics connections are fully exploited and complementary information is adaptively integrated. We comprehensively evaluate the performance of our model on 13 cancer multi-omics datasets, and the experimental results demonstrate its superiority over existing approaches.</p><p><strong>Availability and implementation: </strong>Source code is available at https://github.com/LiangSDNULab/SLCGF.</p>","PeriodicalId":93899,"journal":{"name":"Bioinformatics (Oxford, England)","volume":" ","pages":""},"PeriodicalIF":5.4000,"publicationDate":"2025-09-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bioinformatics (Oxford, England)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/bioinformatics/btaf552","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Motivation: The high complexity and heterogeneity of cancer pose significant challenges to personalized treatment, making the improvement of cancer survival prediction accuracy crucial for clinical decision-making. The integration of multi-omics data enables a more comprehensive capture of multi-layered information in complex biological processes. However, existing survival analysis models still face limitations in accurately extracting and effectively integrating the unique and shared information from multi-omics data.
Results: In this paper, we propose a novel prediction model for cancer survival based on soft-label guided contrastive learning and global feature fusion, namely SLCGF. Our model first extracts paired feature representations for each omics using Siamese encoders. We then perform intra-view and inter-view contrastive learning simultaneously, employing a neighborhood-based paradigm to enhance feature discrimination and alignment across omics. To ensure reliable neighbor retention and improve model robustness, we treat the affinities between samples and their high-order neighbors as soft labels to guide the contrastive learning process at both levels. In addition, we adopt a global self-attention mechanism to obtain the unified representation for cancer survival prediction, where the cross-omics connections are fully exploited and complementary information is adaptively integrated. We comprehensively evaluate the performance of our model on 13 cancer multi-omics datasets, and the experimental results demonstrate its superiority over existing approaches.
Availability and implementation: Source code is available at https://github.com/LiangSDNULab/SLCGF.