Machine learning integration of single-cell and bulk transcriptomics identifies fibroblast-driven prognostic markers in colorectal cancer.

0 MEDICINE, RESEARCH & EXPERIMENTAL
Ning Zhang, Ruiyan Liu, Siya Wu, Chenxi Feng, Boxiang Wang, Qiaoqiao Zheng, Linru Jie, Ruihua Kang, Xiaoli Guo, Xiaoyang Wang, Shaokai Zhang, Jiangong Zhang
{"title":"Machine learning integration of single-cell and bulk transcriptomics identifies fibroblast-driven prognostic markers in colorectal cancer.","authors":"Ning Zhang, Ruiyan Liu, Siya Wu, Chenxi Feng, Boxiang Wang, Qiaoqiao Zheng, Linru Jie, Ruihua Kang, Xiaoli Guo, Xiaoyang Wang, Shaokai Zhang, Jiangong Zhang","doi":"10.17305/bb.2025.12038","DOIUrl":null,"url":null,"abstract":"<p><p>Single-cell RNA sequencing (scRNA-seq) has significantly advanced our understanding of cellular heterogeneity and the complex interplay within the tumor microenvironment (TME) of colorectal cancer (CRC). However, translating these molecular insights into clinically actionable prognostic biomarkers and therapeutic strategies remains a considerable challenge. In this study, we conducted a comprehensive scRNA-seq analysis of 306 CRC samples comprising 448,255 cells to characterize the TME in depth. By constructing intercellular communication networks based on connection counts and communication probabilities, we identified fibroblasts as central regulatory hubs within the TME. Using Wilcoxon rank-sum tests and univariate survival analyses, we initially identified 23 prognostic fibroblast markers. These were refined to a seven-gene fibroblast-related prognostic signature via an integrated machine learning approach. The signature exhibited robust predictive performance in the The Cancer Genome Atlas - Colon Adenocarcinoma (TCGA-COAD) training cohort (n=351; C-index=0.65) and was successfully validated in the GSE17536 dataset (n=177; C-index=0.63). Functional enrichment analyses revealed that this signature is involved in immune regulation and multiple tumor-associated cellular pathways. Notably, high-risk patients displayed increased macrophage and NK cell infiltration, impaired immune function, and elevated immune rejection scores, while low-risk patients demonstrated heightened sensitivity to camptothecin and irinotecan. Together, our findings underscore the prognostic value of fibroblast-derived signatures in CRC and support their potential utility in risk stratification and the development of personalized therapeutic strategies, contributing to the advancement of precision oncology.</p>","PeriodicalId":72398,"journal":{"name":"Biomolecules & biomedicine","volume":" ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2025-04-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomolecules & biomedicine","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17305/bb.2025.12038","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"0","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

Single-cell RNA sequencing (scRNA-seq) has significantly advanced our understanding of cellular heterogeneity and the complex interplay within the tumor microenvironment (TME) of colorectal cancer (CRC). However, translating these molecular insights into clinically actionable prognostic biomarkers and therapeutic strategies remains a considerable challenge. In this study, we conducted a comprehensive scRNA-seq analysis of 306 CRC samples comprising 448,255 cells to characterize the TME in depth. By constructing intercellular communication networks based on connection counts and communication probabilities, we identified fibroblasts as central regulatory hubs within the TME. Using Wilcoxon rank-sum tests and univariate survival analyses, we initially identified 23 prognostic fibroblast markers. These were refined to a seven-gene fibroblast-related prognostic signature via an integrated machine learning approach. The signature exhibited robust predictive performance in the The Cancer Genome Atlas - Colon Adenocarcinoma (TCGA-COAD) training cohort (n=351; C-index=0.65) and was successfully validated in the GSE17536 dataset (n=177; C-index=0.63). Functional enrichment analyses revealed that this signature is involved in immune regulation and multiple tumor-associated cellular pathways. Notably, high-risk patients displayed increased macrophage and NK cell infiltration, impaired immune function, and elevated immune rejection scores, while low-risk patients demonstrated heightened sensitivity to camptothecin and irinotecan. Together, our findings underscore the prognostic value of fibroblast-derived signatures in CRC and support their potential utility in risk stratification and the development of personalized therapeutic strategies, contributing to the advancement of precision oncology.

单细胞和大量转录组学的机器学习整合鉴定结直肠癌成纤维细胞驱动的预后标志物。
单细胞RNA测序(scRNA-seq)极大地促进了我们对结直肠癌(CRC)细胞异质性和肿瘤微环境(TME)内复杂相互作用的理解。然而,将这些分子见解转化为临床可操作的预后生物标志物和治疗策略仍然是一个相当大的挑战。在这项研究中,我们对306份CRC样本进行了全面的scRNA-seq分析,其中包括448,255个细胞,以深入表征TME。通过构建基于连接数和通信概率的细胞间通信网络,我们确定成纤维细胞是TME中的中央调控枢纽。使用Wilcoxon秩和试验和单变量生存分析,我们初步确定了23种预后成纤维细胞标记物。通过集成的机器学习方法,这些被精炼为7个基因成纤维细胞相关的预后特征。该标记在癌症基因组图谱-结肠腺癌(TCGA-COAD)培训队列中显示出强大的预测性能(n=351;C-index=0.65),并在GSE17536数据集(n=177;c指数= 0.63)。功能富集分析显示,该信号参与免疫调节和多种肿瘤相关的细胞通路。值得注意的是,高风险患者表现出巨噬细胞和NK细胞浸润增加,免疫功能受损,免疫排斥评分升高,而低风险患者表现出对喜树碱和伊立替康的敏感性升高。总之,我们的研究结果强调了成纤维细胞衍生特征在结直肠癌中的预后价值,并支持它们在风险分层和个性化治疗策略开发中的潜在应用,有助于精确肿瘤学的进步。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
1.10
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信