Development and validation of a leukemia prognostic model through single-cell RNA sequencing and machine learning approaches.

IF 2.9 4区 医学 Q3 ENDOCRINOLOGY & METABOLISM
Doujia Chen, Jie Yang, Mengting Wang, Tianye Jian
{"title":"Development and validation of a leukemia prognostic model through single-cell RNA sequencing and machine learning approaches.","authors":"Doujia Chen, Jie Yang, Mengting Wang, Tianye Jian","doi":"10.1007/s12672-025-03757-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Leukemia prognosis varies significantly among patients, highlighting the need for accurate prediction tools. Emerging evidence suggests that the immune microenvironment plays a crucial role in leukemia progression and treatment response.</p><p><strong>Methods: </strong>We analyzed RNA expression profiles and clinical data from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases, supplemented by single-cell RNA sequencing datasets. Differential gene expression analysis was performed using stringent criteria (logFC > 1, FDR < 0.05) to identify leukemia-associated genes. Ten distinct machine learning algorithms, including Lasso, CoxBoost, and ensemble methods, were implemented for prognostic model development with cross-platform validation. Single-cell analysis employed Seurat for quality control and cell type annotation, while CellChat algorithm mapped intercellular communication networks. Experimental validation was conducted using quantitative RT-PCR analysis of key immune markers (TLR2, TLR4, CCR7, IL18) in U937 and K562 leukemia cell lines compared to normal peripheral blood mononuclear cells.</p><p><strong>Results: </strong>The machine learning-derived prognostic model demonstrated exceptional predictive performance with area under the curve values of 0.874, 0.891, and 0.925 for 1-, 2-, and 3-year survival endpoints, respectively. Six critical immune regulatory genes (TLR2, TLR4, CCR7, IL18, TIRAP, FOXP3) were identified as both differentially expressed and prognostically significant, with IL18 showing the highest discriminative capacity (AUC = 0.983). RT-PCR validation confirmed significant upregulation of all tested genes in leukemia cell lines: TLR2 (3.8-fold in U937, 2.2-fold in K562), TLR4 (3.4-fold in U937, 1.8-fold in K562), CCR7 (4.1-fold in U937, 2.7-fold in K562), and IL18 (5.2-fold in U937, 3.6-fold in K562) compared to normal controls (all p < 0.05). Single-cell analysis revealed substantial cellular heterogeneity with cell type-specific expression patterns and complex intercellular communication networks involving B cells, T cells, natural killer cells, and dendritic cells.</p><p><strong>Conclusion: </strong>This study provides a reliable prognostic tool for leukemia and offers insights into the critical role of the immune microenvironment in leukemia pathogenesis. Our findings may guide the development of personalized immunotherapy strategies for leukemia patients.</p>","PeriodicalId":11148,"journal":{"name":"Discover. Oncology","volume":"16 1","pages":"1846"},"PeriodicalIF":2.9000,"publicationDate":"2025-10-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12514105/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Discover. Oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s12672-025-03757-9","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ENDOCRINOLOGY & METABOLISM","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Leukemia prognosis varies significantly among patients, highlighting the need for accurate prediction tools. Emerging evidence suggests that the immune microenvironment plays a crucial role in leukemia progression and treatment response.

Methods: We analyzed RNA expression profiles and clinical data from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases, supplemented by single-cell RNA sequencing datasets. Differential gene expression analysis was performed using stringent criteria (logFC > 1, FDR < 0.05) to identify leukemia-associated genes. Ten distinct machine learning algorithms, including Lasso, CoxBoost, and ensemble methods, were implemented for prognostic model development with cross-platform validation. Single-cell analysis employed Seurat for quality control and cell type annotation, while CellChat algorithm mapped intercellular communication networks. Experimental validation was conducted using quantitative RT-PCR analysis of key immune markers (TLR2, TLR4, CCR7, IL18) in U937 and K562 leukemia cell lines compared to normal peripheral blood mononuclear cells.

Results: The machine learning-derived prognostic model demonstrated exceptional predictive performance with area under the curve values of 0.874, 0.891, and 0.925 for 1-, 2-, and 3-year survival endpoints, respectively. Six critical immune regulatory genes (TLR2, TLR4, CCR7, IL18, TIRAP, FOXP3) were identified as both differentially expressed and prognostically significant, with IL18 showing the highest discriminative capacity (AUC = 0.983). RT-PCR validation confirmed significant upregulation of all tested genes in leukemia cell lines: TLR2 (3.8-fold in U937, 2.2-fold in K562), TLR4 (3.4-fold in U937, 1.8-fold in K562), CCR7 (4.1-fold in U937, 2.7-fold in K562), and IL18 (5.2-fold in U937, 3.6-fold in K562) compared to normal controls (all p < 0.05). Single-cell analysis revealed substantial cellular heterogeneity with cell type-specific expression patterns and complex intercellular communication networks involving B cells, T cells, natural killer cells, and dendritic cells.

Conclusion: This study provides a reliable prognostic tool for leukemia and offers insights into the critical role of the immune microenvironment in leukemia pathogenesis. Our findings may guide the development of personalized immunotherapy strategies for leukemia patients.

Abstract Image

Abstract Image

Abstract Image

通过单细胞RNA测序和机器学习方法开发和验证白血病预后模型。
背景:白血病患者预后差异显著,需要准确的预测工具。越来越多的证据表明,免疫微环境在白血病的进展和治疗反应中起着至关重要的作用。方法:我们分析了来自癌症基因组图谱(TCGA)和基因表达图谱(GEO)数据库的RNA表达谱和临床数据,并辅以单细胞RNA测序数据集。采用严格的标准(logFC >1, FDR)进行差异基因表达分析。结果:机器学习衍生的预后模型在1年、2年和3年生存终点的曲线下面积分别为0.874、0.891和0.925,显示出卓越的预测性能。6个关键免疫调节基因(TLR2、TLR4、CCR7、IL18、TIRAP、FOXP3)均有差异表达且具有预后意义,其中IL18的鉴别能力最高(AUC = 0.983)。RT-PCR验证证实,与正常对照相比,白血病细胞系中所有测试基因TLR2 (U937为3.8倍,K562为2.2倍)、TLR4 (U937为3.4倍,K562为1.8倍)、CCR7 (U937为4.1倍,K562为2.7倍)和IL18 (U937为5.2倍,K562为3.6倍)均显著上调(均p)。结论:本研究为白血病的预后提供了可靠的工具,并揭示了免疫微环境在白血病发病中的关键作用。我们的发现可能指导白血病患者个性化免疫治疗策略的发展。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Discover. Oncology
Discover. Oncology Medicine-Endocrinology, Diabetes and Metabolism
CiteScore
2.40
自引率
9.10%
发文量
122
审稿时长
5 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信