Decoding temporal heterogeneity in NSCLC through machine learning and prognostic model construction.

IF 2.5 3区 医学 Q3 ONCOLOGY
Junpeng Cheng, Meizhu Xiao, Qingkang Meng, Min Zhang, Denan Zhang, Lei Liu, Qing Jin, Zhijin Fu, Yanjiao Li, Xiujie Chen, Hongbo Xie
{"title":"Decoding temporal heterogeneity in NSCLC through machine learning and prognostic model construction.","authors":"Junpeng Cheng, Meizhu Xiao, Qingkang Meng, Min Zhang, Denan Zhang, Lei Liu, Qing Jin, Zhijin Fu, Yanjiao Li, Xiujie Chen, Hongbo Xie","doi":"10.1186/s12957-024-03435-0","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Non-small cell lung cancer (NSCLC) is a prevalent and heterogeneous disease with significant genomic variations between the early and advanced stages. The identification of key genes and pathways driving NSCLC tumor progression is critical for improving the diagnosis and treatment outcomes of this disease.</p><p><strong>Methods: </strong>In this study, we conducted single-cell transcriptome analysis on 93,406 cells from 22 NSCLC patients to characterize malignant NSCLC cancer cells. Utilizing cNMF, we classified these cells into distinct modules, thus identifying the diverse molecular profiles within NSCLC. Through pseudotime analysis, we delineated temporal gene expression changes during NSCLC evolution, thus demonstrating genes associated with disease progression. Using the XGBoost model, we assessed the significance of these genes in the pseudotime trajectory. Our findings were validated by using transcriptome sequencing data from The Cancer Genome Atlas (TCGA), supplemented via LASSO regression to refine the selection of characteristic genes. Subsequently, we established a risk score model based on these genes, thus providing a potential tool for cancer risk assessment and personalized treatment strategies.</p><p><strong>Results: </strong>We used cNMF to classify malignant NSCLC cells into three functional modules, including the metabolic reprogramming module, cell cycle module, and cell stemness module, which can be used for the functional classification of malignant tumor cells in NSCLC. These findings also indicate that metabolism, the cell cycle, and tumor stemness play important driving roles in the malignant evolution of NSCLC. We integrated cNMF and XGBoost to select marker genes that are indicative of both early and advanced NSCLC stages. The expression of genes such as CHCHD2, GAPDH, and CD24 was strongly correlated with the malignant evolution of NSCLC at the single-cell data level. These genes have been validated via histological data. The risk score model that we established (represented by eight genes) was ultimately validated with GEO data.</p><p><strong>Conclusion: </strong>In summary, our study contributes to the identification of temporal heterogeneous biomarkers in NSCLC, thus offering insights into disease progression mechanisms and potential therapeutic targets. The developed workflow demonstrates promise for future applications in clinical practice.</p>","PeriodicalId":23856,"journal":{"name":"World Journal of Surgical Oncology","volume":null,"pages":null},"PeriodicalIF":2.5000,"publicationDate":"2024-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11170806/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"World Journal of Surgical Oncology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12957-024-03435-0","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Non-small cell lung cancer (NSCLC) is a prevalent and heterogeneous disease with significant genomic variations between the early and advanced stages. The identification of key genes and pathways driving NSCLC tumor progression is critical for improving the diagnosis and treatment outcomes of this disease.

Methods: In this study, we conducted single-cell transcriptome analysis on 93,406 cells from 22 NSCLC patients to characterize malignant NSCLC cancer cells. Utilizing cNMF, we classified these cells into distinct modules, thus identifying the diverse molecular profiles within NSCLC. Through pseudotime analysis, we delineated temporal gene expression changes during NSCLC evolution, thus demonstrating genes associated with disease progression. Using the XGBoost model, we assessed the significance of these genes in the pseudotime trajectory. Our findings were validated by using transcriptome sequencing data from The Cancer Genome Atlas (TCGA), supplemented via LASSO regression to refine the selection of characteristic genes. Subsequently, we established a risk score model based on these genes, thus providing a potential tool for cancer risk assessment and personalized treatment strategies.

Results: We used cNMF to classify malignant NSCLC cells into three functional modules, including the metabolic reprogramming module, cell cycle module, and cell stemness module, which can be used for the functional classification of malignant tumor cells in NSCLC. These findings also indicate that metabolism, the cell cycle, and tumor stemness play important driving roles in the malignant evolution of NSCLC. We integrated cNMF and XGBoost to select marker genes that are indicative of both early and advanced NSCLC stages. The expression of genes such as CHCHD2, GAPDH, and CD24 was strongly correlated with the malignant evolution of NSCLC at the single-cell data level. These genes have been validated via histological data. The risk score model that we established (represented by eight genes) was ultimately validated with GEO data.

Conclusion: In summary, our study contributes to the identification of temporal heterogeneous biomarkers in NSCLC, thus offering insights into disease progression mechanisms and potential therapeutic targets. The developed workflow demonstrates promise for future applications in clinical practice.

通过机器学习和预后模型构建解码 NSCLC 中的时间异质性。
背景:非小细胞肺癌(NSCLC)是一种常见的异质性疾病,其早期和晚期阶段的基因组差异显著。鉴定驱动 NSCLC 肿瘤进展的关键基因和通路对于改善该疾病的诊断和治疗效果至关重要:在这项研究中,我们对来自 22 名 NSCLC 患者的 93,406 个细胞进行了单细胞转录组分析,以确定恶性 NSCLC 癌细胞的特征。利用 cNMF,我们将这些细胞划分为不同的模块,从而确定了 NSCLC 中不同的分子特征。通过伪时间分析,我们划定了 NSCLC 演变过程中基因表达的时间变化,从而展示了与疾病进展相关的基因。利用 XGBoost 模型,我们评估了这些基因在伪时间轨迹中的重要性。我们利用癌症基因组图谱(TCGA)的转录组测序数据验证了我们的研究结果,并通过 LASSO 回归对特征基因的选择进行了补充。随后,我们根据这些基因建立了一个风险评分模型,从而为癌症风险评估和个性化治疗策略提供了一个潜在的工具:我们利用 cNMF 将恶性 NSCLC 细胞分为三个功能模块,包括代谢重编程模块、细胞周期模块和细胞干性模块,可用于 NSCLC 恶性肿瘤细胞的功能分类。这些发现还表明,代谢、细胞周期和肿瘤干性在NSCLC的恶性演化过程中起着重要的驱动作用。我们整合了 cNMF 和 XGBoost,筛选出了可指示 NSCLC 早期和晚期的标记基因。在单细胞数据水平上,CHCHD2、GAPDH 和 CD24 等基因的表达与 NSCLC 的恶性演变密切相关。这些基因已通过组织学数据得到验证。我们建立的风险评分模型(以八个基因为代表)最终得到了 GEO 数据的验证:总之,我们的研究有助于鉴定 NSCLC 中的时间异质性生物标记物,从而深入了解疾病进展机制和潜在的治疗靶点。所开发的工作流程有望在未来的临床实践中得到应用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
4.70
自引率
15.60%
发文量
362
审稿时长
3 months
期刊介绍: World Journal of Surgical Oncology publishes articles related to surgical oncology and its allied subjects, such as epidemiology, cancer research, biomarkers, prevention, pathology, radiology, cancer treatment, clinical trials, multimodality treatment and molecular biology. Emphasis is placed on original research articles. The journal also publishes significant clinical case reports, as well as balanced and timely reviews on selected topics. Oncology is a multidisciplinary super-speciality of which surgical oncology forms an integral component, especially with solid tumors. Surgical oncologists around the world are involved in research extending from detecting the mechanisms underlying the causation of cancer, to its treatment and prevention. The role of a surgical oncologist extends across the whole continuum of care. With continued developments in diagnosis and treatment, the role of a surgical oncologist is ever-changing. Hence, World Journal of Surgical Oncology aims to keep readers abreast with latest developments that will ultimately influence the work of surgical oncologists.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信