Meng Yue, Jianing Zhao, Si Wu, Lijing Cai, Xinran Wang, Ying Jia, Xiaoxiao Wang, Yongjun Wang, Yueping Liu
{"title":"Establishment of multiple machine learning prognostic model for gene differences between primary tumors and lymph nodes in luminal breast cancer.","authors":"Meng Yue, Jianing Zhao, Si Wu, Lijing Cai, Xinran Wang, Ying Jia, Xiaoxiao Wang, Yongjun Wang, Yueping Liu","doi":"10.1007/s10549-024-07574-6","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>This study aimed to explore the correlation between primary tumors (PT) and paired metastatic lymph nodes (LN) and to develop a predictive model to provide evidence for forecasting patient prognoses.</p><p><strong>Methods: </strong>We obtained single-cell and bulk transcriptome data from the Gene Expression Omnibus database. Furthermore, mRNA transcriptomic data, encompassing 112 normal tissues and 1066 breast cancer samples, along with survival, clinical, and mutation information for breast cancer patients, were acquired from The Cancer Genome Atlas (TCGA). Employing a machine learning integration framework incorporating ten distinct algorithms, we developed and validated a prognostic model.</p><p><strong>Results: </strong>We constructed a prognostic model named Lymph Node Metastasis-Related Scores (LMRS) using 26 differentially expressed genes trained on eight TCGA datasets. Across validation sets, the model demonstrated a high C-index, signifying its stability and effectiveness, outperforming 64 models from other studies. Notably, cytolytic activity and T cell co-stimulation were downregulated in the high LMRS group, alongside a downregulation of immune cells, including B cells, CD8 + T cells, iDCs, and TILs. Similarly, most immune checkpoints exhibited a decreasing trend with high LMRS expression. Finally, we selected the hub biomarkers PGK1 and HSP90 for pathological verification. Results indicated higher expression levels in PT and LN compared to normal and benign tumors, with higher expression levels in LN than in PT.</p><p><strong>Conclusion: </strong>This comprehensive analysis sheds light on gene expression differences between PT and LN in breast cancer, culminating in the development of a multiple-gene prognostic model with high clinical accuracy for prognosis prediction.</p>","PeriodicalId":9133,"journal":{"name":"Breast Cancer Research and Treatment","volume":" ","pages":"365-376"},"PeriodicalIF":3.0000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Breast Cancer Research and Treatment","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10549-024-07574-6","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/10 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: This study aimed to explore the correlation between primary tumors (PT) and paired metastatic lymph nodes (LN) and to develop a predictive model to provide evidence for forecasting patient prognoses.
Methods: We obtained single-cell and bulk transcriptome data from the Gene Expression Omnibus database. Furthermore, mRNA transcriptomic data, encompassing 112 normal tissues and 1066 breast cancer samples, along with survival, clinical, and mutation information for breast cancer patients, were acquired from The Cancer Genome Atlas (TCGA). Employing a machine learning integration framework incorporating ten distinct algorithms, we developed and validated a prognostic model.
Results: We constructed a prognostic model named Lymph Node Metastasis-Related Scores (LMRS) using 26 differentially expressed genes trained on eight TCGA datasets. Across validation sets, the model demonstrated a high C-index, signifying its stability and effectiveness, outperforming 64 models from other studies. Notably, cytolytic activity and T cell co-stimulation were downregulated in the high LMRS group, alongside a downregulation of immune cells, including B cells, CD8 + T cells, iDCs, and TILs. Similarly, most immune checkpoints exhibited a decreasing trend with high LMRS expression. Finally, we selected the hub biomarkers PGK1 and HSP90 for pathological verification. Results indicated higher expression levels in PT and LN compared to normal and benign tumors, with higher expression levels in LN than in PT.
Conclusion: This comprehensive analysis sheds light on gene expression differences between PT and LN in breast cancer, culminating in the development of a multiple-gene prognostic model with high clinical accuracy for prognosis prediction.
期刊介绍:
Breast Cancer Research and Treatment provides the surgeon, radiotherapist, medical oncologist, endocrinologist, epidemiologist, immunologist or cell biologist investigating problems in breast cancer a single forum for communication. The journal creates a "market place" for breast cancer topics which cuts across all the usual lines of disciplines, providing a site for presenting pertinent investigations, and for discussing critical questions relevant to the entire field. It seeks to develop a new focus and new perspectives for all those concerned with breast cancer.