Integrated Prognostic Model for Young Breast Cancer Patients: Insights from SEER, METABRIC, and TCGA Databases.

IF 2.5 3区 医学 Q2 ONCOLOGY
Yongxin Li, Dexiang Li, Xinlong Tao, Yinyin Ye, Chengrong Zhang, Zhengbo Xu, Zhilin Liu, Miaozhou Wang, Zhen Liu, Zitao Li, Hongxia Liang, Fanzhen Kong, Jiuda Zhao
{"title":"Integrated Prognostic Model for Young Breast Cancer Patients: Insights from SEER, METABRIC, and TCGA Databases.","authors":"Yongxin Li, Dexiang Li, Xinlong Tao, Yinyin Ye, Chengrong Zhang, Zhengbo Xu, Zhilin Liu, Miaozhou Wang, Zhen Liu, Zitao Li, Hongxia Liang, Fanzhen Kong, Jiuda Zhao","doi":"10.1016/j.clbc.2025.07.015","DOIUrl":null,"url":null,"abstract":"<p><p>This study aims to develop a prognostic model for breast cancer in young women (BCY) by integrating clinical and genomic data. We analyzed clinical data from SEER and identified key prognostic genes using METABRIC and TCGA datasets. Machine learning (LASSO and XGBoost) was used for gene selection, and a survival prediction model was built. The model was validated in independent datasets, and an interactive online nomogram was developed. The study included 139,994 breast cancer patients, confirming that age under 40 is an independent adverse factor for overall survival (OS). Using gene expression data from METABRIC and TCGA, PLA2G2A and C8orf76 were identified as critical prognostic biomarkers. These genes were incorporated into a predictive model that stratified patients into high- and low-risk groups with significant survival differences. The model achieved strong predictive performance, with AUC values of 0.862, 0.813, and 0.756 for 3-, 5-, and 8-year overall survival (OS), respectively. Additionally, a web-based tool (https://lyx00.shinyapps.io/BCY-Model-CG/) was developed to facilitate clinical implementation, allowing individualized survival predictions based on patient-specific clinical and genetic data. This integrated prognostic model, combining clinical and genomic data, improves survival prediction for breast cancer in young women (BCY) patients. It offers a more precise tool for risk assessment and personalized treatment planning.</p>","PeriodicalId":10197,"journal":{"name":"Clinical breast cancer","volume":" ","pages":""},"PeriodicalIF":2.5000,"publicationDate":"2025-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical breast cancer","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.clbc.2025.07.015","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

This study aims to develop a prognostic model for breast cancer in young women (BCY) by integrating clinical and genomic data. We analyzed clinical data from SEER and identified key prognostic genes using METABRIC and TCGA datasets. Machine learning (LASSO and XGBoost) was used for gene selection, and a survival prediction model was built. The model was validated in independent datasets, and an interactive online nomogram was developed. The study included 139,994 breast cancer patients, confirming that age under 40 is an independent adverse factor for overall survival (OS). Using gene expression data from METABRIC and TCGA, PLA2G2A and C8orf76 were identified as critical prognostic biomarkers. These genes were incorporated into a predictive model that stratified patients into high- and low-risk groups with significant survival differences. The model achieved strong predictive performance, with AUC values of 0.862, 0.813, and 0.756 for 3-, 5-, and 8-year overall survival (OS), respectively. Additionally, a web-based tool (https://lyx00.shinyapps.io/BCY-Model-CG/) was developed to facilitate clinical implementation, allowing individualized survival predictions based on patient-specific clinical and genetic data. This integrated prognostic model, combining clinical and genomic data, improves survival prediction for breast cancer in young women (BCY) patients. It offers a more precise tool for risk assessment and personalized treatment planning.

年轻乳腺癌患者的综合预后模型:来自SEER, METABRIC和TCGA数据库的见解。
本研究旨在通过整合临床和基因组数据,建立年轻女性乳腺癌(BCY)的预后模型。我们分析了来自SEER的临床数据,并使用METABRIC和TCGA数据集确定了关键的预后基因。利用机器学习(LASSO和XGBoost)进行基因选择,建立存活预测模型。在独立数据集上对模型进行了验证,并开发了交互式在线图。该研究纳入了139994名乳腺癌患者,证实年龄在40岁以下是影响总生存率(OS)的独立不利因素。利用METABRIC和TCGA的基因表达数据,PLA2G2A和C8orf76被确定为关键的预后生物标志物。这些基因被纳入一个预测模型,该模型将患者分为高危组和低危组,并存在显著的生存差异。该模型具有较强的预测性能,3年、5年和8年总生存期(OS)的AUC值分别为0.862、0.813和0.756。此外,开发了一个基于网络的工具(https://lyx00.shinyapps.io/BCY-Model-CG/)以促进临床实施,允许基于患者特定临床和遗传数据的个性化生存预测。这种综合预后模型结合了临床和基因组数据,提高了年轻女性乳腺癌患者的生存预测。它为风险评估和个性化治疗计划提供了更精确的工具。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Clinical breast cancer
Clinical breast cancer 医学-肿瘤学
CiteScore
5.40
自引率
3.20%
发文量
174
审稿时长
48 days
期刊介绍: Clinical Breast Cancer is a peer-reviewed bimonthly journal that publishes original articles describing various aspects of clinical and translational research of breast cancer. Clinical Breast Cancer is devoted to articles on detection, diagnosis, prevention, and treatment of breast cancer. The main emphasis is on recent scientific developments in all areas related to breast cancer. Specific areas of interest include clinical research reports from various therapeutic modalities, cancer genetics, drug sensitivity and resistance, novel imaging, tumor genomics, biomarkers, and chemoprevention strategies.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信