Machine learning-based prognostic model for human immunodeficiency virus-associated cutaneous T-cell lymphoma: A Surveillance, Epidemiology, and End Results database analysis.

IF 1.5 4区 医学 Q4 MEDICINE, RESEARCH & EXPERIMENTAL
Journal of International Medical Research Pub Date : 2025-09-01 Epub Date: 2025-09-08 DOI:10.1177/03000605251359433
Weimin Huang, Manwen Tian, Lanlan Jia, Jing Ai, Jinying Gan, Junteng Chen, Lingzhen Chen, Yongmin Zhang
{"title":"Machine learning-based prognostic model for human immunodeficiency virus-associated cutaneous T-cell lymphoma: A Surveillance, Epidemiology, and End Results database analysis.","authors":"Weimin Huang, Manwen Tian, Lanlan Jia, Jing Ai, Jinying Gan, Junteng Chen, Lingzhen Chen, Yongmin Zhang","doi":"10.1177/03000605251359433","DOIUrl":null,"url":null,"abstract":"<p><p>ObjectiveAccurate prognostication is crucial for managing human immunodeficiency virus (HIV)-associated cutaneous T-cell lymphoma. In this study, we aimed to develop an improved machine learning-based prognostic model for predicting the 5-year survival rates in HIV-associated cutaneous T-cell lymphoma patients.MethodsWe derived and tested machine learning models using algorithms including Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Random Forest. Our study involved data from a US population-based cohort of patients diagnosed with HIV-associated cutaneous T-cell lymphoma between 1 January 2000 and 31 December 2018, which were extracted from the Surveillance, Epidemiology, and End Results database. The primary outcome was the prediction of 5-year overall survival. Model discrimination was assessed using the area under the receiver operating characteristic curve (AUC), and calibration was assessed using Brier scores.ResultsA cohort of 381 HIV-associated cutaneous T-cell lymphoma patients was analyzed. Multivariate logistic regression identified age ≥60 years (odds ratio = 4.88), regional stage (odds ratio = 10.31), distant stage (odds ratio = 28.37), and chemotherapy (odds ratio = 4.71) as significant independent risk factors for 5-year mortality. Among seven machine learning models developed, the XGBoost model demonstrated the highest discrimination for 5-year overall survival (AUC = 0.867), followed by LightGBM (AUC = 0.835). Both models exhibited good calibration with low Brier scores (XGBoost = 0.130, LightGBM = 0.109). Support Vector Machine performed optimally in ten-fold cross-validation, logistic regression showed the lowest Brier score (0.106), and XGBoost provided the best balance of discrimination and robust performance.ConclusionOur novel machine learning approach produced prognostic models with superior discrimination for 5-year overall survival in HIV-associated cutaneous T-cell lymphoma patients using standard clinicopathological variables. These models offer potential for more accurate and personalized prognostics, potentially improving patient management and clinical decision-making.</p>","PeriodicalId":16129,"journal":{"name":"Journal of International Medical Research","volume":"53 9","pages":"3000605251359433"},"PeriodicalIF":1.5000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12417650/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of International Medical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/03000605251359433","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/8 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 0

Abstract

ObjectiveAccurate prognostication is crucial for managing human immunodeficiency virus (HIV)-associated cutaneous T-cell lymphoma. In this study, we aimed to develop an improved machine learning-based prognostic model for predicting the 5-year survival rates in HIV-associated cutaneous T-cell lymphoma patients.MethodsWe derived and tested machine learning models using algorithms including Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Random Forest. Our study involved data from a US population-based cohort of patients diagnosed with HIV-associated cutaneous T-cell lymphoma between 1 January 2000 and 31 December 2018, which were extracted from the Surveillance, Epidemiology, and End Results database. The primary outcome was the prediction of 5-year overall survival. Model discrimination was assessed using the area under the receiver operating characteristic curve (AUC), and calibration was assessed using Brier scores.ResultsA cohort of 381 HIV-associated cutaneous T-cell lymphoma patients was analyzed. Multivariate logistic regression identified age ≥60 years (odds ratio = 4.88), regional stage (odds ratio = 10.31), distant stage (odds ratio = 28.37), and chemotherapy (odds ratio = 4.71) as significant independent risk factors for 5-year mortality. Among seven machine learning models developed, the XGBoost model demonstrated the highest discrimination for 5-year overall survival (AUC = 0.867), followed by LightGBM (AUC = 0.835). Both models exhibited good calibration with low Brier scores (XGBoost = 0.130, LightGBM = 0.109). Support Vector Machine performed optimally in ten-fold cross-validation, logistic regression showed the lowest Brier score (0.106), and XGBoost provided the best balance of discrimination and robust performance.ConclusionOur novel machine learning approach produced prognostic models with superior discrimination for 5-year overall survival in HIV-associated cutaneous T-cell lymphoma patients using standard clinicopathological variables. These models offer potential for more accurate and personalized prognostics, potentially improving patient management and clinical decision-making.

Abstract Image

Abstract Image

Abstract Image

基于机器学习的人类免疫缺陷病毒相关皮肤t细胞淋巴瘤预后模型:监测、流行病学和最终结果数据库分析。
目的:准确的预后对治疗人类免疫缺陷病毒(HIV)相关皮肤t细胞淋巴瘤至关重要。在这项研究中,我们旨在开发一种改进的基于机器学习的预测模型,用于预测hiv相关皮肤t细胞淋巴瘤患者的5年生存率。方法我们使用极端梯度增强(XGBoost)、光梯度增强机(LightGBM)和随机森林算法推导和测试机器学习模型。我们的研究涉及2000年1月1日至2018年12月31日期间诊断为hiv相关皮肤t细胞淋巴瘤的美国人群队列患者的数据,这些数据来自监测、流行病学和最终结果数据库。主要结局是预测5年总生存期。使用受试者工作特征曲线下面积(AUC)评估模型判别,使用Brier评分评估模型校准。结果对381例hiv相关皮肤t细胞淋巴瘤患者进行了队列分析。多因素logistic回归发现年龄≥60岁(优势比= 4.88)、局部分期(优势比= 10.31)、远处分期(优势比= 28.37)和化疗(优势比= 4.71)是5年死亡率的重要独立危险因素。在开发的7个机器学习模型中,XGBoost模型对5年总生存期的判别率最高(AUC = 0.867),其次是LightGBM (AUC = 0.835)。两种模型均具有较低的Brier评分(XGBoost = 0.130, LightGBM = 0.109)。支持向量机在十重交叉验证中表现最佳,逻辑回归显示Brier得分最低(0.106),XGBoost提供了最佳的判别性和鲁棒性平衡。我们的新机器学习方法使用标准临床病理变量,为hiv相关皮肤t细胞淋巴瘤患者的5年总生存率建立了具有优越判别能力的预后模型。这些模型提供了更准确和个性化的预后,有可能改善患者管理和临床决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.20
自引率
0.00%
发文量
555
审稿时长
1 months
期刊介绍: _Journal of International Medical Research_ is a leading international journal for rapid publication of original medical, pre-clinical and clinical research, reviews, preliminary and pilot studies on a page charge basis. As a service to authors, every article accepted by peer review will be given a full technical edit to make papers as accessible and readable to the international medical community as rapidly as possible. Once the technical edit queries have been answered to the satisfaction of the journal, the paper will be published and made available freely to everyone under a creative commons licence. Symposium proceedings, summaries of presentations or collections of medical, pre-clinical or clinical data on a specific topic are welcome for publication as supplements. Print ISSN: 0300-0605
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信