{"title":"基于机器学习的人类免疫缺陷病毒相关皮肤t细胞淋巴瘤预后模型:监测、流行病学和最终结果数据库分析。","authors":"Weimin Huang, Manwen Tian, Lanlan Jia, Jing Ai, Jinying Gan, Junteng Chen, Lingzhen Chen, Yongmin Zhang","doi":"10.1177/03000605251359433","DOIUrl":null,"url":null,"abstract":"<p><p>ObjectiveAccurate prognostication is crucial for managing human immunodeficiency virus (HIV)-associated cutaneous T-cell lymphoma. In this study, we aimed to develop an improved machine learning-based prognostic model for predicting the 5-year survival rates in HIV-associated cutaneous T-cell lymphoma patients.MethodsWe derived and tested machine learning models using algorithms including Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Random Forest. Our study involved data from a US population-based cohort of patients diagnosed with HIV-associated cutaneous T-cell lymphoma between 1 January 2000 and 31 December 2018, which were extracted from the Surveillance, Epidemiology, and End Results database. The primary outcome was the prediction of 5-year overall survival. Model discrimination was assessed using the area under the receiver operating characteristic curve (AUC), and calibration was assessed using Brier scores.ResultsA cohort of 381 HIV-associated cutaneous T-cell lymphoma patients was analyzed. Multivariate logistic regression identified age ≥60 years (odds ratio = 4.88), regional stage (odds ratio = 10.31), distant stage (odds ratio = 28.37), and chemotherapy (odds ratio = 4.71) as significant independent risk factors for 5-year mortality. Among seven machine learning models developed, the XGBoost model demonstrated the highest discrimination for 5-year overall survival (AUC = 0.867), followed by LightGBM (AUC = 0.835). Both models exhibited good calibration with low Brier scores (XGBoost = 0.130, LightGBM = 0.109). Support Vector Machine performed optimally in ten-fold cross-validation, logistic regression showed the lowest Brier score (0.106), and XGBoost provided the best balance of discrimination and robust performance.ConclusionOur novel machine learning approach produced prognostic models with superior discrimination for 5-year overall survival in HIV-associated cutaneous T-cell lymphoma patients using standard clinicopathological variables. These models offer potential for more accurate and personalized prognostics, potentially improving patient management and clinical decision-making.</p>","PeriodicalId":16129,"journal":{"name":"Journal of International Medical Research","volume":"53 9","pages":"3000605251359433"},"PeriodicalIF":1.5000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12417650/pdf/","citationCount":"0","resultStr":"{\"title\":\"Machine learning-based prognostic model for human immunodeficiency virus-associated cutaneous T-cell lymphoma: A Surveillance, Epidemiology, and End Results database analysis.\",\"authors\":\"Weimin Huang, Manwen Tian, Lanlan Jia, Jing Ai, Jinying Gan, Junteng Chen, Lingzhen Chen, Yongmin Zhang\",\"doi\":\"10.1177/03000605251359433\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>ObjectiveAccurate prognostication is crucial for managing human immunodeficiency virus (HIV)-associated cutaneous T-cell lymphoma. In this study, we aimed to develop an improved machine learning-based prognostic model for predicting the 5-year survival rates in HIV-associated cutaneous T-cell lymphoma patients.MethodsWe derived and tested machine learning models using algorithms including Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Random Forest. Our study involved data from a US population-based cohort of patients diagnosed with HIV-associated cutaneous T-cell lymphoma between 1 January 2000 and 31 December 2018, which were extracted from the Surveillance, Epidemiology, and End Results database. The primary outcome was the prediction of 5-year overall survival. Model discrimination was assessed using the area under the receiver operating characteristic curve (AUC), and calibration was assessed using Brier scores.ResultsA cohort of 381 HIV-associated cutaneous T-cell lymphoma patients was analyzed. Multivariate logistic regression identified age ≥60 years (odds ratio = 4.88), regional stage (odds ratio = 10.31), distant stage (odds ratio = 28.37), and chemotherapy (odds ratio = 4.71) as significant independent risk factors for 5-year mortality. Among seven machine learning models developed, the XGBoost model demonstrated the highest discrimination for 5-year overall survival (AUC = 0.867), followed by LightGBM (AUC = 0.835). Both models exhibited good calibration with low Brier scores (XGBoost = 0.130, LightGBM = 0.109). Support Vector Machine performed optimally in ten-fold cross-validation, logistic regression showed the lowest Brier score (0.106), and XGBoost provided the best balance of discrimination and robust performance.ConclusionOur novel machine learning approach produced prognostic models with superior discrimination for 5-year overall survival in HIV-associated cutaneous T-cell lymphoma patients using standard clinicopathological variables. These models offer potential for more accurate and personalized prognostics, potentially improving patient management and clinical decision-making.</p>\",\"PeriodicalId\":16129,\"journal\":{\"name\":\"Journal of International Medical Research\",\"volume\":\"53 9\",\"pages\":\"3000605251359433\"},\"PeriodicalIF\":1.5000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12417650/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of International Medical Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1177/03000605251359433\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/9/8 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q4\",\"JCRName\":\"MEDICINE, RESEARCH & EXPERIMENTAL\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of International Medical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/03000605251359433","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/9/8 0:00:00","PubModel":"Epub","JCR":"Q4","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
Machine learning-based prognostic model for human immunodeficiency virus-associated cutaneous T-cell lymphoma: A Surveillance, Epidemiology, and End Results database analysis.
ObjectiveAccurate prognostication is crucial for managing human immunodeficiency virus (HIV)-associated cutaneous T-cell lymphoma. In this study, we aimed to develop an improved machine learning-based prognostic model for predicting the 5-year survival rates in HIV-associated cutaneous T-cell lymphoma patients.MethodsWe derived and tested machine learning models using algorithms including Extreme Gradient Boosting (XGBoost), Light Gradient Boosting Machine (LightGBM), and Random Forest. Our study involved data from a US population-based cohort of patients diagnosed with HIV-associated cutaneous T-cell lymphoma between 1 January 2000 and 31 December 2018, which were extracted from the Surveillance, Epidemiology, and End Results database. The primary outcome was the prediction of 5-year overall survival. Model discrimination was assessed using the area under the receiver operating characteristic curve (AUC), and calibration was assessed using Brier scores.ResultsA cohort of 381 HIV-associated cutaneous T-cell lymphoma patients was analyzed. Multivariate logistic regression identified age ≥60 years (odds ratio = 4.88), regional stage (odds ratio = 10.31), distant stage (odds ratio = 28.37), and chemotherapy (odds ratio = 4.71) as significant independent risk factors for 5-year mortality. Among seven machine learning models developed, the XGBoost model demonstrated the highest discrimination for 5-year overall survival (AUC = 0.867), followed by LightGBM (AUC = 0.835). Both models exhibited good calibration with low Brier scores (XGBoost = 0.130, LightGBM = 0.109). Support Vector Machine performed optimally in ten-fold cross-validation, logistic regression showed the lowest Brier score (0.106), and XGBoost provided the best balance of discrimination and robust performance.ConclusionOur novel machine learning approach produced prognostic models with superior discrimination for 5-year overall survival in HIV-associated cutaneous T-cell lymphoma patients using standard clinicopathological variables. These models offer potential for more accurate and personalized prognostics, potentially improving patient management and clinical decision-making.
期刊介绍:
_Journal of International Medical Research_ is a leading international journal for rapid publication of original medical, pre-clinical and clinical research, reviews, preliminary and pilot studies on a page charge basis.
As a service to authors, every article accepted by peer review will be given a full technical edit to make papers as accessible and readable to the international medical community as rapidly as possible.
Once the technical edit queries have been answered to the satisfaction of the journal, the paper will be published and made available freely to everyone under a creative commons licence.
Symposium proceedings, summaries of presentations or collections of medical, pre-clinical or clinical data on a specific topic are welcome for publication as supplements.
Print ISSN: 0300-0605