{"title":"Predicting 5-Year Survival and Mortality in Dementia Patients: A Data-Driven Approach Using XGBoost for Enhanced Care and Resource Allocation.","authors":"Yi-Guang Wang, Hsin-An Chang, Mu-Hong Chen, Nian-Sheng Tzeng, Jin Narumoto, Chih-Sung Liang, Ta-Chuan Yeh","doi":"10.30773/pi.2024.0351","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study develops an eXtreme Gradient Boosting (XGBoost) regression model to identify key predictors of mortality and 5-year survival in dementia patients, highlighting the role of comorbidities. The findings highlight key risk factors that may facilitate targeted adjustments in clinical care and resource allocation for high-risk patients.</p><p><strong>Methods: </strong>We used Taiwan's National Health Insurance dataset to develop and validate an XGBoost model predicting 5-year survival in dementia patients aged 65 years or older. The cohort (n=6,556) was split into 80% for training, 10% for validation, and 10% for testing. A total of 24 variables, including comorbidities and demographic factors, were selected as predictors. Hyperparameters were tuned to optimize performance, with a learning rate of 0.1, 1,000 estimators, and a maximum depth of 10. Regularization techniques were applied to prevent overfitting.</p><p><strong>Results: </strong>The XGBoost model achieved 81.86% accuracy in predicting 5-year survival, with a receiver operating characteristic area under the curve of 0.81 and a log loss of 0.61. Of the 37 initial features, 24 were included, and the top 10 predictors were nasogastric tube insertion, chronic kidney disease, cancer, lung disease, urinary tract infection, fracture, peripheral vascular disease, antidepressant use, hypertension, and upper gastrointestinal issues.</p><p><strong>Conclusion: </strong>The XGBoost model effectively predicts 5-year survival in dementia patients, identifying key predictors that can guide targeted care, preventive strategies, and healthcare resource planning.</p>","PeriodicalId":21164,"journal":{"name":"Psychiatry Investigation","volume":" ","pages":"1057-1067"},"PeriodicalIF":1.8000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12444194/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychiatry Investigation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.30773/pi.2024.0351","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: This study develops an eXtreme Gradient Boosting (XGBoost) regression model to identify key predictors of mortality and 5-year survival in dementia patients, highlighting the role of comorbidities. The findings highlight key risk factors that may facilitate targeted adjustments in clinical care and resource allocation for high-risk patients.
Methods: We used Taiwan's National Health Insurance dataset to develop and validate an XGBoost model predicting 5-year survival in dementia patients aged 65 years or older. The cohort (n=6,556) was split into 80% for training, 10% for validation, and 10% for testing. A total of 24 variables, including comorbidities and demographic factors, were selected as predictors. Hyperparameters were tuned to optimize performance, with a learning rate of 0.1, 1,000 estimators, and a maximum depth of 10. Regularization techniques were applied to prevent overfitting.
Results: The XGBoost model achieved 81.86% accuracy in predicting 5-year survival, with a receiver operating characteristic area under the curve of 0.81 and a log loss of 0.61. Of the 37 initial features, 24 were included, and the top 10 predictors were nasogastric tube insertion, chronic kidney disease, cancer, lung disease, urinary tract infection, fracture, peripheral vascular disease, antidepressant use, hypertension, and upper gastrointestinal issues.
Conclusion: The XGBoost model effectively predicts 5-year survival in dementia patients, identifying key predictors that can guide targeted care, preventive strategies, and healthcare resource planning.
期刊介绍:
The Psychiatry Investigation is published on the 25th day of every month in English by the Korean Neuropsychiatric Association (KNPA). The Journal covers the whole range of psychiatry and neuroscience. Both basic and clinical contributions are encouraged from all disciplines and research areas relevant to the pathophysiology and management of neuropsychiatric disorders and symptoms, as well as researches related to cross cultural psychiatry and ethnic issues in psychiatry. The Journal publishes editorials, review articles, original articles, brief reports, viewpoints and correspondences. All research articles are peer reviewed. Contributions are accepted for publication on the condition that their substance has not been published or submitted for publication elsewhere. Authors submitting papers to the Journal (serially or otherwise) with a common theme or using data derived from the same sample (or a subset thereof) must send details of all relevant previous publications and simultaneous submissions. The Journal is not responsible for statements made by contributors. Material in the Journal does not necessarily reflect the views of the Editor or of the KNPA. Manuscripts accepted for publication are copy-edited to improve readability and to ensure conformity with house style.