预测痴呆患者的5年生存率和死亡率:使用XGBoost增强护理和资源分配的数据驱动方法

IF 1.8 4区 医学 Q3 PSYCHIATRY
Yi-Guang Wang, Hsin-An Chang, Mu-Hong Chen, Nian-Sheng Tzeng, Jin Narumoto, Chih-Sung Liang, Ta-Chuan Yeh
{"title":"预测痴呆患者的5年生存率和死亡率:使用XGBoost增强护理和资源分配的数据驱动方法","authors":"Yi-Guang Wang, Hsin-An Chang, Mu-Hong Chen, Nian-Sheng Tzeng, Jin Narumoto, Chih-Sung Liang, Ta-Chuan Yeh","doi":"10.30773/pi.2024.0351","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study develops an eXtreme Gradient Boosting (XGBoost) regression model to identify key predictors of mortality and 5-year survival in dementia patients, highlighting the role of comorbidities. The findings highlight key risk factors that may facilitate targeted adjustments in clinical care and resource allocation for high-risk patients.</p><p><strong>Methods: </strong>We used Taiwan's National Health Insurance dataset to develop and validate an XGBoost model predicting 5-year survival in dementia patients aged 65 years or older. The cohort (n=6,556) was split into 80% for training, 10% for validation, and 10% for testing. A total of 24 variables, including comorbidities and demographic factors, were selected as predictors. Hyperparameters were tuned to optimize performance, with a learning rate of 0.1, 1,000 estimators, and a maximum depth of 10. Regularization techniques were applied to prevent overfitting.</p><p><strong>Results: </strong>The XGBoost model achieved 81.86% accuracy in predicting 5-year survival, with a receiver operating characteristic area under the curve of 0.81 and a log loss of 0.61. Of the 37 initial features, 24 were included, and the top 10 predictors were nasogastric tube insertion, chronic kidney disease, cancer, lung disease, urinary tract infection, fracture, peripheral vascular disease, antidepressant use, hypertension, and upper gastrointestinal issues.</p><p><strong>Conclusion: </strong>The XGBoost model effectively predicts 5-year survival in dementia patients, identifying key predictors that can guide targeted care, preventive strategies, and healthcare resource planning.</p>","PeriodicalId":21164,"journal":{"name":"Psychiatry Investigation","volume":" ","pages":"1057-1067"},"PeriodicalIF":1.8000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12444194/pdf/","citationCount":"0","resultStr":"{\"title\":\"Predicting 5-Year Survival and Mortality in Dementia Patients: A Data-Driven Approach Using XGBoost for Enhanced Care and Resource Allocation.\",\"authors\":\"Yi-Guang Wang, Hsin-An Chang, Mu-Hong Chen, Nian-Sheng Tzeng, Jin Narumoto, Chih-Sung Liang, Ta-Chuan Yeh\",\"doi\":\"10.30773/pi.2024.0351\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objective: </strong>This study develops an eXtreme Gradient Boosting (XGBoost) regression model to identify key predictors of mortality and 5-year survival in dementia patients, highlighting the role of comorbidities. The findings highlight key risk factors that may facilitate targeted adjustments in clinical care and resource allocation for high-risk patients.</p><p><strong>Methods: </strong>We used Taiwan's National Health Insurance dataset to develop and validate an XGBoost model predicting 5-year survival in dementia patients aged 65 years or older. The cohort (n=6,556) was split into 80% for training, 10% for validation, and 10% for testing. A total of 24 variables, including comorbidities and demographic factors, were selected as predictors. Hyperparameters were tuned to optimize performance, with a learning rate of 0.1, 1,000 estimators, and a maximum depth of 10. Regularization techniques were applied to prevent overfitting.</p><p><strong>Results: </strong>The XGBoost model achieved 81.86% accuracy in predicting 5-year survival, with a receiver operating characteristic area under the curve of 0.81 and a log loss of 0.61. Of the 37 initial features, 24 were included, and the top 10 predictors were nasogastric tube insertion, chronic kidney disease, cancer, lung disease, urinary tract infection, fracture, peripheral vascular disease, antidepressant use, hypertension, and upper gastrointestinal issues.</p><p><strong>Conclusion: </strong>The XGBoost model effectively predicts 5-year survival in dementia patients, identifying key predictors that can guide targeted care, preventive strategies, and healthcare resource planning.</p>\",\"PeriodicalId\":21164,\"journal\":{\"name\":\"Psychiatry Investigation\",\"volume\":\" \",\"pages\":\"1057-1067\"},\"PeriodicalIF\":1.8000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12444194/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Psychiatry Investigation\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.30773/pi.2024.0351\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"PSYCHIATRY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Psychiatry Investigation","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.30773/pi.2024.0351","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"PSYCHIATRY","Score":null,"Total":0}
引用次数: 0

摘要

目的:本研究建立了一个极端梯度增强(XGBoost)回归模型,以确定痴呆患者死亡率和5年生存率的关键预测因素,并强调合并症的作用。研究结果强调了关键的危险因素,可能有助于有针对性地调整临床护理和高危患者的资源分配。方法:我们使用台湾的国民健康保险数据集开发并验证了预测65岁及以上痴呆患者5年生存率的XGBoost模型。队列(n=6,556)分为80%用于培训,10%用于验证,10%用于测试。共选择24个变量作为预测因子,包括合并症和人口统计学因素。对超参数进行了调优以优化性能,学习率为0.1,估计器为1,000,最大深度为10。正则化技术用于防止过拟合。结果:XGBoost模型预测5年生存率的准确率为81.86%,曲线下受者工作特征面积为0.81,对数损失为0.61。在37个初始特征中,包括24个,前10个预测因素是鼻胃管插入、慢性肾病、癌症、肺病、尿路感染、骨折、周围血管疾病、使用抗抑郁药、高血压和上消化道问题。结论:XGBoost模型可有效预测痴呆患者的5年生存率,确定关键预测因子,指导有针对性的护理、预防策略和医疗资源规划。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Predicting 5-Year Survival and Mortality in Dementia Patients: A Data-Driven Approach Using XGBoost for Enhanced Care and Resource Allocation.

Predicting 5-Year Survival and Mortality in Dementia Patients: A Data-Driven Approach Using XGBoost for Enhanced Care and Resource Allocation.

Predicting 5-Year Survival and Mortality in Dementia Patients: A Data-Driven Approach Using XGBoost for Enhanced Care and Resource Allocation.

Predicting 5-Year Survival and Mortality in Dementia Patients: A Data-Driven Approach Using XGBoost for Enhanced Care and Resource Allocation.

Objective: This study develops an eXtreme Gradient Boosting (XGBoost) regression model to identify key predictors of mortality and 5-year survival in dementia patients, highlighting the role of comorbidities. The findings highlight key risk factors that may facilitate targeted adjustments in clinical care and resource allocation for high-risk patients.

Methods: We used Taiwan's National Health Insurance dataset to develop and validate an XGBoost model predicting 5-year survival in dementia patients aged 65 years or older. The cohort (n=6,556) was split into 80% for training, 10% for validation, and 10% for testing. A total of 24 variables, including comorbidities and demographic factors, were selected as predictors. Hyperparameters were tuned to optimize performance, with a learning rate of 0.1, 1,000 estimators, and a maximum depth of 10. Regularization techniques were applied to prevent overfitting.

Results: The XGBoost model achieved 81.86% accuracy in predicting 5-year survival, with a receiver operating characteristic area under the curve of 0.81 and a log loss of 0.61. Of the 37 initial features, 24 were included, and the top 10 predictors were nasogastric tube insertion, chronic kidney disease, cancer, lung disease, urinary tract infection, fracture, peripheral vascular disease, antidepressant use, hypertension, and upper gastrointestinal issues.

Conclusion: The XGBoost model effectively predicts 5-year survival in dementia patients, identifying key predictors that can guide targeted care, preventive strategies, and healthcare resource planning.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
CiteScore
4.10
自引率
3.70%
发文量
105
审稿时长
6-12 weeks
期刊介绍: The Psychiatry Investigation is published on the 25th day of every month in English by the Korean Neuropsychiatric Association (KNPA). The Journal covers the whole range of psychiatry and neuroscience. Both basic and clinical contributions are encouraged from all disciplines and research areas relevant to the pathophysiology and management of neuropsychiatric disorders and symptoms, as well as researches related to cross cultural psychiatry and ethnic issues in psychiatry. The Journal publishes editorials, review articles, original articles, brief reports, viewpoints and correspondences. All research articles are peer reviewed. Contributions are accepted for publication on the condition that their substance has not been published or submitted for publication elsewhere. Authors submitting papers to the Journal (serially or otherwise) with a common theme or using data derived from the same sample (or a subset thereof) must send details of all relevant previous publications and simultaneous submissions. The Journal is not responsible for statements made by contributors. Material in the Journal does not necessarily reflect the views of the Editor or of the KNPA. Manuscripts accepted for publication are copy-edited to improve readability and to ensure conformity with house style.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信