Predicting Hospitalization in Older Adults Using Machine Learning.

IF 2.1 Q3 GERIATRICS & GERONTOLOGY
Raymundo Buenrostro-Mariscal, Osval A Montesinos-López, Cesar Gonzalez-Gonzalez
{"title":"Predicting Hospitalization in Older Adults Using Machine Learning.","authors":"Raymundo Buenrostro-Mariscal, Osval A Montesinos-López, Cesar Gonzalez-Gonzalez","doi":"10.3390/geriatrics10010006","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background/Objectives</b>: Hospitalization among older adults is a growing challenge in Mexico due to the high prevalence of chronic diseases and limited public healthcare resources. This study aims to develop a predictive model for hospitalization using longitudinal data from the Mexican Health and Aging Study (MHAS) using the random forest (RF) algorithm. <b>Methods</b>: An RF-based machine learning model was designed and evaluated under different data partition strategies (ST) with and without variable interaction. Variable importance was assessed based on the mean decrease in impurity and permutation importance, enhancing our understanding of predictors of hospitalization. The model's robustness was ensured through modified nested cross-validation, with evaluation metrics including sensitivity, specificity, and the kappa coefficient. <b>Results</b>: The model with ST2, incorporating interaction and a 20% test proportion, achieved the best balance between sensitivity (0.7215, standard error ± 0.0038), and specificity (0.4935, standard error ± 0.0039). Variable importance analysis revealed that functional limitations (e.g., abvd3, 31.1% importance), age (12.75%), and history of cerebrovascular accidents (12.4%) were the strongest predictors. Socioeconomic factors, including education level (12.08%), also emerged as critical predictors, highlighting the model's ability to capture complex interactions between health and socioeconomic variables. <b>Conclusions</b>: The integration of variable importance analysis enhances the interpretability of the RF model, providing novel insights into the predictors of hospitalization in older adults. These findings underscore the potential for clinical applications, including anticipating hospital demand and optimizing resource allocation. Future research will focus on integrating subgroup analyses for comorbidities and advanced techniques for handling missing data to further improve predictive accuracy.</p>","PeriodicalId":12653,"journal":{"name":"Geriatrics","volume":"10 1","pages":""},"PeriodicalIF":2.1000,"publicationDate":"2025-01-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11755630/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Geriatrics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/geriatrics10010006","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Background/Objectives: Hospitalization among older adults is a growing challenge in Mexico due to the high prevalence of chronic diseases and limited public healthcare resources. This study aims to develop a predictive model for hospitalization using longitudinal data from the Mexican Health and Aging Study (MHAS) using the random forest (RF) algorithm. Methods: An RF-based machine learning model was designed and evaluated under different data partition strategies (ST) with and without variable interaction. Variable importance was assessed based on the mean decrease in impurity and permutation importance, enhancing our understanding of predictors of hospitalization. The model's robustness was ensured through modified nested cross-validation, with evaluation metrics including sensitivity, specificity, and the kappa coefficient. Results: The model with ST2, incorporating interaction and a 20% test proportion, achieved the best balance between sensitivity (0.7215, standard error ± 0.0038), and specificity (0.4935, standard error ± 0.0039). Variable importance analysis revealed that functional limitations (e.g., abvd3, 31.1% importance), age (12.75%), and history of cerebrovascular accidents (12.4%) were the strongest predictors. Socioeconomic factors, including education level (12.08%), also emerged as critical predictors, highlighting the model's ability to capture complex interactions between health and socioeconomic variables. Conclusions: The integration of variable importance analysis enhances the interpretability of the RF model, providing novel insights into the predictors of hospitalization in older adults. These findings underscore the potential for clinical applications, including anticipating hospital demand and optimizing resource allocation. Future research will focus on integrating subgroup analyses for comorbidities and advanced techniques for handling missing data to further improve predictive accuracy.

使用机器学习预测老年人住院。
背景/目的:在墨西哥,由于慢性病的高患病率和有限的公共卫生保健资源,老年人住院是一个日益严峻的挑战。本研究旨在利用墨西哥健康与老龄化研究(MHAS)的纵向数据,利用随机森林(RF)算法建立住院治疗的预测模型。方法:设计了基于rf的机器学习模型,并在有无变量交互的不同数据分区策略(ST)下进行了评估。根据杂质和排列重要性的平均减少来评估变量重要性,增强我们对住院预测因子的理解。通过改进的嵌套交叉验证来确保模型的稳健性,评估指标包括敏感性、特异性和kappa系数。结果:考虑相互作用和20%检验比例的ST2模型在敏感性(0.7215,标准误差±0.0038)和特异性(0.4935,标准误差±0.0039)之间取得了最佳平衡。变量重要性分析显示,功能限制(如abvd3,重要性为31.1%)、年龄(12.75%)和脑血管意外史(12.4%)是最强的预测因素。社会经济因素,包括教育水平(12.08%),也成为关键的预测因素,突出了该模型捕捉健康和社会经济变量之间复杂相互作用的能力。结论:变量重要性分析的整合增强了RF模型的可解释性,为老年人住院治疗的预测因素提供了新的见解。这些发现强调了临床应用的潜力,包括预测医院需求和优化资源分配。未来的研究将集中于整合共病的亚组分析和处理缺失数据的先进技术,以进一步提高预测的准确性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Geriatrics
Geriatrics 医学-老年医学
CiteScore
3.30
自引率
0.00%
发文量
115
审稿时长
20.03 days
期刊介绍: • Geriatric biology • Geriatric health services research • Geriatric medicine research • Geriatric neurology, stroke, cognition and oncology • Geriatric surgery • Geriatric physical functioning, physical health and activity • Geriatric psychiatry and psychology • Geriatric nutrition • Geriatric epidemiology • Geriatric rehabilitation
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信