Lu Shao, Zhong Wang, Xiang Qi, Jing Wang, Hongtao Cheng, Xichenhui Qiu, Ting Xu, Jun-E Zhang, Bei Wu
{"title":"中国养老院老年人跌倒风险评估工具的验证和比较:一项前瞻性队列研究。","authors":"Lu Shao, Zhong Wang, Xiang Qi, Jing Wang, Hongtao Cheng, Xichenhui Qiu, Ting Xu, Jun-E Zhang, Bei Wu","doi":"10.1016/j.jamda.2025.105882","DOIUrl":null,"url":null,"abstract":"<p><strong>Objectives: </strong>This study evaluates and compares the predictive performance of 4 widely used fall risk assessment tools-Morse Fall Scale (MFS), St. Thomas's Risk Assessment Tool in Falling Elderly Inpatients (STRATIFY), Hendrich II Fall Risk Model (Hendrich II), and Timed Up and Go Test (TUGT)-in Chinese nursing homes, with a focus on optimizing cutoff values for better applicability.</p><p><strong>Design: </strong>A prospective cohort study.</p><p><strong>Setting and participants: </strong>The study was conducted in 4 nursing homes in China, including 866 older adults capable of providing informed consent and completing verbal communication.</p><p><strong>Methods: </strong>Participants were assessed using the 4 fall risk tools, and their fall events were recorded over 6 months. Predictive performance was evaluated using sensitivity, specificity, positive predictive value, negative predictive value, and area under the receiver operating characteristic curve. Calibration curves were generated to assess the agreement between predicted and observed fall probabilities.</p><p><strong>Results: </strong>Using the original cutoff values, the MFS (≥45) and TUGT (≥12 seconds) both showed high sensitivity (0.889 and 0.933, respectively) but low specificity (0.284 and 0.261, respectively). In contrast, the STRATIFY (≥2) and Hendrich II (≥5) exhibited high specificity (0.964 and 0.827, respectively) but low sensitivity (0.117 and 0.328, respectively). After optimization, the MFS (≥65) improved specificity (0.592) with moderate sensitivity (0.689), the STRATIFY (≥1) increased sensitivity (0.856) while reducing specificity to 0.407, the Hendrich II (≥2) achieved specificity of 0.519 with sensitivity of 0.739, and the TUGT (≥26.6 seconds) maintained high sensitivity (0.739) but had a specificity of 0.622. The TUGT demonstrated the strongest overall predictive accuracy (area under the receiver operating characteristic curve, 0.722).</p><p><strong>Conclusions and implications: </strong>All tools showed limitations in balancing sensitivity and specificity. Adjusting thresholds improved performance but did not yield optimal results. The findings highlight the importance of tailoring fall risk assessments to specific populations with thresholds adjusted to optimize performance. Future research should explore integrating clinical assessments with data-driven predictive models to enhance fall risk evaluation in long-term care settings.</p>","PeriodicalId":17180,"journal":{"name":"Journal of the American Medical Directors Association","volume":" ","pages":"105882"},"PeriodicalIF":3.8000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Validation and Comparison of 4 Fall Risk Assessment Tools for Older Adults in Chinese Nursing Homes: A Prospective Cohort Study.\",\"authors\":\"Lu Shao, Zhong Wang, Xiang Qi, Jing Wang, Hongtao Cheng, Xichenhui Qiu, Ting Xu, Jun-E Zhang, Bei Wu\",\"doi\":\"10.1016/j.jamda.2025.105882\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Objectives: </strong>This study evaluates and compares the predictive performance of 4 widely used fall risk assessment tools-Morse Fall Scale (MFS), St. Thomas's Risk Assessment Tool in Falling Elderly Inpatients (STRATIFY), Hendrich II Fall Risk Model (Hendrich II), and Timed Up and Go Test (TUGT)-in Chinese nursing homes, with a focus on optimizing cutoff values for better applicability.</p><p><strong>Design: </strong>A prospective cohort study.</p><p><strong>Setting and participants: </strong>The study was conducted in 4 nursing homes in China, including 866 older adults capable of providing informed consent and completing verbal communication.</p><p><strong>Methods: </strong>Participants were assessed using the 4 fall risk tools, and their fall events were recorded over 6 months. Predictive performance was evaluated using sensitivity, specificity, positive predictive value, negative predictive value, and area under the receiver operating characteristic curve. Calibration curves were generated to assess the agreement between predicted and observed fall probabilities.</p><p><strong>Results: </strong>Using the original cutoff values, the MFS (≥45) and TUGT (≥12 seconds) both showed high sensitivity (0.889 and 0.933, respectively) but low specificity (0.284 and 0.261, respectively). In contrast, the STRATIFY (≥2) and Hendrich II (≥5) exhibited high specificity (0.964 and 0.827, respectively) but low sensitivity (0.117 and 0.328, respectively). After optimization, the MFS (≥65) improved specificity (0.592) with moderate sensitivity (0.689), the STRATIFY (≥1) increased sensitivity (0.856) while reducing specificity to 0.407, the Hendrich II (≥2) achieved specificity of 0.519 with sensitivity of 0.739, and the TUGT (≥26.6 seconds) maintained high sensitivity (0.739) but had a specificity of 0.622. The TUGT demonstrated the strongest overall predictive accuracy (area under the receiver operating characteristic curve, 0.722).</p><p><strong>Conclusions and implications: </strong>All tools showed limitations in balancing sensitivity and specificity. Adjusting thresholds improved performance but did not yield optimal results. The findings highlight the importance of tailoring fall risk assessments to specific populations with thresholds adjusted to optimize performance. Future research should explore integrating clinical assessments with data-driven predictive models to enhance fall risk evaluation in long-term care settings.</p>\",\"PeriodicalId\":17180,\"journal\":{\"name\":\"Journal of the American Medical Directors Association\",\"volume\":\" \",\"pages\":\"105882\"},\"PeriodicalIF\":3.8000,\"publicationDate\":\"2025-10-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the American Medical Directors Association\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jamda.2025.105882\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"GERIATRICS & GERONTOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Directors Association","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jamda.2025.105882","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GERIATRICS & GERONTOLOGY","Score":null,"Total":0}
引用次数: 0
摘要
目的:本研究对我国养老院常用的莫尔斯跌倒量表(MFS)、圣托马斯住院老年跌倒风险评估工具(STRATIFY)、Hendrich II跌倒风险模型(Hendrich II)和Timed Up and Go Test (TUGT) 4种跌倒风险评估工具的预测性能进行评价和比较,并重点优化截止值以提高适用性。设计:前瞻性队列研究。环境和参与者:研究在中国的4家养老院进行,包括866名能够提供知情同意并完成口头交流的老年人。方法:使用4种跌倒风险工具对参与者进行评估,并在6个月内记录他们的跌倒事件。采用敏感性、特异性、阳性预测值、阴性预测值和受试者工作特征曲线下面积评价预测效果。生成校准曲线以评估预测和观测到的坠落概率之间的一致性。结果:使用原始截止值,MFS(≥45)和TUGT(≥12秒)均具有高灵敏度(分别为0.889和0.933)和低特异性(分别为0.284和0.261)。相比之下,STRATIFY(≥2)和Hendrich II(≥5)具有高特异性(分别为0.964和0.827)和低敏感性(分别为0.117和0.328)。优化后,MFS(≥65)提高了特异性(0.592),敏感性中等(0.689),STRATIFY(≥1)提高了敏感性(0.856),特异性降至0.407,Hendrich II(≥2)达到了0.519,敏感性为0.739,TUGT(≥26.6秒)保持了高灵敏度(0.739),但特异性为0.622。TUGT表现出最强的整体预测精度(受试者工作特征曲线下面积,0.722)。结论和意义:所有工具在平衡敏感性和特异性方面都存在局限性。调整阈值可以提高性能,但不能产生最佳结果。研究结果强调了为特定人群量身定制跌倒风险评估的重要性,并调整阈值以优化绩效。未来的研究应探索将临床评估与数据驱动的预测模型相结合,以加强长期护理环境中跌倒风险的评估。
Validation and Comparison of 4 Fall Risk Assessment Tools for Older Adults in Chinese Nursing Homes: A Prospective Cohort Study.
Objectives: This study evaluates and compares the predictive performance of 4 widely used fall risk assessment tools-Morse Fall Scale (MFS), St. Thomas's Risk Assessment Tool in Falling Elderly Inpatients (STRATIFY), Hendrich II Fall Risk Model (Hendrich II), and Timed Up and Go Test (TUGT)-in Chinese nursing homes, with a focus on optimizing cutoff values for better applicability.
Design: A prospective cohort study.
Setting and participants: The study was conducted in 4 nursing homes in China, including 866 older adults capable of providing informed consent and completing verbal communication.
Methods: Participants were assessed using the 4 fall risk tools, and their fall events were recorded over 6 months. Predictive performance was evaluated using sensitivity, specificity, positive predictive value, negative predictive value, and area under the receiver operating characteristic curve. Calibration curves were generated to assess the agreement between predicted and observed fall probabilities.
Results: Using the original cutoff values, the MFS (≥45) and TUGT (≥12 seconds) both showed high sensitivity (0.889 and 0.933, respectively) but low specificity (0.284 and 0.261, respectively). In contrast, the STRATIFY (≥2) and Hendrich II (≥5) exhibited high specificity (0.964 and 0.827, respectively) but low sensitivity (0.117 and 0.328, respectively). After optimization, the MFS (≥65) improved specificity (0.592) with moderate sensitivity (0.689), the STRATIFY (≥1) increased sensitivity (0.856) while reducing specificity to 0.407, the Hendrich II (≥2) achieved specificity of 0.519 with sensitivity of 0.739, and the TUGT (≥26.6 seconds) maintained high sensitivity (0.739) but had a specificity of 0.622. The TUGT demonstrated the strongest overall predictive accuracy (area under the receiver operating characteristic curve, 0.722).
Conclusions and implications: All tools showed limitations in balancing sensitivity and specificity. Adjusting thresholds improved performance but did not yield optimal results. The findings highlight the importance of tailoring fall risk assessments to specific populations with thresholds adjusted to optimize performance. Future research should explore integrating clinical assessments with data-driven predictive models to enhance fall risk evaluation in long-term care settings.
期刊介绍:
JAMDA, the official journal of AMDA - The Society for Post-Acute and Long-Term Care Medicine, is a leading peer-reviewed publication that offers practical information and research geared towards healthcare professionals in the post-acute and long-term care fields. It is also a valuable resource for policy-makers, organizational leaders, educators, and advocates.
The journal provides essential information for various healthcare professionals such as medical directors, attending physicians, nurses, consultant pharmacists, geriatric psychiatrists, nurse practitioners, physician assistants, physical and occupational therapists, social workers, and others involved in providing, overseeing, and promoting quality