两项公开报告的结果质量措施在退伍军人健康管理局内表征医疗保健质量的不可靠性。

IF 3.2 2区医学 Q2 HEALTH CARE SCIENCES & SERVICES

Health Services Research Pub Date : 2025-10-01 DOI:10.1111/1475-6773.70050

Kenneth J Nieser, Daniel J Tancredi, Alex H S Harris

{"title":"两项公开报告的结果质量措施在退伍军人健康管理局内表征医疗保健质量的不可靠性。","authors":"Kenneth J Nieser, Daniel J Tancredi, Alex H S Harris","doi":"10.1111/1475-6773.70050","DOIUrl":null,"url":null,"abstract":"Objective: To estimate the reliability of two outcome quality measures in Veterans Health Administration (VHA) data using three different methods.Study setting and design: We created two cohorts of VHA patients meeting criteria for two measures: (1) risk-standardized complication rates following elective primary total hip arthroplasty and/or total knee arthroplasty (THA/TKA), and (2) risk-standardized mortality rates following acute myocardial infarction hospitalization (AMI). We fit hierarchical logistic regression models and calculated facility-level risk-standardized rates. We estimated entity-level reliability using three commonly applied methods: (1) delta method approximation; (2) latent scale model; (3) split-sample method.Data sources and analytic sample: For each measure, we extracted risk adjustment and outcome data from the VHA Corporate Data Warehouse for patients meeting eligibility criteria in fiscal years 2021 and 2022.Principal findings: Most facilities had complication rates following total hip and/or knee arthroplasty and mortality rates following hospitalization for acute myocardial infarction that, statistically, were no different from the national average. Reliability estimates based on delta method approximation (0.14 for THA/TKA; 0.12 for AMI) and the split-sample method (0.12 for THA/TKA; 0.19 for AMI) were very low for both measures. As we varied the sample sizes, we found that much higher sample sizes would be needed to reliably differentiate quality of care across facilities. On the other hand, reliability estimates based on the latent scale model were substantially higher than the other two methods (0.64 for THA/TKA; 0.41 for AMI), suggesting that there is substantially more between-facility variation in latent quality than manifests in observed outcomes.Conclusions: Reliability estimates based on the latent scale approach are not numerically or conceptually interchangeable with estimates based on the other two approaches. Given that health outcomes are generally reported using observed outcomes, reliability estimation based on the latent scale approach should not be used without a strong rationale.","PeriodicalId":55065,"journal":{"name":"Health Services Research","volume":" ","pages":"e70050"},"PeriodicalIF":3.2000,"publicationDate":"2025-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"The Unreliability of Two Publicly Reported Outcome Quality Measures for Characterizing Health Care Quality Within the Veterans Health Administration.\",\"authors\":\"Kenneth J Nieser, Daniel J Tancredi, Alex H S Harris\",\"doi\":\"10.1111/1475-6773.70050\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Objective: To estimate the reliability of two outcome quality measures in Veterans Health Administration (VHA) data using three different methods.Study setting and design: We created two cohorts of VHA patients meeting criteria for two measures: (1) risk-standardized complication rates following elective primary total hip arthroplasty and/or total knee arthroplasty (THA/TKA), and (2) risk-standardized mortality rates following acute myocardial infarction hospitalization (AMI). We fit hierarchical logistic regression models and calculated facility-level risk-standardized rates. We estimated entity-level reliability using three commonly applied methods: (1) delta method approximation; (2) latent scale model; (3) split-sample method.Data sources and analytic sample: For each measure, we extracted risk adjustment and outcome data from the VHA Corporate Data Warehouse for patients meeting eligibility criteria in fiscal years 2021 and 2022.Principal findings: Most facilities had complication rates following total hip and/or knee arthroplasty and mortality rates following hospitalization for acute myocardial infarction that, statistically, were no different from the national average. Reliability estimates based on delta method approximation (0.14 for THA/TKA; 0.12 for AMI) and the split-sample method (0.12 for THA/TKA; 0.19 for AMI) were very low for both measures. As we varied the sample sizes, we found that much higher sample sizes would be needed to reliably differentiate quality of care across facilities. On the other hand, reliability estimates based on the latent scale model were substantially higher than the other two methods (0.64 for THA/TKA; 0.41 for AMI), suggesting that there is substantially more between-facility variation in latent quality than manifests in observed outcomes.Conclusions: Reliability estimates based on the latent scale approach are not numerically or conceptually interchangeable with estimates based on the other two approaches. Given that health outcomes are generally reported using observed outcomes, reliability estimation based on the latent scale approach should not be used without a strong rationale.\",\"PeriodicalId\":55065,\"journal\":{\"name\":\"Health Services Research\",\"volume\":\" \",\"pages\":\"e70050\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2025-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Health Services Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1111/1475-6773.70050\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"HEALTH CARE SCIENCES & SERVICES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Services Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/1475-6773.70050","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}

引用次数: 0

摘要

目的：用三种不同的方法评估退伍军人健康管理局（VHA）数据中两种结局质量指标的可靠性。研究背景和设计：我们创建了两个符合两项标准的VHA患者队列：(1)选择性原发性全髋关节置换术和/或全膝关节置换术（THA/TKA）后风险标准化并发症发生率，(2)急性心肌梗死住院（AMI）后风险标准化死亡率。我们拟合了层次逻辑回归模型，并计算了设施级别的风险标准化率。我们使用三种常用的方法估计实体级可靠性：(1)delta法近似；(2)潜标模型；(3)分样法。数据来源和分析样本：对于每项测量，我们从VHA公司数据仓库中提取了2021和2022财政年度符合资格标准的患者的风险调整和结果数据。主要发现：在统计上，大多数设施的全髋关节和/或膝关节置换术后并发症发生率和急性心肌梗死住院后死亡率与全国平均水平没有差异。基于delta法近似的可靠性估计（THA/TKA为0.14，AMI为0.12）和分裂样本法（THA/TKA为0.12，AMI为0.19）对于这两种测量方法都非常低。当我们改变样本量时，我们发现需要更大的样本量来可靠地区分不同设施的护理质量。另一方面，基于潜在尺度模型的信度估计明显高于其他两种方法（THA/TKA为0.64，AMI为0.41），这表明设施之间的潜在质量差异明显大于观察结果。结论：基于潜在量表方法的信度估计与基于其他两种方法的估计在数字或概念上不可互换。鉴于健康结果通常是用观察结果来报告的，因此，在没有充分理由的情况下，不应使用基于潜在量表方法的可靠性估计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

The Unreliability of Two Publicly Reported Outcome Quality Measures for Characterizing Health Care Quality Within the Veterans Health Administration.

Objective: To estimate the reliability of two outcome quality measures in Veterans Health Administration (VHA) data using three different methods.

Study setting and design: We created two cohorts of VHA patients meeting criteria for two measures: (1) risk-standardized complication rates following elective primary total hip arthroplasty and/or total knee arthroplasty (THA/TKA), and (2) risk-standardized mortality rates following acute myocardial infarction hospitalization (AMI). We fit hierarchical logistic regression models and calculated facility-level risk-standardized rates. We estimated entity-level reliability using three commonly applied methods: (1) delta method approximation; (2) latent scale model; (3) split-sample method.

Data sources and analytic sample: For each measure, we extracted risk adjustment and outcome data from the VHA Corporate Data Warehouse for patients meeting eligibility criteria in fiscal years 2021 and 2022.

Principal findings: Most facilities had complication rates following total hip and/or knee arthroplasty and mortality rates following hospitalization for acute myocardial infarction that, statistically, were no different from the national average. Reliability estimates based on delta method approximation (0.14 for THA/TKA; 0.12 for AMI) and the split-sample method (0.12 for THA/TKA; 0.19 for AMI) were very low for both measures. As we varied the sample sizes, we found that much higher sample sizes would be needed to reliably differentiate quality of care across facilities. On the other hand, reliability estimates based on the latent scale model were substantially higher than the other two methods (0.64 for THA/TKA; 0.41 for AMI), suggesting that there is substantially more between-facility variation in latent quality than manifests in observed outcomes.

Conclusions: Reliability estimates based on the latent scale approach are not numerically or conceptually interchangeable with estimates based on the other two approaches. Given that health outcomes are generally reported using observed outcomes, reliability estimation based on the latent scale approach should not be used without a strong rationale.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Health Services Research 医学-卫生保健

CiteScore

4.80

自引率

5.90%

发文量

193

审稿时长

4-8 weeks

期刊介绍： Health Services Research (HSR) is a peer-reviewed scholarly journal that provides researchers and public and private policymakers with the latest research findings, methods, and concepts related to the financing, organization, delivery, evaluation, and outcomes of health services. Rated as one of the top journals in the fields of health policy and services and health care administration, HSR publishes outstanding articles reporting the findings of original investigations that expand knowledge and understanding of the wide-ranging field of health care and that will help to improve the health of individuals and communities.