Combining four patient-reported outcome measurement instruments to develop a common metric computerized adaptive test to measure outcome after total knee arthroplasty.

IF 4.6 1区医学 Q1 ORTHOPEDICS

Bone & Joint Journal Pub Date : 2025-09-01 DOI:10.1302/0301-620X.107B9.BJJ-2024-0993.R2

Chetan Khatri, Nick D Clement, Deborah J MacDonald, Chloe E H Scott, Andrew J Metcalfe, Jeremy N Rodrigues, Conrad J Harrison

{"title":"Combining four patient-reported outcome measurement instruments to develop a common metric computerized adaptive test to measure outcome after total knee arthroplasty.","authors":"Chetan Khatri, Nick D Clement, Deborah J MacDonald, Chloe E H Scott, Andrew J Metcalfe, Jeremy N Rodrigues, Conrad J Harrison","doi":"10.1302/0301-620X.107B9.BJJ-2024-0993.R2","DOIUrl":null,"url":null,"abstract":"Aims: To map the Oxford Knee Score (OKS), OKS Activity & Participation Questionnaire, Forgotten Joint Score (FJS), and High Activity Arthroplasty Score to a common scale using principles of modern test theory. Using the common scale, we then aimed to build a computerized adaptive test (CAT) to reduce item burden.Methods: Participants undergoing total knee arthroplasty provided preoperative and postoperative (six and 12 months) paired responses for four patient-reported outcome measure (PROM) instruments, with the OKS being the common linking instrument. After meeting assumptions (unidimensionality, monotonicity, local independence, and measurement invariance), a graded response model was fitted to the data. A CAT algorithm was developed.Results: There were 6,301 paired responses included. Confirmatory factor analysis demonstrated that the four PROM instruments measured a similar or very similar latent construct, which we describe as 'knee health'. The combination of all four instruments resulted in higher test-level information than any individual instrument. The CAT reduced 36 items to a median of eight items, while maintaining high measurement precision (standard error of measurement 0.29).Conclusion: The common metric provides a more precise measurement of knee health than any of the individual instruments, and avoids floor and ceiling effects inherent in individual scores. The CAT algorithm reduced the item burden to a median of eight items, which is lower than the FJS or OKS, when used as individual instruments. It will also allow for standardization of outcome reporting and pooling of results across databases that use any of the four instruments to facilitate individual and meta-analyses across different scores.","PeriodicalId":48944,"journal":{"name":"Bone & Joint Journal","volume":"107-B 9","pages":"885-895"},"PeriodicalIF":4.6000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bone & Joint Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1302/0301-620X.107B9.BJJ-2024-0993.R2","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ORTHOPEDICS","Score":null,"Total":0}

引用次数: 0

Abstract

Aims: To map the Oxford Knee Score (OKS), OKS Activity & Participation Questionnaire, Forgotten Joint Score (FJS), and High Activity Arthroplasty Score to a common scale using principles of modern test theory. Using the common scale, we then aimed to build a computerized adaptive test (CAT) to reduce item burden.

Methods: Participants undergoing total knee arthroplasty provided preoperative and postoperative (six and 12 months) paired responses for four patient-reported outcome measure (PROM) instruments, with the OKS being the common linking instrument. After meeting assumptions (unidimensionality, monotonicity, local independence, and measurement invariance), a graded response model was fitted to the data. A CAT algorithm was developed.

Results: There were 6,301 paired responses included. Confirmatory factor analysis demonstrated that the four PROM instruments measured a similar or very similar latent construct, which we describe as 'knee health'. The combination of all four instruments resulted in higher test-level information than any individual instrument. The CAT reduced 36 items to a median of eight items, while maintaining high measurement precision (standard error of measurement 0.29).

Conclusion: The common metric provides a more precise measurement of knee health than any of the individual instruments, and avoids floor and ceiling effects inherent in individual scores. The CAT algorithm reduced the item burden to a median of eight items, which is lower than the FJS or OKS, when used as individual instruments. It will also allow for standardization of outcome reporting and pooling of results across databases that use any of the four instruments to facilitate individual and meta-analyses across different scores.

查看原文本刊更多论文

结合四种患者报告的结果测量仪器，开发一种通用的度量计算机化自适应测试来测量全膝关节置换术后的结果。

目的：利用现代测试理论的原理，将牛津膝关节评分（OKS）、OKS活动与参与问卷、遗忘关节评分（FJS）和高活动关节成形术评分映射到一个共同的量表。使用通用量表，我们的目标是建立一个计算机化的自适应测试（CAT），以减少项目负担。方法：接受全膝关节置换术的参与者提供了术前和术后（6个月和12个月）对四种患者报告的结果测量（PROM）器械的配对反应，其中OKS是常见的连接器械。在满足假设条件（单维性、单调性、局部独立性和测量不变性）后，对数据进行梯度响应模型拟合。提出了一种CAT算法。结果：共纳入6301份配对问卷。验证性因素分析表明，四种PROM仪器测量了相似或非常相似的潜在结构，我们将其描述为“膝关节健康”。所有四种仪器的组合比任何单独的仪器产生更高的测试水平信息。CAT将36个项目减少到8个项目的中位数，同时保持了较高的测量精度（测量标准误差0.29）。结论：通用指标提供了比任何单个工具更精确的膝关节健康测量，并避免了个体评分固有的下限和上限效应。CAT算法将项目负担减少到8个项目的中位数，这比FJS或OKS作为单独的工具使用时要低。它还将允许结果报告的标准化和跨数据库的结果汇集，使用四种工具中的任何一种来促进不同分数的个人和元分析。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Bone & Joint Journal ORTHOPEDICS-SURGERY

CiteScore

9.40

自引率

10.90%

发文量

318

期刊介绍： We welcome original articles from any part of the world. The papers are assessed by members of the Editorial Board and our international panel of expert reviewers, then either accepted for publication or rejected by the Editor. We receive over 2000 submissions each year and accept about 250 for publication, many after revisions recommended by the reviewers, editors or statistical advisers. A decision usually takes between six and eight weeks. Each paper is assessed by two reviewers with a special interest in the subject covered by the paper, and also by members of the editorial team. Controversial papers will be discussed at a full meeting of the Editorial Board. Publication is between four and six months after acceptance.