Combining four patient-reported outcome measurement instruments to develop a common metric computerized adaptive test to measure outcome after total knee arthroplasty.
Chetan Khatri, Nick D Clement, Deborah J MacDonald, Chloe E H Scott, Andrew J Metcalfe, Jeremy N Rodrigues, Conrad J Harrison
{"title":"Combining four patient-reported outcome measurement instruments to develop a common metric computerized adaptive test to measure outcome after total knee arthroplasty.","authors":"Chetan Khatri, Nick D Clement, Deborah J MacDonald, Chloe E H Scott, Andrew J Metcalfe, Jeremy N Rodrigues, Conrad J Harrison","doi":"10.1302/0301-620X.107B9.BJJ-2024-0993.R2","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>To map the Oxford Knee Score (OKS), OKS Activity & Participation Questionnaire, Forgotten Joint Score (FJS), and High Activity Arthroplasty Score to a common scale using principles of modern test theory. Using the common scale, we then aimed to build a computerized adaptive test (CAT) to reduce item burden.</p><p><strong>Methods: </strong>Participants undergoing total knee arthroplasty provided preoperative and postoperative (six and 12 months) paired responses for four patient-reported outcome measure (PROM) instruments, with the OKS being the common linking instrument. After meeting assumptions (unidimensionality, monotonicity, local independence, and measurement invariance), a graded response model was fitted to the data. A CAT algorithm was developed.</p><p><strong>Results: </strong>There were 6,301 paired responses included. Confirmatory factor analysis demonstrated that the four PROM instruments measured a similar or very similar latent construct, which we describe as 'knee health'. The combination of all four instruments resulted in higher test-level information than any individual instrument. The CAT reduced 36 items to a median of eight items, while maintaining high measurement precision (standard error of measurement 0.29).</p><p><strong>Conclusion: </strong>The common metric provides a more precise measurement of knee health than any of the individual instruments, and avoids floor and ceiling effects inherent in individual scores. The CAT algorithm reduced the item burden to a median of eight items, which is lower than the FJS or OKS, when used as individual instruments. It will also allow for standardization of outcome reporting and pooling of results across databases that use any of the four instruments to facilitate individual and meta-analyses across different scores.</p>","PeriodicalId":48944,"journal":{"name":"Bone & Joint Journal","volume":"107-B 9","pages":"885-895"},"PeriodicalIF":4.6000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Bone & Joint Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1302/0301-620X.107B9.BJJ-2024-0993.R2","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ORTHOPEDICS","Score":null,"Total":0}
引用次数: 0
Abstract
Aims: To map the Oxford Knee Score (OKS), OKS Activity & Participation Questionnaire, Forgotten Joint Score (FJS), and High Activity Arthroplasty Score to a common scale using principles of modern test theory. Using the common scale, we then aimed to build a computerized adaptive test (CAT) to reduce item burden.
Methods: Participants undergoing total knee arthroplasty provided preoperative and postoperative (six and 12 months) paired responses for four patient-reported outcome measure (PROM) instruments, with the OKS being the common linking instrument. After meeting assumptions (unidimensionality, monotonicity, local independence, and measurement invariance), a graded response model was fitted to the data. A CAT algorithm was developed.
Results: There were 6,301 paired responses included. Confirmatory factor analysis demonstrated that the four PROM instruments measured a similar or very similar latent construct, which we describe as 'knee health'. The combination of all four instruments resulted in higher test-level information than any individual instrument. The CAT reduced 36 items to a median of eight items, while maintaining high measurement precision (standard error of measurement 0.29).
Conclusion: The common metric provides a more precise measurement of knee health than any of the individual instruments, and avoids floor and ceiling effects inherent in individual scores. The CAT algorithm reduced the item burden to a median of eight items, which is lower than the FJS or OKS, when used as individual instruments. It will also allow for standardization of outcome reporting and pooling of results across databases that use any of the four instruments to facilitate individual and meta-analyses across different scores.
期刊介绍:
We welcome original articles from any part of the world. The papers are assessed by members of the Editorial Board and our international panel of expert reviewers, then either accepted for publication or rejected by the Editor. We receive over 2000 submissions each year and accept about 250 for publication, many after revisions recommended by the reviewers, editors or statistical advisers. A decision usually takes between six and eight weeks. Each paper is assessed by two reviewers with a special interest in the subject covered by the paper, and also by members of the editorial team. Controversial papers will be discussed at a full meeting of the Editorial Board. Publication is between four and six months after acceptance.