M. Hassan Murad , Rebecca L. Morgan , Yngve Falck-Ytter , Reem A. Mustafa , Shahnaz Sultan , Philipp Dahm , Madelin R. Siedler , Osama Altayar , Perica Davitkov , Syed Arsalan Ahmed Naqvi , Irbaz Bin Riaz , Zhen Wang , Lifeng Lin
{"title":"Simultaneous evaluation of the imprecision and inconsistency domains of GRADE can be performed using prediction intervals","authors":"M. Hassan Murad , Rebecca L. Morgan , Yngve Falck-Ytter , Reem A. Mustafa , Shahnaz Sultan , Philipp Dahm , Madelin R. Siedler , Osama Altayar , Perica Davitkov , Syed Arsalan Ahmed Naqvi , Irbaz Bin Riaz , Zhen Wang , Lifeng Lin","doi":"10.1016/j.jclinepi.2024.111543","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>To explore the use of prediction interval (PI) for the simultaneous evaluation of the imprecision and inconsistency domains of Grading of Recommendations, Assessment, and Evaluation using stakeholder-provided decision thresholds.</div></div><div><h3>Study Design and Setting</h3><div>We propose transforming the PI of a meta-analysis from a relative risk scale to an absolute risk difference using an appropriate baseline risk. The transformed PI is compared to stakeholder-provided thresholds on an absolute scale. We applied this approach to a large convenience sample of meta-analyses extracted from the Cochrane Database of Systematic Reviews and compared it against the traditional approach of rating imprecision and inconsistency separately using confidence intervals and statistical measures of heterogeneity, respectively. We used empirically derived thresholds following Grading of Recommendations, Assessment, and Evaluation guidance.</div></div><div><h3>Results</h3><div>The convenience sample consisted of 2516 meta-analyses (median of 7 studies per meta-analysis; interquartile range: 5–11). The main analysis showed the percentage of meta-analyses in which both approaches had the same number of certainty levels rated down was 59%. The PI approach led to more levels of rating down (lower certainty) in 27% and to fewer levels of rating down (higher certainty) in 14%. Multiple sensitivity analyses using different thresholds showed similar results, but the PI approach had particularly increased width with a larger number of included studies and higher I<sup>2</sup> values.</div></div><div><h3>Conclusion</h3><div>Using the PI for simultaneous evaluation of imprecision and inconsistency seems feasible and logical but can lead to lower certainty ratings. The PI-based approach requires further testing in future systematic reviews and guidelines using context-specific thresholds and evidence-to-decision criteria.</div></div><div><h3>Plain Language Summary</h3><div>The prediction interval (PI) addresses both the imprecision and inconsistency domains of certainty. In this study, we applied this PI approach to simultaneously judge both domains and compared this to the traditional approach of making these separate judgments. The 2 approaches had moderate agreement. The PI-based approach requires further testing in future systematic reviews and guidelines using context-specific thresholds and evidence-to-decision criteria.</div></div>","PeriodicalId":51079,"journal":{"name":"Journal of Clinical Epidemiology","volume":"175 ","pages":"Article 111543"},"PeriodicalIF":7.3000,"publicationDate":"2024-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Clinical Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0895435624002993","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives
To explore the use of prediction interval (PI) for the simultaneous evaluation of the imprecision and inconsistency domains of Grading of Recommendations, Assessment, and Evaluation using stakeholder-provided decision thresholds.
Study Design and Setting
We propose transforming the PI of a meta-analysis from a relative risk scale to an absolute risk difference using an appropriate baseline risk. The transformed PI is compared to stakeholder-provided thresholds on an absolute scale. We applied this approach to a large convenience sample of meta-analyses extracted from the Cochrane Database of Systematic Reviews and compared it against the traditional approach of rating imprecision and inconsistency separately using confidence intervals and statistical measures of heterogeneity, respectively. We used empirically derived thresholds following Grading of Recommendations, Assessment, and Evaluation guidance.
Results
The convenience sample consisted of 2516 meta-analyses (median of 7 studies per meta-analysis; interquartile range: 5–11). The main analysis showed the percentage of meta-analyses in which both approaches had the same number of certainty levels rated down was 59%. The PI approach led to more levels of rating down (lower certainty) in 27% and to fewer levels of rating down (higher certainty) in 14%. Multiple sensitivity analyses using different thresholds showed similar results, but the PI approach had particularly increased width with a larger number of included studies and higher I2 values.
Conclusion
Using the PI for simultaneous evaluation of imprecision and inconsistency seems feasible and logical but can lead to lower certainty ratings. The PI-based approach requires further testing in future systematic reviews and guidelines using context-specific thresholds and evidence-to-decision criteria.
Plain Language Summary
The prediction interval (PI) addresses both the imprecision and inconsistency domains of certainty. In this study, we applied this PI approach to simultaneously judge both domains and compared this to the traditional approach of making these separate judgments. The 2 approaches had moderate agreement. The PI-based approach requires further testing in future systematic reviews and guidelines using context-specific thresholds and evidence-to-decision criteria.
期刊介绍:
The Journal of Clinical Epidemiology strives to enhance the quality of clinical and patient-oriented healthcare research by advancing and applying innovative methods in conducting, presenting, synthesizing, disseminating, and translating research results into optimal clinical practice. Special emphasis is placed on training new generations of scientists and clinical practice leaders.