Expert consensus-derived evaluation criteria for orthodontic treatment outcomes using a novel ranking method: A retrospective dental cast analysis study
{"title":"Expert consensus-derived evaluation criteria for orthodontic treatment outcomes using a novel ranking method: A retrospective dental cast analysis study","authors":"Huanhuan Chen, Hanwei Zheng, Yue Lai, Wei Li, Chenda Meng, Tianyi Wang, Guangying Song, Bing Han, Tianmin Xu","doi":"10.1016/j.ortho.2025.101057","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>This retrospective expert consensus study (PKUSSIRB No.202058145) aimed to establish expert consensus-derived evaluation criteria for orthodontic treatment outcomes using the Merge Ranking Method on post-treatment dental casts.</div></div><div><h3>Material and methods</h3><div>From patients treated at the Department of Orthodontics from January 2018 to December 2022, 216 cases were randomly selected for evaluation by 65 orthodontic experts using the Merge Ranking Method. Concurrently, nine objective indicators of the 216 post-treatment dental casts were measured by three researchers. The consistency analysis of experts’ subjective evaluation and researchers’ objective measurement was conducted, respectively. Through subjective-to-objective correlation analysis and regression analysis, the objective indicators significantly correlated with experts’ subjective evaluations were selected, their weights were determined, and the threshold values of grading evaluation were screened.</div></div><div><h3>Results</h3><div>The 65 orthodontic experts demonstrated: (1) moderate pairwise consistency (mean Spearman's ρ<!--> <!-->=<!--> <!-->0.560, 95% bootstrap CI: 0.556-0.564), (2) significant group-level concordance across two independent panels (Kendall's W<!--> <!-->=<!--> <!-->0.544–0.606, all <em>P</em> <!--><<!--> <!-->0.001), and (3) near-perfect cross-panel reliability for 24 overlapping cases (Kendall's τ-b<!--> <!-->=<!--> <!-->0.833–0.880, <em>P</em> <!--><<!--> <!-->0.001), confirming panel homogeneity for subsequent analyses. Inter-rater reliability among the three researchers showed excellent consistency (mean ICC<!--> <!-->=<!--> <!-->0.835, 95% CI: 0.788–0.882, range: 0.736–0.920), paralleled by high intra-rater reliability (mean ICC<!--> <!-->=<!--> <!-->0.832, 95% CI: 0.806–0.858, range: 0.715–0.948) across all 216 cases. Six objective indicators (occlusal relationship, overbite, alignment, overjet, occlusal contact, and buccal-lingual inclination) significantly predicted expert evaluations in a regression model (cumulative R<sup>2</sup> <!-->=<!--> <!-->0.598, <em>P</em> <!--><<!--> <!-->0.001). The threshold values for grading orthodontic treatment outcomes as Excellent, Good, Fair, Poor, and Worst were screened to be 1.846, 2.454, 3.492, and 4.312, respectively.</div></div><div><h3>Conclusions</h3><div>This expert consensus study demonstrated moderate consistency in subjective orthodontic outcome evaluation, with the occlusal relationship emerging as the primary quality determinant. The developed Merge Ranking Method addressed conventional ranking limitations through its innovative two-stage approach: initial segmented evaluation reduced expert fatigue, while subsequent dynamic adjustments improved borderline case classification.</div></div>","PeriodicalId":45449,"journal":{"name":"International Orthodontics","volume":"24 1","pages":"Article 101057"},"PeriodicalIF":1.9000,"publicationDate":"2025-09-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Orthodontics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1761722725000920","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
This retrospective expert consensus study (PKUSSIRB No.202058145) aimed to establish expert consensus-derived evaluation criteria for orthodontic treatment outcomes using the Merge Ranking Method on post-treatment dental casts.
Material and methods
From patients treated at the Department of Orthodontics from January 2018 to December 2022, 216 cases were randomly selected for evaluation by 65 orthodontic experts using the Merge Ranking Method. Concurrently, nine objective indicators of the 216 post-treatment dental casts were measured by three researchers. The consistency analysis of experts’ subjective evaluation and researchers’ objective measurement was conducted, respectively. Through subjective-to-objective correlation analysis and regression analysis, the objective indicators significantly correlated with experts’ subjective evaluations were selected, their weights were determined, and the threshold values of grading evaluation were screened.
Results
The 65 orthodontic experts demonstrated: (1) moderate pairwise consistency (mean Spearman's ρ = 0.560, 95% bootstrap CI: 0.556-0.564), (2) significant group-level concordance across two independent panels (Kendall's W = 0.544–0.606, all P < 0.001), and (3) near-perfect cross-panel reliability for 24 overlapping cases (Kendall's τ-b = 0.833–0.880, P < 0.001), confirming panel homogeneity for subsequent analyses. Inter-rater reliability among the three researchers showed excellent consistency (mean ICC = 0.835, 95% CI: 0.788–0.882, range: 0.736–0.920), paralleled by high intra-rater reliability (mean ICC = 0.832, 95% CI: 0.806–0.858, range: 0.715–0.948) across all 216 cases. Six objective indicators (occlusal relationship, overbite, alignment, overjet, occlusal contact, and buccal-lingual inclination) significantly predicted expert evaluations in a regression model (cumulative R2 = 0.598, P < 0.001). The threshold values for grading orthodontic treatment outcomes as Excellent, Good, Fair, Poor, and Worst were screened to be 1.846, 2.454, 3.492, and 4.312, respectively.
Conclusions
This expert consensus study demonstrated moderate consistency in subjective orthodontic outcome evaluation, with the occlusal relationship emerging as the primary quality determinant. The developed Merge Ranking Method addressed conventional ranking limitations through its innovative two-stage approach: initial segmented evaluation reduced expert fatigue, while subsequent dynamic adjustments improved borderline case classification.
期刊介绍:
Une revue de référence dans le domaine de orthodontie et des disciplines frontières Your reference in dentofacial orthopedics International Orthodontics adresse aux orthodontistes, aux dentistes, aux stomatologistes, aux chirurgiens maxillo-faciaux et aux plasticiens de la face, ainsi quà leurs assistant(e)s. International Orthodontics is addressed to orthodontists, dentists, stomatologists, maxillofacial surgeons and facial plastic surgeons, as well as their assistants.