{"title":"Deep Learning Based Evaluation of Skeletal Maturation: A Comparative Analysis of Five Hand-Wrist Methods.","authors":"Serhat Tentaş, Samet Özden","doi":"10.1111/ocr.70008","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>The study aims to evaluate the effectiveness of deep learning algorithms in skeletal age estimation by comparing the diagnostic reliability of five different hand-wrist maturation (HWM) assessment methods.</p><p><strong>Materials and methods: </strong>A total of 6572 hand-wrist radiographs from orthodontic patients aged 8-16 years were retrospectively analysed. Radiographs were categorised into five groups based on HWM classification methods: (I) Björk's nine-stage, (II) Hägg and Taranger's five-stage, (III) Chapman's four-stage, (IV) three-stage hook of hamate ossification based and (V) simplified three-stage Björk's classification based. YOLOv8x-based deep learning models were trained separately for each group. The dataset was split into training, validation and test subsets. Performance was evaluated using accuracy, precision, recall, F1 score and AUC metrics.</p><p><strong>Results: </strong>The YOLOv8x-cls model demonstrated high classification performance across all five groups. Group IV and Group II achieved the highest accuracy and F1 scores, with average F1 values of 0.99 and 0.96, respectively. Group III and Group V also showed strong performance (F1 = 0.93 and 0.92). In Group I, slightly lower classification performance was observed in the S-H2 and MP3-Cap stages (F1 = 0.72-0.74), which correspond to the pubertal growth peak, while early and late skeletal maturation stages were classified with high accuracy and F1 scores. ROC curve analysis further supported these findings, with AUC values for MP3-Cap and S-H2 recorded as 0.70 and 0.75, respectively, whereas higher AUC values were achieved in most other stages across all groups.</p><p><strong>Conclusion: </strong>Deep learning models proved effective in evaluating skeletal maturation across five different HWM methods. Particularly high performance was observed in anatomically distinct regions such as the MP3, adductor sesamoid and hamate bone, which can be reliably identified by general dentists, enabling earlier referrals and timely orthodontic interventions.</p>","PeriodicalId":19652,"journal":{"name":"Orthodontics & Craniofacial Research","volume":" ","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Orthodontics & Craniofacial Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/ocr.70008","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: The study aims to evaluate the effectiveness of deep learning algorithms in skeletal age estimation by comparing the diagnostic reliability of five different hand-wrist maturation (HWM) assessment methods.
Materials and methods: A total of 6572 hand-wrist radiographs from orthodontic patients aged 8-16 years were retrospectively analysed. Radiographs were categorised into five groups based on HWM classification methods: (I) Björk's nine-stage, (II) Hägg and Taranger's five-stage, (III) Chapman's four-stage, (IV) three-stage hook of hamate ossification based and (V) simplified three-stage Björk's classification based. YOLOv8x-based deep learning models were trained separately for each group. The dataset was split into training, validation and test subsets. Performance was evaluated using accuracy, precision, recall, F1 score and AUC metrics.
Results: The YOLOv8x-cls model demonstrated high classification performance across all five groups. Group IV and Group II achieved the highest accuracy and F1 scores, with average F1 values of 0.99 and 0.96, respectively. Group III and Group V also showed strong performance (F1 = 0.93 and 0.92). In Group I, slightly lower classification performance was observed in the S-H2 and MP3-Cap stages (F1 = 0.72-0.74), which correspond to the pubertal growth peak, while early and late skeletal maturation stages were classified with high accuracy and F1 scores. ROC curve analysis further supported these findings, with AUC values for MP3-Cap and S-H2 recorded as 0.70 and 0.75, respectively, whereas higher AUC values were achieved in most other stages across all groups.
Conclusion: Deep learning models proved effective in evaluating skeletal maturation across five different HWM methods. Particularly high performance was observed in anatomically distinct regions such as the MP3, adductor sesamoid and hamate bone, which can be reliably identified by general dentists, enabling earlier referrals and timely orthodontic interventions.
期刊介绍:
Orthodontics & Craniofacial Research - Genes, Growth and Development is published to serve its readers as an international forum for the presentation and critical discussion of issues pertinent to the advancement of the specialty of orthodontics and the evidence-based knowledge of craniofacial growth and development. This forum is based on scientifically supported information, but also includes minority and conflicting opinions.
The objective of the journal is to facilitate effective communication between the research community and practicing clinicians. Original papers of high scientific quality that report the findings of clinical trials, clinical epidemiology, and novel therapeutic or diagnostic approaches are appropriate submissions. Similarly, we welcome papers in genetics, developmental biology, syndromology, surgery, speech and hearing, and other biomedical disciplines related to clinical orthodontics and normal and abnormal craniofacial growth and development. In addition to original and basic research, the journal publishes concise reviews, case reports of substantial value, invited essays, letters, and announcements.
The journal is published quarterly. The review of submitted papers will be coordinated by the editor and members of the editorial board. It is policy to review manuscripts within 3 to 4 weeks of receipt and to publish within 3 to 6 months of acceptance.