Predicting treatment pathways in Class II malocclusion patients using machine learning: A comparative study of four algorithms for classifying camouflage, growth modulation, and surgical decisions
{"title":"Predicting treatment pathways in Class II malocclusion patients using machine learning: A comparative study of four algorithms for classifying camouflage, growth modulation, and surgical decisions","authors":"Mukesh Kumar, Sumit Kumar, Malvika Agarwal, Ekta Yadav, Sougandhika Gandi","doi":"10.1016/j.ortho.2025.101070","DOIUrl":null,"url":null,"abstract":"<div><h3>Objectives</h3><div>The aim of this study was to develop a machine-learning model to assist in treatment decision-making for surgery, camouflage, and growth modulation in Class II malocclusion patients and to evaluate its validity and reliability.</div></div><div><h3>Material and methods</h3><div>A total of 506 Class II malocclusion patients were included in the study, with patients randomly assigned to a training set (405) and a test set (101). Four machine-learning (ML) models – logistic regression (LR), decision tree (DT), random forest (RF), and support vector machine (SVM) – were trained to predict the most suitable treatment approach: camouflage, growth modulation (GM), or surgery. The accuracy of treatment decisions was evaluated for each model, along with 95% confidence intervals (CIs). Additionally, the McNemar's test was used to assess the statistical significance of model performance.</div></div><div><h3>Results</h3><div>The AUC-PR values indicate that SVM and RF are the best-performing models, both achieving 1.00 for GM, 0.92 for camouflage, and 0.82 for surgery, demonstrating strong classification capabilities across all classes. LR performs well for GM (0.97<strong>)</strong> but struggles with camouflage and surgery (both 0.66), indicating inconsistencies. The DT has the lowest overall performance, with 0.62 for GM and camouflage, and 0.55 for surgery, suggesting weaker classification reliability. Given these results, SVM and RF emerge as the most effective models, offering the best balance of precision and recall across all classes.</div></div><div><h3>Conclusions</h3><div>Support vector machine and random forest demonstrate strong classification for growth modulation with high precision and recall, while camouflage remains stable until 80% recall before precision declines. Surgery involves greater trade-offs between precision and recall. This study further supports that ANB, Nasolabial angle, SNA, H angle, Age, Mandibular plane angle can be used as strong predictors in assessing patient's treatment needs.</div></div>","PeriodicalId":45449,"journal":{"name":"International Orthodontics","volume":"24 1","pages":"Article 101070"},"PeriodicalIF":1.9000,"publicationDate":"2025-10-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Orthodontics","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1761722725001056","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Objectives
The aim of this study was to develop a machine-learning model to assist in treatment decision-making for surgery, camouflage, and growth modulation in Class II malocclusion patients and to evaluate its validity and reliability.
Material and methods
A total of 506 Class II malocclusion patients were included in the study, with patients randomly assigned to a training set (405) and a test set (101). Four machine-learning (ML) models – logistic regression (LR), decision tree (DT), random forest (RF), and support vector machine (SVM) – were trained to predict the most suitable treatment approach: camouflage, growth modulation (GM), or surgery. The accuracy of treatment decisions was evaluated for each model, along with 95% confidence intervals (CIs). Additionally, the McNemar's test was used to assess the statistical significance of model performance.
Results
The AUC-PR values indicate that SVM and RF are the best-performing models, both achieving 1.00 for GM, 0.92 for camouflage, and 0.82 for surgery, demonstrating strong classification capabilities across all classes. LR performs well for GM (0.97) but struggles with camouflage and surgery (both 0.66), indicating inconsistencies. The DT has the lowest overall performance, with 0.62 for GM and camouflage, and 0.55 for surgery, suggesting weaker classification reliability. Given these results, SVM and RF emerge as the most effective models, offering the best balance of precision and recall across all classes.
Conclusions
Support vector machine and random forest demonstrate strong classification for growth modulation with high precision and recall, while camouflage remains stable until 80% recall before precision declines. Surgery involves greater trade-offs between precision and recall. This study further supports that ANB, Nasolabial angle, SNA, H angle, Age, Mandibular plane angle can be used as strong predictors in assessing patient's treatment needs.
期刊介绍:
Une revue de référence dans le domaine de orthodontie et des disciplines frontières Your reference in dentofacial orthopedics International Orthodontics adresse aux orthodontistes, aux dentistes, aux stomatologistes, aux chirurgiens maxillo-faciaux et aux plasticiens de la face, ainsi quà leurs assistant(e)s. International Orthodontics is addressed to orthodontists, dentists, stomatologists, maxillofacial surgeons and facial plastic surgeons, as well as their assistants.