{"title":"Accuracy of artificial intelligence in orthodontic extraction treatment planning: a systematic review and meta analysis.","authors":"SeyedMehdi Ziaei, Dorsa Samani, Mohammadreza Behjati, Ava Ostovar Ravari, Yasaman Salimi, Sina Ahmadi, Sahar Rajaei, Farnoosh Alimohammadi, Soheil Raji, Niloofar Deravi, Haleh Fakhimi","doi":"10.1186/s12903-025-06880-9","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>This study aimed to evaluate the diagnostic accuracy of artificial intelligence (AI) models in predicting dental extractions during orthodontic treatment planning.</p><p><strong>Method: </strong>A systematic review and meta-analysis were conducted following PRISMA guidelines and registered in PROSPERO (CRD42024582455). Comprehensive searches were performed across PubMed, Scopus, Web Of Science, and Google Scholar up to June 2, 2025. Eligible cross-sectional studies assessing AI-based models against clinical standards were included. Data on model performance were extracted and pooled using a random-effects model. Subgroup and meta-regression analyses were conducted to explore heterogeneity.</p><p><strong>Results: </strong>Seven cross-sectional studies from six countries with a combined sample of 6,261 patients were included. Pooled sensitivity and specificity of AI models were 70% (95% CI: 61-78) and 90% (95% CI: 87-92), respectively, though heterogeneity was high (I² = 96.7% and 93.7%). Convolutional neural networks (CNN)-based models (ResNet and VGG) demonstrated the highest diagnostic performance with no heterogeneity. Meta-regression showed that disease prevalence significantly influenced sensitivity (p = 0.050). Funnel plots revealed asymmetry, suggesting possible publication bias.</p><p><strong>Conclusion: </strong>AI models, particularly CNN-based models, show promising accuracy in predicting the need for orthodontic extractions. Therefore, they can be used to create predictive models for orthodontic extractions to increase accuracy. Due to the high heterogeneity, further large-scale studies are needed to support clinical implementation.</p>","PeriodicalId":9072,"journal":{"name":"BMC Oral Health","volume":"25 1","pages":"1576"},"PeriodicalIF":3.1000,"publicationDate":"2025-10-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12512631/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Oral Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12903-025-06880-9","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0
Abstract
Background: This study aimed to evaluate the diagnostic accuracy of artificial intelligence (AI) models in predicting dental extractions during orthodontic treatment planning.
Method: A systematic review and meta-analysis were conducted following PRISMA guidelines and registered in PROSPERO (CRD42024582455). Comprehensive searches were performed across PubMed, Scopus, Web Of Science, and Google Scholar up to June 2, 2025. Eligible cross-sectional studies assessing AI-based models against clinical standards were included. Data on model performance were extracted and pooled using a random-effects model. Subgroup and meta-regression analyses were conducted to explore heterogeneity.
Results: Seven cross-sectional studies from six countries with a combined sample of 6,261 patients were included. Pooled sensitivity and specificity of AI models were 70% (95% CI: 61-78) and 90% (95% CI: 87-92), respectively, though heterogeneity was high (I² = 96.7% and 93.7%). Convolutional neural networks (CNN)-based models (ResNet and VGG) demonstrated the highest diagnostic performance with no heterogeneity. Meta-regression showed that disease prevalence significantly influenced sensitivity (p = 0.050). Funnel plots revealed asymmetry, suggesting possible publication bias.
Conclusion: AI models, particularly CNN-based models, show promising accuracy in predicting the need for orthodontic extractions. Therefore, they can be used to create predictive models for orthodontic extractions to increase accuracy. Due to the high heterogeneity, further large-scale studies are needed to support clinical implementation.
期刊介绍:
BMC Oral Health is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of disorders of the mouth, teeth and gums, as well as related molecular genetics, pathophysiology, and epidemiology.