Assessment of the quality of different commercial providers using artificial intelligence for automated cephalometric analysis compared to human orthodontic experts.

IF 1.6 4区医学 Q3 DENTISTRY, ORAL SURGERY & MEDICINE

Journal of Orofacial Orthopedics-Fortschritte Der Kieferorthopadie Pub Date : 2025-05-01 Epub Date: 2023-08-29 DOI:10.1007/s00056-023-00491-1

Felix Kunz, Angelika Stellzig-Eisenhauer, Lisa Marie Widmaier, Florian Zeman, Julian Boldt

{"title":"Assessment of the quality of different commercial providers using artificial intelligence for automated cephalometric analysis compared to human orthodontic experts.","authors":"Felix Kunz, Angelika Stellzig-Eisenhauer, Lisa Marie Widmaier, Florian Zeman, Julian Boldt","doi":"10.1007/s00056-023-00491-1","DOIUrl":null,"url":null,"abstract":"Purpose: The aim of this investigation was to evaluate the accuracy of various skeletal and dental cephalometric parameters as produced by different commercial providers that make use of artificial intelligence (AI)-assisted automated cephalometric analysis and to compare their quality to a gold standard established by orthodontic experts.Methods: Twelve experienced orthodontic examiners pinpointed 15 radiographic landmarks on a total of 50 cephalometric X‑rays. The landmarks were used to generate 9 parameters for orthodontic treatment planning. The \"humans' gold standard\" was defined by calculating the median value of all 12 human assessments for each parameter, which in turn served as reference values for comparisons with results given by four different commercial providers of automated cephalometric analyses (DentaliQ.ortho [CellmatiQ GmbH, Hamburg, Germany], WebCeph [AssembleCircle Corp, Seongnam-si, Korea], AudaxCeph [Audax d.o.o., Ljubljana, Slovenia], CephX [Orca Dental AI, Herzliya, Israel]). Repeated measures analysis of variances (ANOVAs) were calculated and Bland-Altman plots were generated for comparisons.Results: The results of the repeated measures ANOVAs indicated significant differences between the commercial providers' predictions and the humans' gold standard for all nine investigated parameters. However, the pairwise comparisons also demonstrate that there were major differences among the four commercial providers. While there were no significant mean differences between the values of DentaliQ.ortho and the humans' gold standard, the predictions of AudaxCeph showed significant deviations in seven out of nine parameters. Also, the Bland-Altman plots demonstrate that a reduced precision of AI predictions must be expected especially for values attributed to the inclination of the incisors.Conclusion: Fully automated cephalometric analyses are promising in terms of timesaving and avoidance of individual human errors. At present, however, they should only be used under supervision of experienced clinicians.","PeriodicalId":54776,"journal":{"name":"Journal of Orofacial Orthopedics-Fortschritte Der Kieferorthopadie","volume":" ","pages":"145-160"},"PeriodicalIF":1.6000,"publicationDate":"2025-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12043786/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Orofacial Orthopedics-Fortschritte Der Kieferorthopadie","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00056-023-00491-1","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/8/29 0:00:00","PubModel":"Epub","JCR":"Q3","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: The aim of this investigation was to evaluate the accuracy of various skeletal and dental cephalometric parameters as produced by different commercial providers that make use of artificial intelligence (AI)-assisted automated cephalometric analysis and to compare their quality to a gold standard established by orthodontic experts.

Methods: Twelve experienced orthodontic examiners pinpointed 15 radiographic landmarks on a total of 50 cephalometric X‑rays. The landmarks were used to generate 9 parameters for orthodontic treatment planning. The "humans' gold standard" was defined by calculating the median value of all 12 human assessments for each parameter, which in turn served as reference values for comparisons with results given by four different commercial providers of automated cephalometric analyses (DentaliQ.ortho [CellmatiQ GmbH, Hamburg, Germany], WebCeph [AssembleCircle Corp, Seongnam-si, Korea], AudaxCeph [Audax d.o.o., Ljubljana, Slovenia], CephX [Orca Dental AI, Herzliya, Israel]). Repeated measures analysis of variances (ANOVAs) were calculated and Bland-Altman plots were generated for comparisons.

Results: The results of the repeated measures ANOVAs indicated significant differences between the commercial providers' predictions and the humans' gold standard for all nine investigated parameters. However, the pairwise comparisons also demonstrate that there were major differences among the four commercial providers. While there were no significant mean differences between the values of DentaliQ.ortho and the humans' gold standard, the predictions of AudaxCeph showed significant deviations in seven out of nine parameters. Also, the Bland-Altman plots demonstrate that a reduced precision of AI predictions must be expected especially for values attributed to the inclination of the incisors.

Conclusion: Fully automated cephalometric analyses are promising in terms of timesaving and avoidance of individual human errors. At present, however, they should only be used under supervision of experienced clinicians.

Abstract Image

查看原文本刊更多论文

与人类正畸专家相比，使用人工智能进行自动颅面测量分析的不同商业供应商的质量评估。

目的：本研究的目的是评估不同商业供应商利用人工智能（AI）辅助的自动头测分析产生的各种骨骼和牙齿头测参数的准确性，并将其质量与正畸专家建立的金标准进行比较。方法：12名经验丰富的正畸检查人员在总共50张头颅X线片上确定了15个放射学标志。利用这些标志生成9个参数，用于制定正畸治疗计划。“人类的黄金标准”是通过计算每个参数的所有12个人类评估的中位数来定义的，这反过来又作为与四个不同的自动头测量分析商业提供商（DentaliQ）给出的结果进行比较的参考值。ortho [CellmatiQ GmbH，汉堡，德国]，WebCeph [AssembleCircle Corp，韩国城南市]，AudaxCeph [Audax d.o.o，卢伯雅那，斯洛文尼亚]，CephX [Orca Dental AI，赫兹利亚，以色列])。计算重复测量方差分析（anova），并生成Bland-Altman图进行比较。结果：重复测量方差分析的结果表明，商业供应商的预测与人类的黄金标准在所有9个调查参数之间存在显著差异。然而，两两比较也表明，四个商业供应商之间存在重大差异。而DentaliQ值之间的平均值无显著差异。与人类的黄金标准相比，AudaxCeph的预测在9个参数中有7个显示出明显的偏差。此外，Bland-Altman图表明，人工智能预测的精度必须降低，特别是对于归因于门牙倾斜的值。结论：全自动头颅测量分析在节省时间和避免个人人为错误方面是有希望的。然而，目前它们只能在有经验的临床医生的监督下使用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Orofacial Orthopedics-Fortschritte Der Kieferorthopadie 医学-牙科与口腔外科

CiteScore

3.90

自引率

0.00%

发文量

审稿时长

>12 weeks

期刊介绍： The Journal of Orofacial Orthopedics provides orthodontists and dentists who are also actively interested in orthodontics, whether in university clinics or private practice, with highly authoritative and up-to-date information based on experimental and clinical research. The journal is one of the leading publications for the promulgation of the results of original work both in the areas of scientific and clinical orthodontics and related areas. All articles undergo peer review before publication. The German Society of Orthodontics (DGKFO) also publishes in the journal important communications, statements and announcements.