Multi-label diagnosis of dental conditions from panoramic x-rays using attention-enhanced deep learning.

IF 1.8

Oral and maxillofacial surgery Pub Date : 2025-10-07 DOI:10.1007/s10006-025-01463-y

Zahra Raeisi, Shayan Rokhva, Fatemeh Rahmani, Ali Goodarzi, Hossein Najafzadeh

{"title":"Multi-label diagnosis of dental conditions from panoramic x-rays using attention-enhanced deep learning.","authors":"Zahra Raeisi, Shayan Rokhva, Fatemeh Rahmani, Ali Goodarzi, Hossein Najafzadeh","doi":"10.1007/s10006-025-01463-y","DOIUrl":null,"url":null,"abstract":"Objective: This study aimed to develop and evaluate automated deep learning models for multi-class classification of dental conditions in panoramic X-ray images, comparing the effectiveness of custom CNN architectures with attention mechanisms, pre-trained models, and hybrid approaches.Methodology: A dataset of 1,512 panoramic dental X-rays was preprocessed through segmentation, creating 4,764 class-balanced images across four categories: Fillings, Cavity, Implant, and Impacted Tooth. Data augmentation and preprocessing techniques including brightness adjustment, CLAHE enhancement, and normalization were applied. Multiple architectures were evaluated: custom CNN with attention mechanism, pre-trained models (VGG16, ResNet50, Xception) with attention integration, and hybrid CNN-machine learning approaches (CNN + SVM, CNN + Random Forest, CNN + Decision Tree). Performance was assessed using 5-fold cross-validation with accuracy, precision, recall, F1-score, and ROC-AUC metrics.Results: The hybrid CNN + Random Forest model with preprocessing achieved the highest performance: 90.6% accuracy, 0.987 ROC-AUC, and 0.906 F1-score. Preprocessing consistently improved performance across all architectures, with accuracy gains ranging from 6.3% (VGG16) to 19.4% (ResNet50). The custom CNN with attention mechanism reached 86.0% accuracy, outperforming conventional CNN approaches (76.0%). Among pre-trained models, Xception with preprocessing achieved 79.8% accuracy.Conclusion: Hybrid CNN-machine learning approaches demonstrated superior performance for dental condition classification compared to end-to-end deep learning models. However, clinical implementation requires addressing the dataset limitation of lacking normal/healthy cases and conducting prospective validation studies across diverse clinical populations to establish real-world effectiveness and safety.","PeriodicalId":520733,"journal":{"name":"Oral and maxillofacial surgery","volume":"29 1","pages":"166"},"PeriodicalIF":1.8000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Oral and maxillofacial surgery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10006-025-01463-y","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Objective: This study aimed to develop and evaluate automated deep learning models for multi-class classification of dental conditions in panoramic X-ray images, comparing the effectiveness of custom CNN architectures with attention mechanisms, pre-trained models, and hybrid approaches.

Methodology: A dataset of 1,512 panoramic dental X-rays was preprocessed through segmentation, creating 4,764 class-balanced images across four categories: Fillings, Cavity, Implant, and Impacted Tooth. Data augmentation and preprocessing techniques including brightness adjustment, CLAHE enhancement, and normalization were applied. Multiple architectures were evaluated: custom CNN with attention mechanism, pre-trained models (VGG16, ResNet50, Xception) with attention integration, and hybrid CNN-machine learning approaches (CNN + SVM, CNN + Random Forest, CNN + Decision Tree). Performance was assessed using 5-fold cross-validation with accuracy, precision, recall, F1-score, and ROC-AUC metrics.

Results: The hybrid CNN + Random Forest model with preprocessing achieved the highest performance: 90.6% accuracy, 0.987 ROC-AUC, and 0.906 F1-score. Preprocessing consistently improved performance across all architectures, with accuracy gains ranging from 6.3% (VGG16) to 19.4% (ResNet50). The custom CNN with attention mechanism reached 86.0% accuracy, outperforming conventional CNN approaches (76.0%). Among pre-trained models, Xception with preprocessing achieved 79.8% accuracy.

Conclusion: Hybrid CNN-machine learning approaches demonstrated superior performance for dental condition classification compared to end-to-end deep learning models. However, clinical implementation requires addressing the dataset limitation of lacking normal/healthy cases and conducting prospective validation studies across diverse clinical populations to establish real-world effectiveness and safety.

查看原文本刊更多论文

利用注意力增强深度学习从全景x射线中多标签诊断牙齿状况。

目的：本研究旨在开发和评估用于全景x射线图像中牙齿状况多类别分类的自动深度学习模型，比较自定义CNN架构与注意机制、预训练模型和混合方法的有效性。方法：通过分割对1512张全景牙科x射线数据集进行预处理，创建4,764张类别平衡的图像，分为四类：填充物，腔体，种植体和阻生牙。数据增强和预处理技术包括亮度调整、CLAHE增强和归一化。评估了多种架构：具有注意力机制的自定义CNN，具有注意力集成的预训练模型（VGG16, ResNet50, Xception），以及CNN-机器混合学习方法（CNN + SVM， CNN +随机森林，CNN +决策树）。使用准确度、精密度、召回率、f1评分和ROC-AUC指标的5倍交叉验证来评估性能。结果：预处理后的CNN + Random Forest混合模型的准确率为90.6%，ROC-AUC为0.987，F1-score为0.906。预处理在所有架构中都能持续提高性能，精度提高范围从6.3% （VGG16）到19.4% （ResNet50）。带有注意机制的自定义CNN准确率达到86.0%，优于传统CNN方法（76.0%）。在预训练模型中，经过预处理的exception准确率达到79.8%。结论：与端到端深度学习模型相比，混合cnn -机器学习方法在牙齿状况分类方面表现出更好的性能。然而，临床实施需要解决缺乏正常/健康病例的数据集限制，并在不同的临床人群中进行前瞻性验证研究，以建立现实世界的有效性和安全性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Oral and maxillofacial surgery

自引率

0.00%

发文量