Multi-label diagnosis of dental conditions from panoramic x-rays using attention-enhanced deep learning.

IF 1.8
Zahra Raeisi, Shayan Rokhva, Fatemeh Rahmani, Ali Goodarzi, Hossein Najafzadeh
{"title":"Multi-label diagnosis of dental conditions from panoramic x-rays using attention-enhanced deep learning.","authors":"Zahra Raeisi, Shayan Rokhva, Fatemeh Rahmani, Ali Goodarzi, Hossein Najafzadeh","doi":"10.1007/s10006-025-01463-y","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>This study aimed to develop and evaluate automated deep learning models for multi-class classification of dental conditions in panoramic X-ray images, comparing the effectiveness of custom CNN architectures with attention mechanisms, pre-trained models, and hybrid approaches.</p><p><strong>Methodology: </strong>A dataset of 1,512 panoramic dental X-rays was preprocessed through segmentation, creating 4,764 class-balanced images across four categories: Fillings, Cavity, Implant, and Impacted Tooth. Data augmentation and preprocessing techniques including brightness adjustment, CLAHE enhancement, and normalization were applied. Multiple architectures were evaluated: custom CNN with attention mechanism, pre-trained models (VGG16, ResNet50, Xception) with attention integration, and hybrid CNN-machine learning approaches (CNN + SVM, CNN + Random Forest, CNN + Decision Tree). Performance was assessed using 5-fold cross-validation with accuracy, precision, recall, F1-score, and ROC-AUC metrics.</p><p><strong>Results: </strong>The hybrid CNN + Random Forest model with preprocessing achieved the highest performance: 90.6% accuracy, 0.987 ROC-AUC, and 0.906 F1-score. Preprocessing consistently improved performance across all architectures, with accuracy gains ranging from 6.3% (VGG16) to 19.4% (ResNet50). The custom CNN with attention mechanism reached 86.0% accuracy, outperforming conventional CNN approaches (76.0%). Among pre-trained models, Xception with preprocessing achieved 79.8% accuracy.</p><p><strong>Conclusion: </strong>Hybrid CNN-machine learning approaches demonstrated superior performance for dental condition classification compared to end-to-end deep learning models. However, clinical implementation requires addressing the dataset limitation of lacking normal/healthy cases and conducting prospective validation studies across diverse clinical populations to establish real-world effectiveness and safety.</p>","PeriodicalId":520733,"journal":{"name":"Oral and maxillofacial surgery","volume":"29 1","pages":"166"},"PeriodicalIF":1.8000,"publicationDate":"2025-10-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Oral and maxillofacial surgery","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s10006-025-01463-y","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: This study aimed to develop and evaluate automated deep learning models for multi-class classification of dental conditions in panoramic X-ray images, comparing the effectiveness of custom CNN architectures with attention mechanisms, pre-trained models, and hybrid approaches.

Methodology: A dataset of 1,512 panoramic dental X-rays was preprocessed through segmentation, creating 4,764 class-balanced images across four categories: Fillings, Cavity, Implant, and Impacted Tooth. Data augmentation and preprocessing techniques including brightness adjustment, CLAHE enhancement, and normalization were applied. Multiple architectures were evaluated: custom CNN with attention mechanism, pre-trained models (VGG16, ResNet50, Xception) with attention integration, and hybrid CNN-machine learning approaches (CNN + SVM, CNN + Random Forest, CNN + Decision Tree). Performance was assessed using 5-fold cross-validation with accuracy, precision, recall, F1-score, and ROC-AUC metrics.

Results: The hybrid CNN + Random Forest model with preprocessing achieved the highest performance: 90.6% accuracy, 0.987 ROC-AUC, and 0.906 F1-score. Preprocessing consistently improved performance across all architectures, with accuracy gains ranging from 6.3% (VGG16) to 19.4% (ResNet50). The custom CNN with attention mechanism reached 86.0% accuracy, outperforming conventional CNN approaches (76.0%). Among pre-trained models, Xception with preprocessing achieved 79.8% accuracy.

Conclusion: Hybrid CNN-machine learning approaches demonstrated superior performance for dental condition classification compared to end-to-end deep learning models. However, clinical implementation requires addressing the dataset limitation of lacking normal/healthy cases and conducting prospective validation studies across diverse clinical populations to establish real-world effectiveness and safety.

利用注意力增强深度学习从全景x射线中多标签诊断牙齿状况。
目的:本研究旨在开发和评估用于全景x射线图像中牙齿状况多类别分类的自动深度学习模型,比较自定义CNN架构与注意机制、预训练模型和混合方法的有效性。方法:通过分割对1512张全景牙科x射线数据集进行预处理,创建4,764张类别平衡的图像,分为四类:填充物,腔体,种植体和阻生牙。数据增强和预处理技术包括亮度调整、CLAHE增强和归一化。评估了多种架构:具有注意力机制的自定义CNN,具有注意力集成的预训练模型(VGG16, ResNet50, Xception),以及CNN-机器混合学习方法(CNN + SVM, CNN +随机森林,CNN +决策树)。使用准确度、精密度、召回率、f1评分和ROC-AUC指标的5倍交叉验证来评估性能。结果:预处理后的CNN + Random Forest混合模型的准确率为90.6%,ROC-AUC为0.987,F1-score为0.906。预处理在所有架构中都能持续提高性能,精度提高范围从6.3% (VGG16)到19.4% (ResNet50)。带有注意机制的自定义CNN准确率达到86.0%,优于传统CNN方法(76.0%)。在预训练模型中,经过预处理的exception准确率达到79.8%。结论:与端到端深度学习模型相比,混合cnn -机器学习方法在牙齿状况分类方面表现出更好的性能。然而,临床实施需要解决缺乏正常/健康病例的数据集限制,并在不同的临床人群中进行前瞻性验证研究,以建立现实世界的有效性和安全性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信