评估视觉变压器和卷积神经网络在牙科图像处理的背景下:一个系统的回顾。

IF 3.1 2区 医学 Q1 DENTISTRY, ORAL SURGERY & MEDICINE
Turgut Felek, Hümeyra Tercanlı, Rümeysa Şendişçi Gök
{"title":"评估视觉变压器和卷积神经网络在牙科图像处理的背景下:一个系统的回顾。","authors":"Turgut Felek, Hümeyra Tercanlı, Rümeysa Şendişçi Gök","doi":"10.1186/s12903-025-07036-5","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The aim of this systematic review is to compare the efficacy of convolutional neural networks (CNN) and Vision Transformers (ViT) in the field of dental imaging, in order to examine in depth the potential, advantages, and limitations of both models in this domain.</p><p><strong>Methods: </strong>The search strings used in the study were \"((\"Vision Transformer\" OR ViT OR \"Transformer architecture\") AND (\"Convolutional Neural Network\" OR CNN OR ConvNet) AND (Dental OR Dentistry OR \"Maxillofacial\" OR \"Oral Radiology\") AND (Image OR Imaging OR Radiograph))\". The search was conducted in January 2025. Two investigators independently evaluated the full texts of all eligible articles and excluded those that did not meet the inclusion/exclusion criteria.</p><p><strong>Results: </strong>Of 2596 articles, 21 met the inclusion criteria. Depending on the task category, of the 21 studies that were reviewed, 14 (66.7%) utilized classification, while 7 (33.3%) utilized segmentation. Panoramic radiography is the most commonly used imaging modality (52.3%) and the ViT-based model was observed to have the highest performance (58%).</p><p><strong>Conclusion: </strong>ViT-based deep learning models tend to exhibit higher performance in many dental image analysis scenarios compared to traditional convolutional neural networks. However, in practice CNN and ViT approaches can be used in a complementary manner.</p>","PeriodicalId":9072,"journal":{"name":"BMC Oral Health","volume":"25 1","pages":"1626"},"PeriodicalIF":3.1000,"publicationDate":"2025-10-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Evaluating vision transformers and convolutional neural networks in the context of dental image processing: a systematic review.\",\"authors\":\"Turgut Felek, Hümeyra Tercanlı, Rümeysa Şendişçi Gök\",\"doi\":\"10.1186/s12903-025-07036-5\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The aim of this systematic review is to compare the efficacy of convolutional neural networks (CNN) and Vision Transformers (ViT) in the field of dental imaging, in order to examine in depth the potential, advantages, and limitations of both models in this domain.</p><p><strong>Methods: </strong>The search strings used in the study were \\\"((\\\"Vision Transformer\\\" OR ViT OR \\\"Transformer architecture\\\") AND (\\\"Convolutional Neural Network\\\" OR CNN OR ConvNet) AND (Dental OR Dentistry OR \\\"Maxillofacial\\\" OR \\\"Oral Radiology\\\") AND (Image OR Imaging OR Radiograph))\\\". The search was conducted in January 2025. Two investigators independently evaluated the full texts of all eligible articles and excluded those that did not meet the inclusion/exclusion criteria.</p><p><strong>Results: </strong>Of 2596 articles, 21 met the inclusion criteria. Depending on the task category, of the 21 studies that were reviewed, 14 (66.7%) utilized classification, while 7 (33.3%) utilized segmentation. Panoramic radiography is the most commonly used imaging modality (52.3%) and the ViT-based model was observed to have the highest performance (58%).</p><p><strong>Conclusion: </strong>ViT-based deep learning models tend to exhibit higher performance in many dental image analysis scenarios compared to traditional convolutional neural networks. However, in practice CNN and ViT approaches can be used in a complementary manner.</p>\",\"PeriodicalId\":9072,\"journal\":{\"name\":\"BMC Oral Health\",\"volume\":\"25 1\",\"pages\":\"1626\"},\"PeriodicalIF\":3.1000,\"publicationDate\":\"2025-10-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"BMC Oral Health\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1186/s12903-025-07036-5\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"DENTISTRY, ORAL SURGERY & MEDICINE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Oral Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12903-025-07036-5","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0

摘要

背景:本系统综述的目的是比较卷积神经网络(CNN)和视觉变换(ViT)在牙科成像领域的效果,以深入研究这两种模型在该领域的潜力、优势和局限性。方法:研究中使用的搜索字符串为“视觉变压器”或“ViT”或“变压器架构”和“卷积神经网络”或“CNN”或“ConvNet”和(牙科或牙科或“颌面”或“口腔放射学”)和(图像或成像或放射线))”。搜寻工作于2025年1月进行。两位研究者独立评估了所有符合条件的文章的全文,并排除了那些不符合纳入/排除标准的文章。结果:2596篇文献中,21篇符合纳入标准。根据任务类别的不同,在我们回顾的21个研究中,14个(66.7%)使用了分类,而7个(33.3%)使用了分割。全景x线摄影是最常用的成像方式(52.3%),而基于vit的模型表现最佳(58%)。结论:与传统卷积神经网络相比,基于vit的深度学习模型在许多牙科图像分析场景中表现出更高的性能。然而,在实践中,CNN和ViT方法可以互补使用。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Evaluating vision transformers and convolutional neural networks in the context of dental image processing: a systematic review.

Background: The aim of this systematic review is to compare the efficacy of convolutional neural networks (CNN) and Vision Transformers (ViT) in the field of dental imaging, in order to examine in depth the potential, advantages, and limitations of both models in this domain.

Methods: The search strings used in the study were "(("Vision Transformer" OR ViT OR "Transformer architecture") AND ("Convolutional Neural Network" OR CNN OR ConvNet) AND (Dental OR Dentistry OR "Maxillofacial" OR "Oral Radiology") AND (Image OR Imaging OR Radiograph))". The search was conducted in January 2025. Two investigators independently evaluated the full texts of all eligible articles and excluded those that did not meet the inclusion/exclusion criteria.

Results: Of 2596 articles, 21 met the inclusion criteria. Depending on the task category, of the 21 studies that were reviewed, 14 (66.7%) utilized classification, while 7 (33.3%) utilized segmentation. Panoramic radiography is the most commonly used imaging modality (52.3%) and the ViT-based model was observed to have the highest performance (58%).

Conclusion: ViT-based deep learning models tend to exhibit higher performance in many dental image analysis scenarios compared to traditional convolutional neural networks. However, in practice CNN and ViT approaches can be used in a complementary manner.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
BMC Oral Health
BMC Oral Health DENTISTRY, ORAL SURGERY & MEDICINE-
CiteScore
3.90
自引率
6.90%
发文量
481
审稿时长
6-12 weeks
期刊介绍: BMC Oral Health is an open access, peer-reviewed journal that considers articles on all aspects of the prevention, diagnosis and management of disorders of the mouth, teeth and gums, as well as related molecular genetics, pathophysiology, and epidemiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信