Leveraging Pretrained Transformers for Efficient Segmentation and Lesion Detection in Cone-Beam Computed Tomography Scans.

IF 3.5 2区 医学 Q1 DENTISTRY, ORAL SURGERY & MEDICINE
Rui Qi Chen, Yeonju Lee, Hao Yan, Muralidhar Mupparapu, Fleming Lure, Jing Li, Frank C Setzer
{"title":"Leveraging Pretrained Transformers for Efficient Segmentation and Lesion Detection in Cone-Beam Computed Tomography Scans.","authors":"Rui Qi Chen, Yeonju Lee, Hao Yan, Muralidhar Mupparapu, Fleming Lure, Jing Li, Frank C Setzer","doi":"10.1016/j.joen.2024.07.012","DOIUrl":null,"url":null,"abstract":"<p><strong>Introduction: </strong>Cone-beam computed tomography (CBCT) is widely used to detect jaw lesions, although CBCT interpretation is time-consuming and challenging. Artificial intelligence for CBCT segmentation may improve lesion detection accuracy. However, consistent automated lesion detection remains difficult, especially with limited training data. This study aimed to assess the applicability of pretrained transformer-based architectures for semantic segmentation of CBCT volumes when applied to periapical lesion detection.</p><p><strong>Methods: </strong>CBCT volumes (n = 138) were collected and annotated by expert clinicians using 5 labels - \"lesion,\" \"restorative material,\" \"bone,\" \"tooth structure,\" and \"background.\" U-Net (convolutional neural network-based) and Swin-UNETR (transformer-based) models, pretrained (Swin-UNETR-PRETRAIN), and from scratch (Swin-UNETR-SCRATCH), were trained with subsets of the annotated CBCTs. These models were then evaluated for semantic segmentation performance using the Sørensen-Dice coefficient (DICE), lesion detection performance using sensitivity and specificity, and training sample size requirements by comparing models trained with 20, 40, 60, or 103 samples.</p><p><strong>Results: </strong>Trained with 103 samples, Swin-UNETR-PRETRAIN achieved a DICE of 0.8512 for \"lesion,\" 0.8282 for \"restorative materials,\" 0.9178 for \"bone,\" 0.9029 for \"tooth structure,\" and 0.9901 for \"background.\" \"Lesion\" DICE was statistically similar between Swin-UNETR-PRETRAIN trained with 103 and 60 images (P > .05), with the latter achieving 1.00 sensitivity and 0.94 specificity in lesion detection. With small training sets, Swin-UNETR-PRETRAIN outperformed Swin-UNETR-SCRATCH in DICE over all labels (P < .001 [n = 20], P < .001 [n = 40]), and U-Net in lesion detection specificity (P = .006 [n = 20], P = .031 [n = 40]).</p><p><strong>Conclusions: </strong>Transformer-based Swin-UNETR architectures allowed for excellent semantic segmentation and periapical lesion detection. Pretrained, it may provide an alternative with smaller training datasets compared to classic U-Net architectures.</p>","PeriodicalId":15703,"journal":{"name":"Journal of endodontics","volume":null,"pages":null},"PeriodicalIF":3.5000,"publicationDate":"2024-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of endodontics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.joen.2024.07.012","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"DENTISTRY, ORAL SURGERY & MEDICINE","Score":null,"Total":0}
引用次数: 0

Abstract

Introduction: Cone-beam computed tomography (CBCT) is widely used to detect jaw lesions, although CBCT interpretation is time-consuming and challenging. Artificial intelligence for CBCT segmentation may improve lesion detection accuracy. However, consistent automated lesion detection remains difficult, especially with limited training data. This study aimed to assess the applicability of pretrained transformer-based architectures for semantic segmentation of CBCT volumes when applied to periapical lesion detection.

Methods: CBCT volumes (n = 138) were collected and annotated by expert clinicians using 5 labels - "lesion," "restorative material," "bone," "tooth structure," and "background." U-Net (convolutional neural network-based) and Swin-UNETR (transformer-based) models, pretrained (Swin-UNETR-PRETRAIN), and from scratch (Swin-UNETR-SCRATCH), were trained with subsets of the annotated CBCTs. These models were then evaluated for semantic segmentation performance using the Sørensen-Dice coefficient (DICE), lesion detection performance using sensitivity and specificity, and training sample size requirements by comparing models trained with 20, 40, 60, or 103 samples.

Results: Trained with 103 samples, Swin-UNETR-PRETRAIN achieved a DICE of 0.8512 for "lesion," 0.8282 for "restorative materials," 0.9178 for "bone," 0.9029 for "tooth structure," and 0.9901 for "background." "Lesion" DICE was statistically similar between Swin-UNETR-PRETRAIN trained with 103 and 60 images (P > .05), with the latter achieving 1.00 sensitivity and 0.94 specificity in lesion detection. With small training sets, Swin-UNETR-PRETRAIN outperformed Swin-UNETR-SCRATCH in DICE over all labels (P < .001 [n = 20], P < .001 [n = 40]), and U-Net in lesion detection specificity (P = .006 [n = 20], P = .031 [n = 40]).

Conclusions: Transformer-based Swin-UNETR architectures allowed for excellent semantic segmentation and periapical lesion detection. Pretrained, it may provide an alternative with smaller training datasets compared to classic U-Net architectures.

利用预训练变压器在锥形束 CT 扫描中进行高效分割和病变检测
简介锥形束计算机断层扫描(CBCT)被广泛用于检测颌骨病变,但 CBCT 的判读耗时且具有挑战性。用于 CBCT 分段的人工智能(AI)可提高病变检测的准确性。然而,一致的自动病变检测仍然很困难,尤其是在训练数据有限的情况下。本研究旨在评估基于变压器的预训练架构在应用于根尖周病变检测时对 CBCT 图像进行语义分割的适用性:方法:收集 CBCT 图像(n=138),由临床专家使用 "病变"、"修复材料"、"骨"、"牙齿结构 "和 "背景 "五个标签进行标注。使用注释 CBCT 的子集对 U-Net(基于卷积神经网络 (CNN))和 Swin-UNETR(基于转换器)模型进行了预训练(Swin-UNETR-PRETRAIN)和从头开始训练(Swin-UNETR-SCRATCH)。然后使用索伦森-戴斯系数(DICE)对这些模型的语义分割性能进行评估,使用灵敏度和特异性对病变检测性能进行评估,并通过比较使用 20、40、60 或 103 个样本训练的模型,对训练样本的大小进行评估:使用 103 个样本进行训练后,Swin-UNETR-PRETRAIN 的 "病变 "DICE 为 0.8512,"修复材料 "DICE 为 0.8282,"骨骼 "DICE 为 0.9178,"牙齿结构 "DICE 为 0.9029,"背景 "DICE 为 0.9901。用 103 张图像和 60 张图像训练的 Swin-UNETR-PRETRAIN 的 "病变 "DICE 在统计学上相似(P>.05),后者在病变检测方面的灵敏度为 1.00,特异度为 0.94。在使用小型训练集的情况下,Swin-UNETR-PRETRAIN 在所有标签的 DICE 中的表现优于 Swin-UNETR-SCRATCH(PConclusions:基于变换器的 Swin-UNETR 架构可实现出色的语义分割和根尖周病变检测。与传统的 U-Net 架构相比,经过预先训练的 Swin-UNETR-SCRATCH 可为较小的训练数据集提供替代方案。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Journal of endodontics
Journal of endodontics 医学-牙科与口腔外科
CiteScore
8.80
自引率
9.50%
发文量
224
审稿时长
42 days
期刊介绍: The Journal of Endodontics, the official journal of the American Association of Endodontists, publishes scientific articles, case reports and comparison studies evaluating materials and methods of pulp conservation and endodontic treatment. Endodontists and general dentists can learn about new concepts in root canal treatment and the latest advances in techniques and instrumentation in the one journal that helps them keep pace with rapid changes in this field.
文献相关原料
公司名称 产品信息 采购帮参考价格
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信