[Scale-invariant feature-enhanced deep learning framework for oral mucosal lesion segmentation].

Q4 Medicine
R Zhang, L Jin, Q M Chen, T T Ding, Q Y Zhang, Y W Chen, X Tian, Y Y Cao, X Y Chen, F D Zhu
{"title":"[Scale-invariant feature-enhanced deep learning framework for oral mucosal lesion segmentation].","authors":"R Zhang, L Jin, Q M Chen, T T Ding, Q Y Zhang, Y W Chen, X Tian, Y Y Cao, X Y Chen, F D Zhu","doi":"10.3760/cma.j.cn112144-20241210-00467","DOIUrl":null,"url":null,"abstract":"<p><p><b>Objective:</b> To develop PixelSIFT-UNet, a novel semantic segmentation model that integrates deep learning with scale-invariant feature transform (SIFT) algorithm to improve the segmentation accuracy of oral mucosal lesions. <b>Methods:</b> This investigation utilized 838 standard clinical white light images of oral mucosal diseases acquired from January 2020 to December 2022 at the Stomatology Hospital Zhejiang University School of Medicine. Randomization was achieved through Python's random.seed function implementation. The random sample function was subsequently applied for sampling distribution. The dataset was stratified into three subsets with a 6∶2∶2 ratio: training (<i>n</i>=506), validation (<i>n</i>=166), and testing (<i>n</i>=166). Lesion boundaries were annotated using Labelme software, and a PixelSIFT-UNet-based deep learning model was developed with VGG-16 and ResNet-50 backbone networks. Model parameters were optimized using the validation set, and performance metrics [including Dice coefficient, mean intersection over union (mIoU), mean pixel accuracy (mPA), and Precision] were assessed on the test set. The model's performance was benchmarked against conventional semantic segmentation frameworks (U-Net and PSPNet). <b>Results:</b> The developed PixelSIFT-UNet model could achieve precise segmentation of three common oral mucosal lesions: oral lichen planus, oral leukoplakia, and oral submucous fibrosis. Utilizing VGG-16 as the backbone network, the model achieved Dice coefficient, mIoU, mPA, and Precision values of 0.642, 0.699, 0.836, and 0.792, respectively. Implementation with ResNet-50 backbone network yielded metrics of 0.668, 0.733, 0.872 and 0.817, demonstrating significant improvements across all performance indicators compared to conventional U-Net model (relevant metrics: 0.662, 0.717, 0.861 and 0.809) and PSPNet model (relevant metrics: 0.671, 0.721, 0.858 and 0.813). <b>Conclusions:</b> The proposed PixelSIFT-UNet architecture demonstrates superior performance in oral mucosal lesion segmentation tasks, surpassing conventional semantic segmentation models and providing robust quantitative improvements in segmentation accuracy.</p>","PeriodicalId":23965,"journal":{"name":"中华口腔医学杂志","volume":"60 3","pages":"239-247"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"中华口腔医学杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3760/cma.j.cn112144-20241210-00467","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: To develop PixelSIFT-UNet, a novel semantic segmentation model that integrates deep learning with scale-invariant feature transform (SIFT) algorithm to improve the segmentation accuracy of oral mucosal lesions. Methods: This investigation utilized 838 standard clinical white light images of oral mucosal diseases acquired from January 2020 to December 2022 at the Stomatology Hospital Zhejiang University School of Medicine. Randomization was achieved through Python's random.seed function implementation. The random sample function was subsequently applied for sampling distribution. The dataset was stratified into three subsets with a 6∶2∶2 ratio: training (n=506), validation (n=166), and testing (n=166). Lesion boundaries were annotated using Labelme software, and a PixelSIFT-UNet-based deep learning model was developed with VGG-16 and ResNet-50 backbone networks. Model parameters were optimized using the validation set, and performance metrics [including Dice coefficient, mean intersection over union (mIoU), mean pixel accuracy (mPA), and Precision] were assessed on the test set. The model's performance was benchmarked against conventional semantic segmentation frameworks (U-Net and PSPNet). Results: The developed PixelSIFT-UNet model could achieve precise segmentation of three common oral mucosal lesions: oral lichen planus, oral leukoplakia, and oral submucous fibrosis. Utilizing VGG-16 as the backbone network, the model achieved Dice coefficient, mIoU, mPA, and Precision values of 0.642, 0.699, 0.836, and 0.792, respectively. Implementation with ResNet-50 backbone network yielded metrics of 0.668, 0.733, 0.872 and 0.817, demonstrating significant improvements across all performance indicators compared to conventional U-Net model (relevant metrics: 0.662, 0.717, 0.861 and 0.809) and PSPNet model (relevant metrics: 0.671, 0.721, 0.858 and 0.813). Conclusions: The proposed PixelSIFT-UNet architecture demonstrates superior performance in oral mucosal lesion segmentation tasks, surpassing conventional semantic segmentation models and providing robust quantitative improvements in segmentation accuracy.

目的:开发一种新型语义分割模型 PixelSIFT-UNet,该模型将深度学习与尺度不变特征变换(SIFT)算法相结合,以提高口腔黏膜病变的分割准确性。方法:本研究利用浙江大学医学院附属口腔医院于 2020 年 1 月至 2022 年 12 月期间获得的 838 张标准临床口腔黏膜疾病白光图像。随机化是通过 Python 的 random.seed 函数实现的。随后应用随机抽样函数进行抽样分布。数据集以 6∶2∶2 的比例分为三个子集:训练集(n=506)、验证集(n=166)和测试集(n=166)。使用 Labelme 软件注释病变边界,并使用 VGG-16 和 ResNet-50 骨干网络开发基于 PixelSIFT-UNet 的深度学习模型。使用验证集对模型参数进行了优化,并在测试集上评估了性能指标[包括骰子系数、平均交集大于联合(mIoU)、平均像素精度(mPA)和精度]。该模型的性能以传统语义分割框架(U-Net 和 PSPNet)为基准。结果所开发的 PixelSIFT-UNet 模型能精确分割三种常见的口腔黏膜病变:口腔扁平苔藓、口腔白斑病和口腔黏膜下纤维化。利用 VGG-16 作为骨干网络,该模型的 Dice coefficient、mIoU、mPA 和 Precision 值分别达到了 0.642、0.699、0.836 和 0.792。与传统的 U-Net 模型(相关指标:0.662、0.717、0.861 和 0.809)和 PSPNet 模型(相关指标:0.671、0.721、0.858 和 0.813)相比,使用 ResNet-50 骨干网络实现的指标分别为 0.668、0.733、0.872 和 0.817,在所有性能指标上都有显著提高。结论所提出的 PixelSIFT-UNet 架构在口腔黏膜病变分割任务中表现出卓越的性能,超越了传统的语义分割模型,并在分割准确性方面提供了稳健的定量改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
中华口腔医学杂志
中华口腔医学杂志 Medicine-Medicine (all)
CiteScore
0.90
自引率
0.00%
发文量
9692
期刊介绍: Founded in August 1953, Chinese Journal of Stomatology is a monthly academic journal of stomatology published publicly at home and abroad, sponsored by the Chinese Medical Association and co-sponsored by the Chinese Stomatology Association. It mainly reports the leading scientific research results and clinical diagnosis and treatment experience in the field of oral medicine, as well as the basic theoretical research that has a guiding role in oral clinical practice and is closely combined with oral clinical practice. Chinese Journal of Over the years, Stomatology has been published in Medline, Scopus database, Toxicology Abstracts Database, Chemical Abstracts Database, American Cancer database, Russian Abstracts database, China Core Journal of Science and Technology, Peking University Core Journal, CSCD and other more than 20 important journals at home and abroad Physical medicine database and retrieval system included.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信