R Zhang, L Jin, Q M Chen, T T Ding, Q Y Zhang, Y W Chen, X Tian, Y Y Cao, X Y Chen, F D Zhu
{"title":"[Scale-invariant feature-enhanced deep learning framework for oral mucosal lesion segmentation].","authors":"R Zhang, L Jin, Q M Chen, T T Ding, Q Y Zhang, Y W Chen, X Tian, Y Y Cao, X Y Chen, F D Zhu","doi":"10.3760/cma.j.cn112144-20241210-00467","DOIUrl":null,"url":null,"abstract":"<p><p><b>Objective:</b> To develop PixelSIFT-UNet, a novel semantic segmentation model that integrates deep learning with scale-invariant feature transform (SIFT) algorithm to improve the segmentation accuracy of oral mucosal lesions. <b>Methods:</b> This investigation utilized 838 standard clinical white light images of oral mucosal diseases acquired from January 2020 to December 2022 at the Stomatology Hospital Zhejiang University School of Medicine. Randomization was achieved through Python's random.seed function implementation. The random sample function was subsequently applied for sampling distribution. The dataset was stratified into three subsets with a 6∶2∶2 ratio: training (<i>n</i>=506), validation (<i>n</i>=166), and testing (<i>n</i>=166). Lesion boundaries were annotated using Labelme software, and a PixelSIFT-UNet-based deep learning model was developed with VGG-16 and ResNet-50 backbone networks. Model parameters were optimized using the validation set, and performance metrics [including Dice coefficient, mean intersection over union (mIoU), mean pixel accuracy (mPA), and Precision] were assessed on the test set. The model's performance was benchmarked against conventional semantic segmentation frameworks (U-Net and PSPNet). <b>Results:</b> The developed PixelSIFT-UNet model could achieve precise segmentation of three common oral mucosal lesions: oral lichen planus, oral leukoplakia, and oral submucous fibrosis. Utilizing VGG-16 as the backbone network, the model achieved Dice coefficient, mIoU, mPA, and Precision values of 0.642, 0.699, 0.836, and 0.792, respectively. Implementation with ResNet-50 backbone network yielded metrics of 0.668, 0.733, 0.872 and 0.817, demonstrating significant improvements across all performance indicators compared to conventional U-Net model (relevant metrics: 0.662, 0.717, 0.861 and 0.809) and PSPNet model (relevant metrics: 0.671, 0.721, 0.858 and 0.813). <b>Conclusions:</b> The proposed PixelSIFT-UNet architecture demonstrates superior performance in oral mucosal lesion segmentation tasks, surpassing conventional semantic segmentation models and providing robust quantitative improvements in segmentation accuracy.</p>","PeriodicalId":23965,"journal":{"name":"中华口腔医学杂志","volume":"60 3","pages":"239-247"},"PeriodicalIF":0.0000,"publicationDate":"2025-03-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"中华口腔医学杂志","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3760/cma.j.cn112144-20241210-00467","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: To develop PixelSIFT-UNet, a novel semantic segmentation model that integrates deep learning with scale-invariant feature transform (SIFT) algorithm to improve the segmentation accuracy of oral mucosal lesions. Methods: This investigation utilized 838 standard clinical white light images of oral mucosal diseases acquired from January 2020 to December 2022 at the Stomatology Hospital Zhejiang University School of Medicine. Randomization was achieved through Python's random.seed function implementation. The random sample function was subsequently applied for sampling distribution. The dataset was stratified into three subsets with a 6∶2∶2 ratio: training (n=506), validation (n=166), and testing (n=166). Lesion boundaries were annotated using Labelme software, and a PixelSIFT-UNet-based deep learning model was developed with VGG-16 and ResNet-50 backbone networks. Model parameters were optimized using the validation set, and performance metrics [including Dice coefficient, mean intersection over union (mIoU), mean pixel accuracy (mPA), and Precision] were assessed on the test set. The model's performance was benchmarked against conventional semantic segmentation frameworks (U-Net and PSPNet). Results: The developed PixelSIFT-UNet model could achieve precise segmentation of three common oral mucosal lesions: oral lichen planus, oral leukoplakia, and oral submucous fibrosis. Utilizing VGG-16 as the backbone network, the model achieved Dice coefficient, mIoU, mPA, and Precision values of 0.642, 0.699, 0.836, and 0.792, respectively. Implementation with ResNet-50 backbone network yielded metrics of 0.668, 0.733, 0.872 and 0.817, demonstrating significant improvements across all performance indicators compared to conventional U-Net model (relevant metrics: 0.662, 0.717, 0.861 and 0.809) and PSPNet model (relevant metrics: 0.671, 0.721, 0.858 and 0.813). Conclusions: The proposed PixelSIFT-UNet architecture demonstrates superior performance in oral mucosal lesion segmentation tasks, surpassing conventional semantic segmentation models and providing robust quantitative improvements in segmentation accuracy.
期刊介绍:
Founded in August 1953, Chinese Journal of Stomatology is a monthly academic journal of stomatology published publicly at home and abroad, sponsored by the Chinese Medical Association and co-sponsored by the Chinese Stomatology Association. It mainly reports the leading scientific research results and clinical diagnosis and treatment experience in the field of oral medicine, as well as the basic theoretical research that has a guiding role in oral clinical practice and is closely combined with oral clinical practice.
Chinese Journal of Over the years, Stomatology has been published in Medline, Scopus database, Toxicology Abstracts Database, Chemical Abstracts Database, American Cancer database, Russian Abstracts database, China Core Journal of Science and Technology, Peking University Core Journal, CSCD and other more than 20 important journals at home and abroad Physical medicine database and retrieval system included.