ClickSAM: Fine-tuning Segment Anything Model using click prompts for ultrasound image segmentation.

Aimee Guo, Grace Fei, Hemanth Pasupuleti, Jing Wang
{"title":"ClickSAM: Fine-tuning Segment Anything Model using click prompts for ultrasound image segmentation.","authors":"Aimee Guo, Grace Fei, Hemanth Pasupuleti, Jing Wang","doi":"10.1117/12.3005879","DOIUrl":null,"url":null,"abstract":"<p><p>The newly released Segment Anything Model (SAM) is a popular tool used in image processing due to its superior segmentation accuracy, variety of input prompts, training capabilities, and efficient model design. However, its current model is trained on a diverse dataset not tailored to medical images, particularly ultrasound images. Ultrasound images tend to have a lot of noise, making it difficult to segment out important structures. In this project, we developed <i>ClickSAM</i>, which fine-tunes the Segment Anything Model using click prompts for ultrasound images. ClickSAM has two stages of training: the first stage is trained on single-click prompts centered in the ground-truth contours, and the second stage focuses on improving the model performance through additional positive and negative click prompts. By comparing the first stage's predictions to the ground-truth masks, true positive, false positive, and false negative segments are calculated. Positive clicks are generated using the true positive and false negative segments, and negative clicks are generated using the false positive segments. The Centroidal Voronoi Tessellation algorithm is then employed to collect positive and negative click prompts in each segment that are used to enhance the model performance during the second stage of training. With click-train methods, ClickSAM exhibits superior performance compared to other existing models for ultrasound image segmentation.</p>","PeriodicalId":74505,"journal":{"name":"Proceedings of SPIE--the International Society for Optical Engineering","volume":"12932 ","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11141328/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of SPIE--the International Society for Optical Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.3005879","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/4/1 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

The newly released Segment Anything Model (SAM) is a popular tool in image processing due to its superior segmentation accuracy, variety of input prompts, training capabilities, and efficient model design. However, the current model is trained on a diverse dataset not tailored to medical images, particularly ultrasound images. Ultrasound images tend to be noisy, making it difficult to segment out important structures. In this project, we developed ClickSAM, which fine-tunes the Segment Anything Model using click prompts for ultrasound images. ClickSAM has two stages of training: the first stage is trained on single-click prompts centered within the ground-truth contours, and the second stage focuses on improving model performance through additional positive and negative click prompts. By comparing the first stage's predictions to the ground-truth masks, true-positive, false-positive, and false-negative segments are calculated. Positive clicks are generated from the true-positive and false-negative segments, and negative clicks are generated from the false-positive segments. The Centroidal Voronoi Tessellation algorithm is then employed to place positive and negative click prompts in each segment, which are used to enhance model performance during the second stage of training. With these click-based training methods, ClickSAM exhibits superior performance compared to other existing models for ultrasound image segmentation.
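
The second-stage click-generation procedure described above can be sketched in code. The following Python snippet is a minimal illustration, not the authors' implementation: it assumes binary NumPy masks, approximates the Centroidal Voronoi Tessellation step with k-means over pixel coordinates (Lloyd's algorithm converges to CVT generators), and uses hypothetical function and parameter names (segment_maps, cvt_clicks, n_pos, n_neg).

```python
import numpy as np
from sklearn.cluster import KMeans  # k-means as a practical stand-in for CVT (Lloyd's algorithm)


def segment_maps(pred_mask: np.ndarray, gt_mask: np.ndarray):
    """Compare a binary prediction to the ground truth and return the
    true-positive, false-negative, and false-positive pixel maps."""
    pred, gt = pred_mask.astype(bool), gt_mask.astype(bool)
    tp = pred & gt    # correctly segmented foreground
    fn = ~pred & gt   # missed foreground   -> source of positive clicks
    fp = pred & ~gt   # spurious foreground -> source of negative clicks
    return tp, fn, fp


def cvt_clicks(region: np.ndarray, n_clicks: int) -> np.ndarray:
    """Spread n_clicks points over a binary region by clustering its pixel
    coordinates; the k-means centroids approximate CVT generators.
    (A fuller implementation would snap centroids of non-convex regions
    to the nearest in-region pixel.)"""
    ys, xs = np.nonzero(region)
    if ys.size == 0:
        return np.empty((0, 2), dtype=int)
    coords = np.stack([ys, xs], axis=1).astype(float)
    k = min(n_clicks, coords.shape[0])
    centers = KMeans(n_clusters=k, n_init=10).fit(coords).cluster_centers_
    return np.round(centers).astype(int)  # (row, col) click locations


def generate_click_prompts(pred_mask, gt_mask, n_pos=3, n_neg=3):
    """Positive clicks come from the TP and FN segments, negative clicks
    from the FP segments, mirroring the scheme described in the abstract."""
    tp, fn, fp = segment_maps(pred_mask, gt_mask)
    positive = np.concatenate([cvt_clicks(tp, n_pos), cvt_clicks(fn, n_pos)], axis=0)
    negative = cvt_clicks(fp, n_neg)
    return positive, negative
```

The resulting coordinates would then be supplied to SAM as labeled point prompts (positive and negative) during the second stage of training.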
