Cross-domain subcortical brain structure segmentation algorithm based on low-rank adaptation fine-tuning SAM.

IF 2.9 3区医学 Q2 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

BMC Medical Imaging Pub Date : 2025-07-01 DOI:10.1186/s12880-025-01779-x

Yuan Sui, Qian Hu, Yujie Zhang

{"title":"Cross-domain subcortical brain structure segmentation algorithm based on low-rank adaptation fine-tuning SAM.","authors":"Yuan Sui, Qian Hu, Yujie Zhang","doi":"10.1186/s12880-025-01779-x","DOIUrl":null,"url":null,"abstract":"Purpose: Accurate and robust segmentation of anatomical structures in brain MRI provides a crucial basis for the subsequent observation, analysis, and treatment planning of various brain diseases. Deep learning foundation models trained and designed on large-scale natural scene image datasets experience significant performance degradation when applied to subcortical brain structure segmentation in MRI, limiting their direct applicability in clinical diagnosis.Methods: This paper proposes a subcortical brain structure segmentation algorithm based on Low-Rank Adaptation (LoRA) to fine-tune SAM (Segment Anything Model) by freezing SAM's image encoder and applying LoRA to approximate low-rank matrix updates to the encoder's training weights, while also fine-tuning SAM's lightweight prompt encoder and mask decoder.Results: The fine-tuned model's learnable parameters (5.92 MB) occupy only 6.39% of the original model's parameter size (92.61 MB). For training, model preheating is employed to stabilize the fine-tuning process. During inference, adaptive prompt learning with point or box prompts is introduced to enhance the model's accuracy for arbitrary brain MRI segmentation.Conclusion: This interactive prompt learning approach provides clinicians with a means of intelligent segmentation for deep brain structures, effectively addressing the challenges of limited data labels and high manual annotation costs in medical image segmentation. We use five MRI datasets of IBSR, MALC, LONI, LPBA, Hammers and CANDI for experiments across various segmentation scenarios, including cross-domain settings with inference samples from diverse MRI datasets and supervised fine-tuning settings, demonstrate the proposed segmentation algorithm's generalization and effectiveness when compared to current mainstream and supervised segmentation algorithms.","PeriodicalId":9020,"journal":{"name":"BMC Medical Imaging","volume":"25 1","pages":"248"},"PeriodicalIF":2.9000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"BMC Medical Imaging","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1186/s12880-025-01779-x","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

Abstract

Purpose: Accurate and robust segmentation of anatomical structures in brain MRI provides a crucial basis for the subsequent observation, analysis, and treatment planning of various brain diseases. Deep learning foundation models trained and designed on large-scale natural scene image datasets experience significant performance degradation when applied to subcortical brain structure segmentation in MRI, limiting their direct applicability in clinical diagnosis.

Methods: This paper proposes a subcortical brain structure segmentation algorithm based on Low-Rank Adaptation (LoRA) to fine-tune SAM (Segment Anything Model) by freezing SAM's image encoder and applying LoRA to approximate low-rank matrix updates to the encoder's training weights, while also fine-tuning SAM's lightweight prompt encoder and mask decoder.

Results: The fine-tuned model's learnable parameters (5.92 MB) occupy only 6.39% of the original model's parameter size (92.61 MB). For training, model preheating is employed to stabilize the fine-tuning process. During inference, adaptive prompt learning with point or box prompts is introduced to enhance the model's accuracy for arbitrary brain MRI segmentation.

Conclusion: This interactive prompt learning approach provides clinicians with a means of intelligent segmentation for deep brain structures, effectively addressing the challenges of limited data labels and high manual annotation costs in medical image segmentation. We use five MRI datasets of IBSR, MALC, LONI, LPBA, Hammers and CANDI for experiments across various segmentation scenarios, including cross-domain settings with inference samples from diverse MRI datasets and supervised fine-tuning settings, demonstrate the proposed segmentation algorithm's generalization and effectiveness when compared to current mainstream and supervised segmentation algorithms.

查看原文本刊更多论文

基于低秩自适应微调SAM的跨域皮质下脑结构分割算法。

目的：脑MRI对解剖结构进行准确、稳健的分割，为后续对各种脑部疾病的观察、分析和治疗规划提供重要依据。在大规模自然场景图像数据集上训练和设计的深度学习基础模型在MRI皮层下脑结构分割中表现出明显的性能下降，限制了其在临床诊断中的直接适用性。方法：提出一种基于低秩自适应（Low-Rank Adaptation, LoRA）的皮质下脑结构分割算法，通过冻结SAM的图像编码器，利用LoRA对编码器的训练权值进行低秩矩阵的近似更新，同时对SAM的轻量级提示编码器和掩码解码器进行微调，从而对SAM （Segment Anything Model）进行微调。结果：调整后模型的可学习参数（5.92 MB）仅占原始模型参数大小（92.61 MB）的6.39%。对于训练，采用模型预热来稳定微调过程。在推理过程中，引入点或框提示的自适应提示学习，以提高模型对任意脑MRI分割的准确性。结论：这种交互式提示学习方法为临床医生提供了一种对脑深部结构进行智能分割的手段，有效解决了医学图像分割中数据标签有限、人工标注成本高的难题。我们使用IBSR， MALC， LONI， LPBA， Hammers和CANDI五个MRI数据集进行了不同分割场景的实验，包括来自不同MRI数据集的推断样本的跨域设置和监督微调设置，与当前主流和监督分割算法相比，证明了所提出的分割算法的概括性和有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

BMC Medical Imaging RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING-

CiteScore

4.60

自引率

3.70%

发文量

198

审稿时长

27 weeks

期刊介绍： BMC Medical Imaging is an open access journal publishing original peer-reviewed research articles in the development, evaluation, and use of imaging techniques and image processing tools to diagnose and manage disease.