Along He , Yanlin Wu , Zhihong Wang , Tao Li , Huazhu Fu
{"title":"AdaptFRCNet: Semi-supervised adaptation of pre-trained model with frequency and region consistency for medical image segmentation","authors":"Along He , Yanlin Wu , Zhihong Wang , Tao Li , Huazhu Fu","doi":"10.1016/j.media.2025.103626","DOIUrl":null,"url":null,"abstract":"<div><div>Recently, large pre-trained models (LPM) have achieved great success, which provides rich feature representation for downstream tasks. Pre-training and then fine-tuning is an effective way to utilize LPM. However, the application of LPM in the medical domain is hindered by the presence of a large number of parameters and a limited amount of labeled data. In clinical practice, there exists a substantial amount of unlabeled data that remains underutilized. Semi-supervised learning emerges as a promising approach to harnessing these unlabeled data effectively. In this paper, we propose semi-supervised adaptation of pre-trained model with frequency and region consistency (AdaptFRCNet) for medical image segmentation. Specifically, the pre-trained model is frozen and the proposed lightweight attention-based adapters (Att_Adapter) are inserted into the frozen backbone for parameter-efficient fine-tuning (PEFT). We propose two consistency regularization strategies for semi-supervised learning: frequency domain consistency (FDC) and multi-granularity region similarity consistency (MRSC). FDC aids in learning features within the frequency domain, and MRSC aims to achieve multiple region-level feature consistencies, capturing local context information effectively. By leveraging the proposed Att_Adapter along with FDC and MRSC, we can effectively and efficiently harness the powerful feature representation capability of the LPM. We conduct extensive experiments on three medical image segmentation datasets, demonstrating significant performance improvements over other state-of-the-art methods. The code is available at <span><span>https://github.com/NKUhealong/AdaptFRCNet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":18328,"journal":{"name":"Medical image analysis","volume":"103 ","pages":"Article 103626"},"PeriodicalIF":10.7000,"publicationDate":"2025-05-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical image analysis","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1361841525001732","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Recently, large pre-trained models (LPM) have achieved great success, which provides rich feature representation for downstream tasks. Pre-training and then fine-tuning is an effective way to utilize LPM. However, the application of LPM in the medical domain is hindered by the presence of a large number of parameters and a limited amount of labeled data. In clinical practice, there exists a substantial amount of unlabeled data that remains underutilized. Semi-supervised learning emerges as a promising approach to harnessing these unlabeled data effectively. In this paper, we propose semi-supervised adaptation of pre-trained model with frequency and region consistency (AdaptFRCNet) for medical image segmentation. Specifically, the pre-trained model is frozen and the proposed lightweight attention-based adapters (Att_Adapter) are inserted into the frozen backbone for parameter-efficient fine-tuning (PEFT). We propose two consistency regularization strategies for semi-supervised learning: frequency domain consistency (FDC) and multi-granularity region similarity consistency (MRSC). FDC aids in learning features within the frequency domain, and MRSC aims to achieve multiple region-level feature consistencies, capturing local context information effectively. By leveraging the proposed Att_Adapter along with FDC and MRSC, we can effectively and efficiently harness the powerful feature representation capability of the LPM. We conduct extensive experiments on three medical image segmentation datasets, demonstrating significant performance improvements over other state-of-the-art methods. The code is available at https://github.com/NKUhealong/AdaptFRCNet.
期刊介绍:
Medical Image Analysis serves as a platform for sharing new research findings in the realm of medical and biological image analysis, with a focus on applications of computer vision, virtual reality, and robotics to biomedical imaging challenges. The journal prioritizes the publication of high-quality, original papers contributing to the fundamental science of processing, analyzing, and utilizing medical and biological images. It welcomes approaches utilizing biomedical image datasets across all spatial scales, from molecular/cellular imaging to tissue/organ imaging.