Mid-Net: Rethinking efficient network architectures for small-sample vascular segmentation

IF 14.7 1区计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Information Fusion Pub Date : 2024-10-28 DOI:10.1016/j.inffus.2024.102777

Dongxin Zhao , Jianhua Liu , Peng Geng , Jiaxin Yang , Ziqian Zhang , Yin Zhang

{"title":"Mid-Net: Rethinking efficient network architectures for small-sample vascular segmentation","authors":"Dongxin Zhao , Jianhua Liu , Peng Geng , Jiaxin Yang , Ziqian Zhang , Yin Zhang","doi":"10.1016/j.inffus.2024.102777","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning-based medical image segmentation methods have demonstrated significant clinical applications. However, training these methods on small-sample vascular datasets remains challenging due to the scarcity of labeled data and severe category imbalance. To address this, this paper proposes Mid-Net, which fully exploits the often-overlooked feature representation potential of the middle-layer network through cross-layer guidance to improve model learning efficiency in data-constrained environments. Mid-Net consists of three core components: the encoding path, the guidance path, and the calibration path. In the encoding path, a feature pyramid structure with large kernel convolutions is used to extract semantic information at different scales. The guidance path combines the sensitivity of the shallow-layer network to spatial details with the global perceptual abilities of the deep-layer network to provide more discriminative guidance to the middle-layer network in a feature-decoupled manner. The calibration path further calibrates the spatial location information of the middle-layer network through end-to-end supervised learning. Experiments conducted on the publicly available retinal vascular datasets DRIVE, STARE, and CHASE_DB1, as well as coronary angiography datasets DCA1 and CHUAC, demonstrate that Mid-Net achieves superior segmentation results with lower computational resource requirements compared to state-of-the-art methods.</div></div>","PeriodicalId":50367,"journal":{"name":"Information Fusion","volume":"115 ","pages":"Article 102777"},"PeriodicalIF":14.7000,"publicationDate":"2024-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Fusion","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1566253524005554","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Deep learning-based medical image segmentation methods have demonstrated significant clinical applications. However, training these methods on small-sample vascular datasets remains challenging due to the scarcity of labeled data and severe category imbalance. To address this, this paper proposes Mid-Net, which fully exploits the often-overlooked feature representation potential of the middle-layer network through cross-layer guidance to improve model learning efficiency in data-constrained environments. Mid-Net consists of three core components: the encoding path, the guidance path, and the calibration path. In the encoding path, a feature pyramid structure with large kernel convolutions is used to extract semantic information at different scales. The guidance path combines the sensitivity of the shallow-layer network to spatial details with the global perceptual abilities of the deep-layer network to provide more discriminative guidance to the middle-layer network in a feature-decoupled manner. The calibration path further calibrates the spatial location information of the middle-layer network through end-to-end supervised learning. Experiments conducted on the publicly available retinal vascular datasets DRIVE, STARE, and CHASE_DB1, as well as coronary angiography datasets DCA1 and CHUAC, demonstrate that Mid-Net achieves superior segmentation results with lower computational resource requirements compared to state-of-the-art methods.

查看原文本刊更多论文

中网：重新思考用于小样本血管分割的高效网络架构

基于深度学习的医学图像分割方法已在临床上得到广泛应用。然而，由于标记数据稀缺和严重的类别不平衡，在小样本血管数据集上训练这些方法仍然具有挑战性。针对这一问题，本文提出了 Mid-Net，通过跨层引导，充分利用中间层网络经常被忽视的特征表示潜力，提高数据受限环境下的模型学习效率。Mid-Net 由三个核心部分组成：编码路径、引导路径和校准路径。在编码路径中，使用具有大核卷积的特征金字塔结构来提取不同尺度的语义信息。引导路径结合了浅层网络对空间细节的敏感性和深层网络的全局感知能力，以特征解耦的方式为中层网络提供更具辨别力的引导。校准路径通过端到端的监督学习，进一步校准中间层网络的空间位置信息。在公开的视网膜血管数据集 DRIVE、STARE 和 CHASE_DB1 以及冠状动脉血管造影数据集 DCA1 和 CHUAC 上进行的实验表明，与最先进的方法相比，Mid-Net 能以更低的计算资源要求获得更出色的分割结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Information Fusion 工程技术-计算机：理论方法

CiteScore

33.20

自引率

4.30%

发文量

161

审稿时长

7.9 months

期刊介绍： Information Fusion serves as a central platform for showcasing advancements in multi-sensor, multi-source, multi-process information fusion, fostering collaboration among diverse disciplines driving its progress. It is the leading outlet for sharing research and development in this field, focusing on architectures, algorithms, and applications. Papers dealing with fundamental theoretical analyses as well as those demonstrating their application to real-world problems will be welcome.