Jian Li , Peng Ding , Fengwu Lin , Zhaomin Chen , Ali Asghar Heidari , Huiling Chen
{"title":"UNeSt: A fast segmentation network for colorectal polyps based on MLP and deep separable convolution","authors":"Jian Li , Peng Ding , Fengwu Lin , Zhaomin Chen , Ali Asghar Heidari , Huiling Chen","doi":"10.1016/j.bspc.2024.107165","DOIUrl":null,"url":null,"abstract":"<div><div>In medical image segmentation, conventional methodologies based on the UNet method often focus on improving the network performance but overlook the parameters and computational complexity. Due to the limitation of computing resources, these methods can hardly be applied in the landscape of point-of-care (PoC) applications. This study presents UNeSt, a rapid segmentation network tailored for colorectal polyps. The architectural foundation of UNeSt hinges upon the synergistic integration of the depth separable convolutional layer (DSC) and multilayer perceptron (MLP). UNeSt achieves an innovative fusion of these components, resulting in a substantial reduction in model parameters and computational complexity, which is concomitant with a remarkable enhancement in inference speed. Specifically, UNeSt incorporates the convolutional block attention module (CBAM) within the convolutional encoder to extract channel and spatial information proficiently. Furthermore, we introduce an attention mechanism to address the positional information discrepancies introduced in the MLP stage. This comprehensive approach contributes significantly to the augmentation of accuracy in colorectal polyp segmentation. Finally, UNeSt employs skip connections between various levels of encoders and decoders, thereby mitigating information loss problems. In the context of this investigation, UNeSt underwent rigorous evaluation using a demanding polyp segmentation dataset. Relative to UNeXt, a widely employed and exceedingly lightweight network model, the proposed model in this study exhibits a noteworthy reduction with 1.6x fewer parameters, a 2.5x decrease in computational complexity (measured in GFLOPs), and a 1.9x acceleration in inference speed.</div></div>","PeriodicalId":55362,"journal":{"name":"Biomedical Signal Processing and Control","volume":"100 ","pages":"Article 107165"},"PeriodicalIF":4.9000,"publicationDate":"2024-11-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biomedical Signal Processing and Control","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1746809424012230","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
In medical image segmentation, conventional methodologies based on the UNet method often focus on improving the network performance but overlook the parameters and computational complexity. Due to the limitation of computing resources, these methods can hardly be applied in the landscape of point-of-care (PoC) applications. This study presents UNeSt, a rapid segmentation network tailored for colorectal polyps. The architectural foundation of UNeSt hinges upon the synergistic integration of the depth separable convolutional layer (DSC) and multilayer perceptron (MLP). UNeSt achieves an innovative fusion of these components, resulting in a substantial reduction in model parameters and computational complexity, which is concomitant with a remarkable enhancement in inference speed. Specifically, UNeSt incorporates the convolutional block attention module (CBAM) within the convolutional encoder to extract channel and spatial information proficiently. Furthermore, we introduce an attention mechanism to address the positional information discrepancies introduced in the MLP stage. This comprehensive approach contributes significantly to the augmentation of accuracy in colorectal polyp segmentation. Finally, UNeSt employs skip connections between various levels of encoders and decoders, thereby mitigating information loss problems. In the context of this investigation, UNeSt underwent rigorous evaluation using a demanding polyp segmentation dataset. Relative to UNeXt, a widely employed and exceedingly lightweight network model, the proposed model in this study exhibits a noteworthy reduction with 1.6x fewer parameters, a 2.5x decrease in computational complexity (measured in GFLOPs), and a 1.9x acceleration in inference speed.
期刊介绍:
Biomedical Signal Processing and Control aims to provide a cross-disciplinary international forum for the interchange of information on research in the measurement and analysis of signals and images in clinical medicine and the biological sciences. Emphasis is placed on contributions dealing with the practical, applications-led research on the use of methods and devices in clinical diagnosis, patient monitoring and management.
Biomedical Signal Processing and Control reflects the main areas in which these methods are being used and developed at the interface of both engineering and clinical science. The scope of the journal is defined to include relevant review papers, technical notes, short communications and letters. Tutorial papers and special issues will also be published.