{"title":"SC-UneXt: Nested UNeXt Architecture based on Medical Image Segmentation","authors":"Lei Wen","doi":"10.54097/fcis.v6i2.07","DOIUrl":null,"url":null,"abstract":"UNet and its various variants are commonly used methods in medical image segmentation tasks; however, many network parameters, complex calculations, and slow usage are problems that need to be overcome. These problems hinder the specific application of fast image segmentation in real-time tasks. At the same time, the lesion area has problems such as small size, irregular shape, and blurred edges, which makes the network feature extraction difficult and the segmentation accuracy needs to be improved. At the same time, medical image segmentation provides a variety of effective methods for the accuracy and robustness of organ segmentation, lesion detection, and classification. Medical images have fixed structures, simple semantics, and diverse details, so integrating rich multi-scale features can improve segmentation accuracy. Given that the density of diseased tissue may be comparable to that of surrounding normal tissue, both global and local information are crucial to segmentation results. To this end, we propose an image segmentation method (SC -UNe X t) based on edge feature extraction and multi-scale feature fusion of convolutional multi-layer perceptron (MLP). The network is a deeply supervised encoder-decoder network, in which the encoder and decoder pass through a series of nested, multiple jump paths to reduce the semantic gap between the feature maps of the encoder and decoder sub-networks.; Multi - scale feature fusion is introduced based on the UNe Finally, we evaluate our model approach on the LIDC dataset public dataset. Experiments have proven the effectiveness of this method. Our model's similarity coefficient and intersection ratio reached 86.44% and 90.86% respectively. Compared with UNet and UNe X t, the network proposed in this article has improved in accuracy, intersection ratio of real values and predicted values, similarity coefficient, and segmentation effect.","PeriodicalId":346823,"journal":{"name":"Frontiers in Computing and Intelligent Systems","volume":"41 5","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2023-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Frontiers in Computing and Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.54097/fcis.v6i2.07","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
UNet and its various variants are commonly used methods in medical image segmentation tasks; however, many network parameters, complex calculations, and slow usage are problems that need to be overcome. These problems hinder the specific application of fast image segmentation in real-time tasks. At the same time, the lesion area has problems such as small size, irregular shape, and blurred edges, which makes the network feature extraction difficult and the segmentation accuracy needs to be improved. At the same time, medical image segmentation provides a variety of effective methods for the accuracy and robustness of organ segmentation, lesion detection, and classification. Medical images have fixed structures, simple semantics, and diverse details, so integrating rich multi-scale features can improve segmentation accuracy. Given that the density of diseased tissue may be comparable to that of surrounding normal tissue, both global and local information are crucial to segmentation results. To this end, we propose an image segmentation method (SC -UNe X t) based on edge feature extraction and multi-scale feature fusion of convolutional multi-layer perceptron (MLP). The network is a deeply supervised encoder-decoder network, in which the encoder and decoder pass through a series of nested, multiple jump paths to reduce the semantic gap between the feature maps of the encoder and decoder sub-networks.; Multi - scale feature fusion is introduced based on the UNe Finally, we evaluate our model approach on the LIDC dataset public dataset. Experiments have proven the effectiveness of this method. Our model's similarity coefficient and intersection ratio reached 86.44% and 90.86% respectively. Compared with UNet and UNe X t, the network proposed in this article has improved in accuracy, intersection ratio of real values and predicted values, similarity coefficient, and segmentation effect.
UNet 及其各种变体是医学图像分割任务中常用的方法,但网络参数多、计算复杂、使用速度慢是需要克服的问题。这些问题阻碍了快速图像分割在实时任务中的具体应用。同时,病变区域存在尺寸小、形状不规则、边缘模糊等问题,给网络特征提取带来困难,分割精度有待提高。同时,医学图像分割为器官分割、病变检测和分类的准确性和鲁棒性提供了多种有效方法。医学图像结构固定、语义简单、细节多样,因此整合丰富的多尺度特征可以提高分割精度。鉴于病变组织的密度可能与周围正常组织的密度相当,全局和局部信息对分割结果至关重要。为此,我们提出了一种基于边缘特征提取和卷积多层感知器(MLP)多尺度特征融合的图像分割方法(SC -UNe X t)。该网络是一个深度监督的编码器-解码器网络,其中编码器和解码器通过一系列嵌套的多重跳转路径来减少编码器和解码器子网络的特征图之间的语义差距。 最后,我们在 LIDC 数据集公共数据集上评估了我们的模型方法。实验证明了这种方法的有效性。我们模型的相似系数和交叉率分别达到了 86.44% 和 90.86%。与 UNet 和 UNe X t 相比,本文提出的网络在准确度、真实值与预测值的交集比、相似系数和分割效果方面都有所提高。