Review of Deep Learning Models for Spine Segmentation

Proceedings of the 2022 International Conference on Multimedia Retrieval Pub Date : 2022-06-27 DOI:10.1145/3512527.3531356

Neng Zhou, Hairu Wen, Yi Wang, Yang Liu, Longfei Zhou

{"title":"Review of Deep Learning Models for Spine Segmentation","authors":"Neng Zhou, Hairu Wen, Yi Wang, Yang Liu, Longfei Zhou","doi":"10.1145/3512527.3531356","DOIUrl":null,"url":null,"abstract":"Medical image segmentation has been a long-standing chal- lenge due to the limitation in labeled datasets and the exis- tence of noise and artifacts. In recent years, deep learning has shown its capability in achieving successive progress in this field, making its automatic segmentation performance gradually catch up with that of manual segmentation. In this paper, we select twelve state-of-the-art models and compare their performance in the spine MRI segmentation task. We divide them into two categories. One of them is the U-Net family, including U-Net, Attention U-Net, ResUNet++, TransUNet, and MiniSeg. The architectures of these models often ultimately include the encoder-decoder structure, and their innovation generally lies in the way of better fusing low-level and high-level information. Models in the other category, named Models Using Backbone often use ResNet, Res2Net, or other pre-trained models on ImageNet as the backbone to extract information. These models pay more attention capturing multi-scale and rich contextual information. All models are trained and tested on the open-source spine M- RI dataset with 20 labels and no pre-training. Through the comparison, the models using backbone exceed U-Net family, and DeepLabv3+ works best. We suppose it is also necessary to extract multi-scale information in a multi-label medical segmentation task.","PeriodicalId":179895,"journal":{"name":"Proceedings of the 2022 International Conference on Multimedia Retrieval","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 International Conference on Multimedia Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3512527.3531356","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Medical image segmentation has been a long-standing chal- lenge due to the limitation in labeled datasets and the exis- tence of noise and artifacts. In recent years, deep learning has shown its capability in achieving successive progress in this field, making its automatic segmentation performance gradually catch up with that of manual segmentation. In this paper, we select twelve state-of-the-art models and compare their performance in the spine MRI segmentation task. We divide them into two categories. One of them is the U-Net family, including U-Net, Attention U-Net, ResUNet++, TransUNet, and MiniSeg. The architectures of these models often ultimately include the encoder-decoder structure, and their innovation generally lies in the way of better fusing low-level and high-level information. Models in the other category, named Models Using Backbone often use ResNet, Res2Net, or other pre-trained models on ImageNet as the backbone to extract information. These models pay more attention capturing multi-scale and rich contextual information. All models are trained and tested on the open-source spine M- RI dataset with 20 labels and no pre-training. Through the comparison, the models using backbone exceed U-Net family, and DeepLabv3+ works best. We suppose it is also necessary to extract multi-scale information in a multi-label medical segmentation task.

查看原文本刊更多论文

脊柱分割的深度学习模型综述

由于标记数据集的限制以及噪声和伪影的存在，医学图像分割一直是一个长期存在的挑战。近年来，深度学习在这一领域取得了长足的进步，其自动分割性能逐渐赶上人工分割。在本文中，我们选择了12个最先进的模型，并比较了它们在脊柱MRI分割任务中的性能。我们把它们分为两类。其中之一是U-Net家族，包括U-Net、Attention U-Net、ResUNet++、TransUNet和MiniSeg。这些模型的体系结构通常最终包括编码器-解码器结构，它们的创新通常在于更好地融合低级和高级信息的方式。另一类模型，称为使用骨干的模型，通常使用ResNet, Res2Net或其他在ImageNet上预训练的模型作为骨干提取信息。这些模型更注重捕获多尺度和丰富的上下文信息。所有模型都在开放源代码的脊柱M- RI数据集上进行训练和测试，该数据集有20个标签，没有预训练。通过对比，采用骨干网的模型优于U-Net系列，其中DeepLabv3+的效果最好。我们认为在多标签医学分割任务中提取多尺度信息也是必要的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 2022 International Conference on Multimedia Retrieval

自引率

0.00%

发文量