基于编码器-解码器的生物医学图像分割方法分析

Proceedings of the 6th International Conference on Robotics and Artificial Intelligence Pub Date : 2020-11-20 DOI:10.1145/3449301.3449320

Rongsheng Zhang, Rongguo Zhang, Jiechao Ma, Huiling Zhang

{"title":"基于编码器-解码器的生物医学图像分割方法分析","authors":"Rongsheng Zhang, Rongguo Zhang, Jiechao Ma, Huiling Zhang","doi":"10.1145/3449301.3449320","DOIUrl":null,"url":null,"abstract":"Recently, CNNs (convolutional neural networks) have been widely used in the field of medical image segmentation. In particular, the encoder-decoder architectures represented by U-Net have achieved state-of-art segmentation effects and inspired many more elaborated networks, which adopt newer and more advanced network designs. To our knowledge, the comprehensive and detailed comparison among these improved versions from a multiplicity of points of view has not been conducted up to now. With U-Net as the baseline, we select the other four typical improvements for U-Net. For higher reliability, we finish the task of segmentation on four datasets and more experiments are performed to test the performance in various conditions. Finally, we evaluate their performance using multiple evaluation metrics. We find that attention U-Net achieves the best segmentation results in terms of F1-score but also owns the most trainable parameters and is most time-consuming. As training images decrease, the original U-Net is most robust even only less than 5 training samples are available. Besides, for any networks, adding auxiliary loss function with small weighting such as 0.01 or 0.01 whatever the cross-entropy loss and the dice-coefficient loss for the other one is beneficial as well.","PeriodicalId":429684,"journal":{"name":"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence","volume":"353 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Analysis of Different Encoder-decoder-based Approaches for Biomedical Imaging Segmentation\",\"authors\":\"Rongsheng Zhang, Rongguo Zhang, Jiechao Ma, Huiling Zhang\",\"doi\":\"10.1145/3449301.3449320\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Recently, CNNs (convolutional neural networks) have been widely used in the field of medical image segmentation. In particular, the encoder-decoder architectures represented by U-Net have achieved state-of-art segmentation effects and inspired many more elaborated networks, which adopt newer and more advanced network designs. To our knowledge, the comprehensive and detailed comparison among these improved versions from a multiplicity of points of view has not been conducted up to now. With U-Net as the baseline, we select the other four typical improvements for U-Net. For higher reliability, we finish the task of segmentation on four datasets and more experiments are performed to test the performance in various conditions. Finally, we evaluate their performance using multiple evaluation metrics. We find that attention U-Net achieves the best segmentation results in terms of F1-score but also owns the most trainable parameters and is most time-consuming. As training images decrease, the original U-Net is most robust even only less than 5 training samples are available. Besides, for any networks, adding auxiliary loss function with small weighting such as 0.01 or 0.01 whatever the cross-entropy loss and the dice-coefficient loss for the other one is beneficial as well.\",\"PeriodicalId\":429684,\"journal\":{\"name\":\"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence\",\"volume\":\"353 \",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3449301.3449320\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3449301.3449320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

近年来，卷积神经网络在医学图像分割领域得到了广泛的应用。特别是，以U-Net为代表的编码器-解码器架构实现了最先进的分割效果，并激发了许多更复杂的网络，这些网络采用了更新和更先进的网络设计。据我们所知，到目前为止，还没有人从多个角度对这些改进版本进行全面而详细的比较。以U-Net为基准，我们选择U-Net的其他四个典型改进。为了提高可靠性，我们在四个数据集上完成了分割任务，并进行了更多的实验来测试不同条件下的性能。最后，我们使用多个评估指标来评估它们的性能。我们发现注意力U-Net在F1-score方面取得了最好的分割效果，但同时也拥有最多的可训练参数，并且耗时最长。随着训练图像的减少，即使只有不到5个训练样本可用，原始U-Net也是最鲁棒的。此外，对于任何网络，在交叉熵损失和骰子系数损失的情况下，添加小权重的辅助损失函数(如0.01或0.01)也是有益的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Analysis of Different Encoder-decoder-based Approaches for Biomedical Imaging Segmentation

Recently, CNNs (convolutional neural networks) have been widely used in the field of medical image segmentation. In particular, the encoder-decoder architectures represented by U-Net have achieved state-of-art segmentation effects and inspired many more elaborated networks, which adopt newer and more advanced network designs. To our knowledge, the comprehensive and detailed comparison among these improved versions from a multiplicity of points of view has not been conducted up to now. With U-Net as the baseline, we select the other four typical improvements for U-Net. For higher reliability, we finish the task of segmentation on four datasets and more experiments are performed to test the performance in various conditions. Finally, we evaluate their performance using multiple evaluation metrics. We find that attention U-Net achieves the best segmentation results in terms of F1-score but also owns the most trainable parameters and is most time-consuming. As training images decrease, the original U-Net is most robust even only less than 5 training samples are available. Besides, for any networks, adding auxiliary loss function with small weighting such as 0.01 or 0.01 whatever the cross-entropy loss and the dice-coefficient loss for the other one is beneficial as well.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 6th International Conference on Robotics and Artificial Intelligence

自引率

0.00%

发文量