{"title":"Analysis of Different Encoder-decoder-based Approaches for Biomedical Imaging Segmentation","authors":"Rongsheng Zhang, Rongguo Zhang, Jiechao Ma, Huiling Zhang","doi":"10.1145/3449301.3449320","DOIUrl":null,"url":null,"abstract":"Recently, CNNs (convolutional neural networks) have been widely used in the field of medical image segmentation. In particular, the encoder-decoder architectures represented by U-Net have achieved state-of-art segmentation effects and inspired many more elaborated networks, which adopt newer and more advanced network designs. To our knowledge, the comprehensive and detailed comparison among these improved versions from a multiplicity of points of view has not been conducted up to now. With U-Net as the baseline, we select the other four typical improvements for U-Net. For higher reliability, we finish the task of segmentation on four datasets and more experiments are performed to test the performance in various conditions. Finally, we evaluate their performance using multiple evaluation metrics. We find that attention U-Net achieves the best segmentation results in terms of F1-score but also owns the most trainable parameters and is most time-consuming. As training images decrease, the original U-Net is most robust even only less than 5 training samples are available. Besides, for any networks, adding auxiliary loss function with small weighting such as 0.01 or 0.01 whatever the cross-entropy loss and the dice-coefficient loss for the other one is beneficial as well.","PeriodicalId":429684,"journal":{"name":"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence","volume":"353 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 6th International Conference on Robotics and Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3449301.3449320","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Recently, CNNs (convolutional neural networks) have been widely used in the field of medical image segmentation. In particular, the encoder-decoder architectures represented by U-Net have achieved state-of-art segmentation effects and inspired many more elaborated networks, which adopt newer and more advanced network designs. To our knowledge, the comprehensive and detailed comparison among these improved versions from a multiplicity of points of view has not been conducted up to now. With U-Net as the baseline, we select the other four typical improvements for U-Net. For higher reliability, we finish the task of segmentation on four datasets and more experiments are performed to test the performance in various conditions. Finally, we evaluate their performance using multiple evaluation metrics. We find that attention U-Net achieves the best segmentation results in terms of F1-score but also owns the most trainable parameters and is most time-consuming. As training images decrease, the original U-Net is most robust even only less than 5 training samples are available. Besides, for any networks, adding auxiliary loss function with small weighting such as 0.01 or 0.01 whatever the cross-entropy loss and the dice-coefficient loss for the other one is beneficial as well.