Semi-Supervised Semantic Segmentation of Class-Imbalanced Images: A Hierarchical Self-Attention Generative Adversarial Network

2022 7th International Conference on Image, Vision and Computing (ICIVC) Pub Date : 2022-07-26 DOI:10.1109/ICIVC55077.2022.9886496

Lu Chai, Qinyuan Liu

{"title":"Semi-Supervised Semantic Segmentation of Class-Imbalanced Images: A Hierarchical Self-Attention Generative Adversarial Network","authors":"Lu Chai, Qinyuan Liu","doi":"10.1109/ICIVC55077.2022.9886496","DOIUrl":null,"url":null,"abstract":"How to train models with unlabeled data and implement one trained model across several data sets are key problems in computer vision applications that require high-cost annotations. Recently, a generative model [1] proves its advantages in semi-supervised segmentation and out-of-domain generalization. However, this method becomes less effective when meet with class-imbalanced images whose foreground occupies small areas. To solve this problem, we introduce a hierarchical generative model with a self-attention mechanism to help with capturing features of foreground objects. Concretely, we apply a two-stage hierarchical generative model to perform image synthesis with the self-attention mechanism. Since attention maps are also semantic labels in segmentation fields, the hierarchical self-attention model can synthesize images and corresponding segmentation labels simultaneously. At test time, the segmentation is achieved by mapping input images into latent presentations with two encoders and synthesizing labels with the generative model. We evaluate our hierarchical model on three biomedical segmentation data sets. The experimental results demonstrate that our method outperforms other baselines on semi-supervised segmentation of class-imbalanced images, and meanwhile, pre-serves out-of-domain generalization ability.","PeriodicalId":227073,"journal":{"name":"2022 7th International Conference on Image, Vision and Computing (ICIVC)","volume":"50 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 7th International Conference on Image, Vision and Computing (ICIVC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICIVC55077.2022.9886496","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

How to train models with unlabeled data and implement one trained model across several data sets are key problems in computer vision applications that require high-cost annotations. Recently, a generative model [1] proves its advantages in semi-supervised segmentation and out-of-domain generalization. However, this method becomes less effective when meet with class-imbalanced images whose foreground occupies small areas. To solve this problem, we introduce a hierarchical generative model with a self-attention mechanism to help with capturing features of foreground objects. Concretely, we apply a two-stage hierarchical generative model to perform image synthesis with the self-attention mechanism. Since attention maps are also semantic labels in segmentation fields, the hierarchical self-attention model can synthesize images and corresponding segmentation labels simultaneously. At test time, the segmentation is achieved by mapping input images into latent presentations with two encoders and synthesizing labels with the generative model. We evaluate our hierarchical model on three biomedical segmentation data sets. The experimental results demonstrate that our method outperforms other baselines on semi-supervised segmentation of class-imbalanced images, and meanwhile, pre-serves out-of-domain generalization ability.

查看原文本刊更多论文

类不平衡图像的半监督语义分割:一种层次自注意生成对抗网络

如何使用未标记的数据训练模型，并在多个数据集上实现一个训练好的模型，是需要高成本注释的计算机视觉应用中的关键问题。最近，一种生成模型[1]证明了它在半监督分割和域外泛化方面的优势。但是，当遇到前景面积小、类不平衡的图像时，这种方法的效果就不太好了。为了解决这个问题，我们引入了一个具有自关注机制的分层生成模型来帮助捕获前景对象的特征。具体而言，我们采用了一种两阶段的分层生成模型，通过自注意机制进行图像合成。由于注意图也是切分领域的语义标签，分层自注意模型可以同时合成图像和相应的切分标签。在测试时，通过使用两个编码器将输入图像映射为潜在表示并使用生成模型合成标签来实现分割。我们在三个生物医学分割数据集上评估了我们的层次模型。实验结果表明，该方法在类不平衡图像的半监督分割上优于其他基线，同时保留了域外泛化能力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 7th International Conference on Image, Vision and Computing (ICIVC)

自引率

0.00%

发文量