基于生成对抗网络的自关注语义分割模型

2022 International Conference on Artificial Intelligence and Computer Information Technology (AICIT) Pub Date : 2022-09-16 DOI:10.1109/AICIT55386.2022.9930225

Hongchang Yang, Jun Zhang

{"title":"基于生成对抗网络的自关注语义分割模型","authors":"Hongchang Yang, Jun Zhang","doi":"10.1109/AICIT55386.2022.9930225","DOIUrl":null,"url":null,"abstract":"To address the problems that existing recognition networks rely on a large amount of labeled data and the perceptual field is limited to the local area of convolution, and the lack of understanding of contextual information, this paper proposes a self-attentive semantic segmentation method based on generative adversarial networks. The method is based on generating adversarial networks, constructing semantic segmentation networks and discriminators, where the semantic segmentation network uses Resnet101 as the backbone to connect the spatial pyramid pooling module of PSPNet and adopts the cross-attention method in order to overcome the problem of too many parameters of classical attention models. The model was simulated in the publicly available PASCAL VOC 2012 dataset, and the results showed that the MIoU of the model reached 73.1%, 74.4%, and 75.1% for 1/8, 1/4, and 1/2 with labels, respectively, in the first semi-supervised experiments without improving the segmentation network compared to the control group, which were 3.6%, 2.3%, and 1.3% higher, respectively. The MIoU value reached 75.4% after improving the segmentation network, which proved the superiority and effectiveness of this model.","PeriodicalId":231070,"journal":{"name":"2022 International Conference on Artificial Intelligence and Computer Information Technology (AICIT)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Self-attentive Semantic Segmentation Model Based On Generative Adversarial Network\",\"authors\":\"Hongchang Yang, Jun Zhang\",\"doi\":\"10.1109/AICIT55386.2022.9930225\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To address the problems that existing recognition networks rely on a large amount of labeled data and the perceptual field is limited to the local area of convolution, and the lack of understanding of contextual information, this paper proposes a self-attentive semantic segmentation method based on generative adversarial networks. The method is based on generating adversarial networks, constructing semantic segmentation networks and discriminators, where the semantic segmentation network uses Resnet101 as the backbone to connect the spatial pyramid pooling module of PSPNet and adopts the cross-attention method in order to overcome the problem of too many parameters of classical attention models. The model was simulated in the publicly available PASCAL VOC 2012 dataset, and the results showed that the MIoU of the model reached 73.1%, 74.4%, and 75.1% for 1/8, 1/4, and 1/2 with labels, respectively, in the first semi-supervised experiments without improving the segmentation network compared to the control group, which were 3.6%, 2.3%, and 1.3% higher, respectively. The MIoU value reached 75.4% after improving the segmentation network, which proved the superiority and effectiveness of this model.\",\"PeriodicalId\":231070,\"journal\":{\"name\":\"2022 International Conference on Artificial Intelligence and Computer Information Technology (AICIT)\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Artificial Intelligence and Computer Information Technology (AICIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AICIT55386.2022.9930225\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Artificial Intelligence and Computer Information Technology (AICIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICIT55386.2022.9930225","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

针对现有识别网络依赖大量标记数据、感知场局限于局部卷积区域以及缺乏对上下文信息理解的问题，提出了一种基于生成对抗网络的自关注语义分割方法。该方法基于生成对抗网络，构建语义分割网络和判别器，其中语义分割网络以Resnet101为骨干连接PSPNet的空间金字塔池模块，并采用交叉注意方法克服经典注意模型参数过多的问题。在公开的PASCAL VOC 2012数据集中对该模型进行了仿真，结果表明，在未改进分割网络的情况下，与对照组相比，在1/8、1/4和1/2带标签情况下，该模型的MIoU分别达到73.1%、74.4%和75.1%，分别提高了3.6%、2.3%和1.3%。改进分割网络后，MIoU值达到75.4%，证明了该模型的优越性和有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Self-attentive Semantic Segmentation Model Based On Generative Adversarial Network

To address the problems that existing recognition networks rely on a large amount of labeled data and the perceptual field is limited to the local area of convolution, and the lack of understanding of contextual information, this paper proposes a self-attentive semantic segmentation method based on generative adversarial networks. The method is based on generating adversarial networks, constructing semantic segmentation networks and discriminators, where the semantic segmentation network uses Resnet101 as the backbone to connect the spatial pyramid pooling module of PSPNet and adopts the cross-attention method in order to overcome the problem of too many parameters of classical attention models. The model was simulated in the publicly available PASCAL VOC 2012 dataset, and the results showed that the MIoU of the model reached 73.1%, 74.4%, and 75.1% for 1/8, 1/4, and 1/2 with labels, respectively, in the first semi-supervised experiments without improving the segmentation network compared to the control group, which were 3.6%, 2.3%, and 1.3% higher, respectively. The MIoU value reached 75.4% after improving the segmentation network, which proved the superiority and effectiveness of this model.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 International Conference on Artificial Intelligence and Computer Information Technology (AICIT)

自引率

0.00%

发文量