基于生成对抗网络的自关注语义分割模型

Hongchang Yang, Jun Zhang
{"title":"基于生成对抗网络的自关注语义分割模型","authors":"Hongchang Yang, Jun Zhang","doi":"10.1109/AICIT55386.2022.9930225","DOIUrl":null,"url":null,"abstract":"To address the problems that existing recognition networks rely on a large amount of labeled data and the perceptual field is limited to the local area of convolution, and the lack of understanding of contextual information, this paper proposes a self-attentive semantic segmentation method based on generative adversarial networks. The method is based on generating adversarial networks, constructing semantic segmentation networks and discriminators, where the semantic segmentation network uses Resnet101 as the backbone to connect the spatial pyramid pooling module of PSPNet and adopts the cross-attention method in order to overcome the problem of too many parameters of classical attention models. The model was simulated in the publicly available PASCAL VOC 2012 dataset, and the results showed that the MIoU of the model reached 73.1%, 74.4%, and 75.1% for 1/8, 1/4, and 1/2 with labels, respectively, in the first semi-supervised experiments without improving the segmentation network compared to the control group, which were 3.6%, 2.3%, and 1.3% higher, respectively. The MIoU value reached 75.4% after improving the segmentation network, which proved the superiority and effectiveness of this model.","PeriodicalId":231070,"journal":{"name":"2022 International Conference on Artificial Intelligence and Computer Information Technology (AICIT)","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Self-attentive Semantic Segmentation Model Based On Generative Adversarial Network\",\"authors\":\"Hongchang Yang, Jun Zhang\",\"doi\":\"10.1109/AICIT55386.2022.9930225\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To address the problems that existing recognition networks rely on a large amount of labeled data and the perceptual field is limited to the local area of convolution, and the lack of understanding of contextual information, this paper proposes a self-attentive semantic segmentation method based on generative adversarial networks. The method is based on generating adversarial networks, constructing semantic segmentation networks and discriminators, where the semantic segmentation network uses Resnet101 as the backbone to connect the spatial pyramid pooling module of PSPNet and adopts the cross-attention method in order to overcome the problem of too many parameters of classical attention models. The model was simulated in the publicly available PASCAL VOC 2012 dataset, and the results showed that the MIoU of the model reached 73.1%, 74.4%, and 75.1% for 1/8, 1/4, and 1/2 with labels, respectively, in the first semi-supervised experiments without improving the segmentation network compared to the control group, which were 3.6%, 2.3%, and 1.3% higher, respectively. The MIoU value reached 75.4% after improving the segmentation network, which proved the superiority and effectiveness of this model.\",\"PeriodicalId\":231070,\"journal\":{\"name\":\"2022 International Conference on Artificial Intelligence and Computer Information Technology (AICIT)\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Artificial Intelligence and Computer Information Technology (AICIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AICIT55386.2022.9930225\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Artificial Intelligence and Computer Information Technology (AICIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AICIT55386.2022.9930225","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

针对现有识别网络依赖大量标记数据、感知场局限于局部卷积区域以及缺乏对上下文信息理解的问题,提出了一种基于生成对抗网络的自关注语义分割方法。该方法基于生成对抗网络,构建语义分割网络和判别器,其中语义分割网络以Resnet101为骨干连接PSPNet的空间金字塔池模块,并采用交叉注意方法克服经典注意模型参数过多的问题。在公开的PASCAL VOC 2012数据集中对该模型进行了仿真,结果表明,在未改进分割网络的情况下,与对照组相比,在1/8、1/4和1/2带标签情况下,该模型的MIoU分别达到73.1%、74.4%和75.1%,分别提高了3.6%、2.3%和1.3%。改进分割网络后,MIoU值达到75.4%,证明了该模型的优越性和有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Self-attentive Semantic Segmentation Model Based On Generative Adversarial Network
To address the problems that existing recognition networks rely on a large amount of labeled data and the perceptual field is limited to the local area of convolution, and the lack of understanding of contextual information, this paper proposes a self-attentive semantic segmentation method based on generative adversarial networks. The method is based on generating adversarial networks, constructing semantic segmentation networks and discriminators, where the semantic segmentation network uses Resnet101 as the backbone to connect the spatial pyramid pooling module of PSPNet and adopts the cross-attention method in order to overcome the problem of too many parameters of classical attention models. The model was simulated in the publicly available PASCAL VOC 2012 dataset, and the results showed that the MIoU of the model reached 73.1%, 74.4%, and 75.1% for 1/8, 1/4, and 1/2 with labels, respectively, in the first semi-supervised experiments without improving the segmentation network compared to the control group, which were 3.6%, 2.3%, and 1.3% higher, respectively. The MIoU value reached 75.4% after improving the segmentation network, which proved the superiority and effectiveness of this model.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信