Mengxu Lu, Zhenxue Chen, Hao Qin, Yujiao Zhang, Jingjing Ji
{"title":"SSIGAN: Semantic Segmentation via Improved Generative Adversarial Network","authors":"Mengxu Lu, Zhenxue Chen, Hao Qin, Yujiao Zhang, Jingjing Ji","doi":"10.1109/ICSPS58776.2022.00140","DOIUrl":null,"url":null,"abstract":"Nowadays, although conditional convolutional neural networks have applied to semantic segmentation, their loss function needs to be carefully designed. We propose an improved generative adversarial network including a generator network and a discriminator network for semantic segmentation. In some blocks, we substitute 3×1 and 1×3 factorized convolution for 3×3 convolution to make full use of transverse and longitudinal information. We concat the original image with the output of the generator as the input of the discriminator network to improve the discriminant ability. As a result, our model achieves 69.6% mean intersection over union (mIoU) on the Cityscapes test set. Our experiments exhibit that adversarial training approach leads to improved accuracy on the Cityscapes, Camvid, Kitti and Gatech dataset in road scene.","PeriodicalId":330562,"journal":{"name":"2022 14th International Conference on Signal Processing Systems (ICSPS)","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Signal Processing Systems (ICSPS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSPS58776.2022.00140","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Nowadays, although conditional convolutional neural networks have applied to semantic segmentation, their loss function needs to be carefully designed. We propose an improved generative adversarial network including a generator network and a discriminator network for semantic segmentation. In some blocks, we substitute 3×1 and 1×3 factorized convolution for 3×3 convolution to make full use of transverse and longitudinal information. We concat the original image with the output of the generator as the input of the discriminator network to improve the discriminant ability. As a result, our model achieves 69.6% mean intersection over union (mIoU) on the Cityscapes test set. Our experiments exhibit that adversarial training approach leads to improved accuracy on the Cityscapes, Camvid, Kitti and Gatech dataset in road scene.