{"title":"Semantic Segmentation Based on Deeplabv3+ and Attention Mechanism","authors":"Rongrong Liu, Dongzhi He","doi":"10.1109/IMCEC51613.2021.9482207","DOIUrl":null,"url":null,"abstract":"In this paper, we propose vertical attention and spatial attention network (VSANet), which is a semantic segmentation method based on Deeplabv3+ and attention module, for improving semantic segmentation effect for autonomous driving road scene images. The improvement of this paper is primarily on the following two aspects. One is that this paper introduces the spatial attention module (SAM) after the atrous convolution, which effectively obtains more spatial context information. Second, by studying the road scene image, it is found that there are considerable differences in the pixel-level distribution of the horizontal segmentation area in the image. For this reason, this paper introduces the vertical attention module (VAM), which can better segment the road scene image. A large number of experimental results indicate that the segmentation accuracy of the proposed model is improved by 1.94% compared with the Deeplabv3+ network model on the test dataset of Cityscapes dataset.","PeriodicalId":240400,"journal":{"name":"2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)","volume":"142 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-06-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE 4th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IMCEC51613.2021.9482207","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
In this paper, we propose vertical attention and spatial attention network (VSANet), which is a semantic segmentation method based on Deeplabv3+ and attention module, for improving semantic segmentation effect for autonomous driving road scene images. The improvement of this paper is primarily on the following two aspects. One is that this paper introduces the spatial attention module (SAM) after the atrous convolution, which effectively obtains more spatial context information. Second, by studying the road scene image, it is found that there are considerable differences in the pixel-level distribution of the horizontal segmentation area in the image. For this reason, this paper introduces the vertical attention module (VAM), which can better segment the road scene image. A large number of experimental results indicate that the segmentation accuracy of the proposed model is improved by 1.94% compared with the Deeplabv3+ network model on the test dataset of Cityscapes dataset.