Zetao Fei, Qinghao Guo, Yao Zhang, Yunfeng Hu, Kui Tang
{"title":"Multi-scale Network and Ghost Attention Head for Semantic Segmentation","authors":"Zetao Fei, Qinghao Guo, Yao Zhang, Yunfeng Hu, Kui Tang","doi":"10.1109/ICCET58756.2023.00037","DOIUrl":null,"url":null,"abstract":"In view of the existing problems in semantic segmentation, such as insufficient utilization of information at different scales, excessive computing costs for generating redundant information, and ignoring spatial information. In this end, we propose a multi-scale convolution and ghost attention head. The former is inserted into the encoder, which increases the receptive field of the network and makes the network more effective to obtain context information. At the same time, a Ghost Attention Head is designed in the process of decoding, which adopts cheap operation and separable attention to guide the network, so as to solve the problems such as computational cost and ignoring spatial information. Experimental results on CamVid show that the mIoU of the proposed module reaches 71.32%, which is 9.7% higher than that of the traditional semantic segmentation network FCN. Ablation experiments were also carried out to further verify the effectiveness of the added module.","PeriodicalId":170939,"journal":{"name":"2023 6th International Conference on Communication Engineering and Technology (ICCET)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 6th International Conference on Communication Engineering and Technology (ICCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCET58756.2023.00037","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In view of the existing problems in semantic segmentation, such as insufficient utilization of information at different scales, excessive computing costs for generating redundant information, and ignoring spatial information. In this end, we propose a multi-scale convolution and ghost attention head. The former is inserted into the encoder, which increases the receptive field of the network and makes the network more effective to obtain context information. At the same time, a Ghost Attention Head is designed in the process of decoding, which adopts cheap operation and separable attention to guide the network, so as to solve the problems such as computational cost and ignoring spatial information. Experimental results on CamVid show that the mIoU of the proposed module reaches 71.32%, which is 9.7% higher than that of the traditional semantic segmentation network FCN. Ablation experiments were also carried out to further verify the effectiveness of the added module.