{"title":"Improved Lightweight DeepLabv3+ Algorithm Based on Attention Mechanism","authors":"Lin Wu, J. Xiao, Zhe Zhang","doi":"10.1109/icaci55529.2022.9837577","DOIUrl":null,"url":null,"abstract":"DeepLabv3+ has a wide range of applications in autonomous driving, geographic information systems, etc. However, its deployment on the mobile terminal faces a trade-off between model size and accuracy. Consecutive downsampling operations also result in a great loss of detail information. To solve these problems, this paper proposes an improved algorithm based on DeepLabv3+. Firstly, backbone is replaced by MobileNetv2 to reduce the size of the model; Secondly, the improved Atrous Spatial Pyramid Pooling module is proposed to augment the segmentation result while reducing the parameters. The performance is further ameliorated by applying attention mechanism; Finally, through refining decoder module, the proposed network makes up for lost detail information. Experiment shows that the algorithm achieves an mIoU of 73.31% on the validation set of the PASCAL VOC2012 dataset. Compared with typical algorithms, proposed algorithm has a better effect on trade-off between model size and accuracy.","PeriodicalId":412347,"journal":{"name":"2022 14th International Conference on Advanced Computational Intelligence (ICACI)","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 14th International Conference on Advanced Computational Intelligence (ICACI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/icaci55529.2022.9837577","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
DeepLabv3+ has a wide range of applications in autonomous driving, geographic information systems, etc. However, its deployment on the mobile terminal faces a trade-off between model size and accuracy. Consecutive downsampling operations also result in a great loss of detail information. To solve these problems, this paper proposes an improved algorithm based on DeepLabv3+. Firstly, backbone is replaced by MobileNetv2 to reduce the size of the model; Secondly, the improved Atrous Spatial Pyramid Pooling module is proposed to augment the segmentation result while reducing the parameters. The performance is further ameliorated by applying attention mechanism; Finally, through refining decoder module, the proposed network makes up for lost detail information. Experiment shows that the algorithm achieves an mIoU of 73.31% on the validation set of the PASCAL VOC2012 dataset. Compared with typical algorithms, proposed algorithm has a better effect on trade-off between model size and accuracy.