{"title":"卷积块注意模块Unet在语义分割任务中的应用研究","authors":"Xiaotong Fang","doi":"10.1145/3577148.3577151","DOIUrl":null,"url":null,"abstract":"Attention mechanism can focus on important features and suppress unnecessary features, to increase the representational power of the network, which plays an important role in visual tasks such as semantic segmentation. To this end, the UNET of Convolutional Block Attention Module (CBAM) is applied in semantic segmentation tasks to improve the performance of convolutional neural networks. The channel attention adopts maximum pooling and average pooling, and for the spatial attention, a smaller convolution kernel is proposed to reduce computation and loss of important features. Through experiments, the introduction of CBAM has a improvement in semantic segmentation tasks increasing the Validation Dice from 0.98 to 0.9871 in the Kaggle Carvana image segmentation dataset.","PeriodicalId":107500,"journal":{"name":"Proceedings of the 2022 5th International Conference on Sensors, Signal and Image Processing","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-10-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Research on the Application of Unet with Convolutional Block Attention Module to Semantic Segmentation Task\",\"authors\":\"Xiaotong Fang\",\"doi\":\"10.1145/3577148.3577151\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Attention mechanism can focus on important features and suppress unnecessary features, to increase the representational power of the network, which plays an important role in visual tasks such as semantic segmentation. To this end, the UNET of Convolutional Block Attention Module (CBAM) is applied in semantic segmentation tasks to improve the performance of convolutional neural networks. The channel attention adopts maximum pooling and average pooling, and for the spatial attention, a smaller convolution kernel is proposed to reduce computation and loss of important features. Through experiments, the introduction of CBAM has a improvement in semantic segmentation tasks increasing the Validation Dice from 0.98 to 0.9871 in the Kaggle Carvana image segmentation dataset.\",\"PeriodicalId\":107500,\"journal\":{\"name\":\"Proceedings of the 2022 5th International Conference on Sensors, Signal and Image Processing\",\"volume\":\"20 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-10-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2022 5th International Conference on Sensors, Signal and Image Processing\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3577148.3577151\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2022 5th International Conference on Sensors, Signal and Image Processing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3577148.3577151","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Research on the Application of Unet with Convolutional Block Attention Module to Semantic Segmentation Task
Attention mechanism can focus on important features and suppress unnecessary features, to increase the representational power of the network, which plays an important role in visual tasks such as semantic segmentation. To this end, the UNET of Convolutional Block Attention Module (CBAM) is applied in semantic segmentation tasks to improve the performance of convolutional neural networks. The channel attention adopts maximum pooling and average pooling, and for the spatial attention, a smaller convolution kernel is proposed to reduce computation and loss of important features. Through experiments, the introduction of CBAM has a improvement in semantic segmentation tasks increasing the Validation Dice from 0.98 to 0.9871 in the Kaggle Carvana image segmentation dataset.