Rui Li, Xiaodan Wang, Jian Wang, Yafei Song, Lei Lei
{"title":"SAR Target Recognition Based on Efficient Fully Convolutional Attention Block CNN","authors":"Rui Li, Xiaodan Wang, Jian Wang, Yafei Song, Lei Lei","doi":"10.1109/LGRS.2020.3037256","DOIUrl":null,"url":null,"abstract":"Attention mechanisms have recently shown strong potential in improving the performance of convolutional neural networks (CNNs). This letter proposes a fully convolutional attention block (FCAB) that can be combined with a CNN to refine important features and suppress unnecessary ones in synthetic aperture radar (SAR) images. The FCAB consists of a channel attention module and a spatial attention module. For the channel attention module, we use average-pooling and max-pooling to learn complementary features, and apply group convolution to aggregate the information of the two types of channels. Global average-pooling is then used to encode the channel-wise importance. For the spatial attention module, the average-pooling and max-pooling along the channel axis are used to generate two spatial feature maps, and then two very lightweight convolutional layers are used to encode the spatial weight map. Experimental results on SAR images demonstrate that our FCAB can focus on important channels and object regions. It uses relatively few parameters and is computationally efficient, while bringing about significant performance gain for SAR recognition.","PeriodicalId":13046,"journal":{"name":"IEEE Geoscience and Remote Sensing Letters","volume":"19 1","pages":"1-5"},"PeriodicalIF":4.0000,"publicationDate":"2022-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1109/LGRS.2020.3037256","citationCount":"15","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Geoscience and Remote Sensing Letters","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1109/LGRS.2020.3037256","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 15
Abstract
Attention mechanisms have recently shown strong potential in improving the performance of convolutional neural networks (CNNs). This letter proposes a fully convolutional attention block (FCAB) that can be combined with a CNN to refine important features and suppress unnecessary ones in synthetic aperture radar (SAR) images. The FCAB consists of a channel attention module and a spatial attention module. For the channel attention module, we use average-pooling and max-pooling to learn complementary features, and apply group convolution to aggregate the information of the two types of channels. Global average-pooling is then used to encode the channel-wise importance. For the spatial attention module, the average-pooling and max-pooling along the channel axis are used to generate two spatial feature maps, and then two very lightweight convolutional layers are used to encode the spatial weight map. Experimental results on SAR images demonstrate that our FCAB can focus on important channels and object regions. It uses relatively few parameters and is computationally efficient, while bringing about significant performance gain for SAR recognition.
期刊介绍:
IEEE Geoscience and Remote Sensing Letters (GRSL) is a monthly publication for short papers (maximum length 5 pages) addressing new ideas and formative concepts in remote sensing as well as important new and timely results and concepts. Papers should relate to the theory, concepts and techniques of science and engineering as applied to sensing the earth, oceans, atmosphere, and space, and the processing, interpretation, and dissemination of this information. The technical content of papers must be both new and significant. Experimental data must be complete and include sufficient description of experimental apparatus, methods, and relevant experimental conditions. GRSL encourages the incorporation of "extended objects" or "multimedia" such as animations to enhance the shorter papers.