CABDet: context-and-attention-based detector for small object detection in remote sensing images
Authors: Mingzhi Zhang, Xiaohai He, Qizhi Teng, Tong Niu, Honggang Chen
Journal of Applied Remote Sensing, article 044515, published 2023-10-01 (journal article)
DOI: https://doi.org/10.1117/1.JRS.17.044515
Journal impact factor: 1.4; JCR: Q4 (Environmental Sciences); category: Earth Sciences
Abstract. Detecting small objects in remote sensing images is a challenging task. Existing object detectors for remote sensing images suffer from two issues: (1) insufficient feature extraction for small objects in the backbone network and (2) feature misalignment and information loss for small objects in the neck network. Both lead to poor detection performance on small objects. To address these challenges, CABDet, a small object detector for remote sensing images that combines context and attention mechanisms, is proposed. Specifically, an enhanced ResNet50 is designed as a backbone network that adaptively adjusts the size of its receptive fields to extract the feature information of small objects more fully. Additionally, an adaptive multiscale feature pyramid network (AM-FPN) is proposed. To alleviate feature misalignment for small objects, AM-FPN uses self-attention to establish semantic and spatial dependencies between adjacent feature layers. To mitigate information loss for small objects, AM-FPN uses self-attention to capture semantic dependencies between subregions of the current layer's features, which preserves channel information. Extensive experiments were conducted on two demanding remote sensing datasets, the Dataset for Object Detection in Aerial Images (DOTA) and the UCAS high-resolution aerial object detection dataset (UCAS-AOD), to demonstrate that the proposed method achieves superior detection performance compared with contemporary state-of-the-art approaches.
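The abstract does not give the AM-FPN equations, so the following is only a rough illustration of the general idea of attention between adjacent feature layers, not the authors' module. It assumes plain scaled dot-product attention with identity projections (no learned weights), feature maps flattened to lists of per-position channel vectors, and a residual sum for fusion; the function names `softmax` and `cross_level_attention` are hypothetical.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of scores."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def cross_level_attention(fine, coarse):
    """Fuse a finer pyramid level with its coarser neighbour (illustrative only).

    fine:   list of N_f feature vectors (one per spatial position), each of length C
    coarse: list of N_c feature vectors from the adjacent, coarser level, length C
    Each fine position queries every coarse position, so no fixed upsampling grid
    decides which coarse feature it receives.
    """
    channels = len(fine[0])
    scale = math.sqrt(channels)
    fused = []
    for query in fine:
        # Similarity between this fine position and every coarse position.
        scores = [sum(q * k for q, k in zip(query, key)) / scale for key in coarse]
        weights = softmax(scores)
        # Attention-weighted mix of coarse features.
        context = [
            sum(w * value[c] for w, value in zip(weights, coarse))
            for c in range(channels)
        ]
        # Residual fusion (an assumption, chosen for simplicity).
        fused.append([q + ctx for q, ctx in zip(query, context)])
    return fused


if __name__ == "__main__":
    fine_level = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.0, 0.0]]
    coarse_level = [[2.0, 0.0], [0.0, 4.0]]
    for vector in cross_level_attention(fine_level, coarse_level):
        print(vector)
```

In a conventional feature pyramid the coarse map is upsampled and added position by position. Here the attention weights decide which coarse positions contribute to each fine position, which is one way to read "establishing dependencies between adjacent feature layers". A real implementation would add learned query, key, and value projections and operate on tensors.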
About the journal:
The Journal of Applied Remote Sensing is a peer-reviewed journal that optimizes the communication of concepts, information, and progress among the remote sensing community.