{"title":"基于自适应局部上下文的动态卷积遥感目标检测","authors":"Ruyi Feng;Zhixin Zhao;Tao Zhao;Lizhe Wang","doi":"10.1109/LGRS.2025.3602896","DOIUrl":null,"url":null,"abstract":"Remote sensing image target detection plays a pivotal role in Earth observation, offering substantial value for applications such as urban planning and environmental monitoring. Due to the significant scale variations among targets, complex backgrounds with dense small object distributions, and strong intertarget scene correlations, existing target detection methods usually fail to effectively model target relationships and contextual information for remote sensing imagery. To address these limitations, we proposed YOLO-ALS, a novel remote sensing target detection network that integrates adaptive local scene context. The proposed framework introduces three key points. First, a full-dimensional dynamic convolution reconstruction C2f module enhances target feature representation by overcoming local context extraction limitations and target co-occurrence prior deficiencies. Second, an adaptive local scene context module (ALSCM) dynamically integrates multiscale receptive field features through spatial attention, enabling background window adaptive selection and cross-scale feature alignment. Finally, a co-occurrence matrix-integrated classification auxiliary module mines target association rules through data-driven learning, correcting classification probabilities in low-confidence areas by combining high-confidence areas’ co-occurrence information with an optimal threshold, which can significantly reduce missed detection rates. Comprehensive experiments on multiple public remote sensing datasets demonstrate the superiority of the proposed method through extensive ablation studies and comparative analyses. The proposed method has achieved state-of-the-art performance while addressing the unique challenges of remote sensing target detection.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":4.4000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"YOLO-ALS: Dynamic Convolution With Adaptive Local Context for Remote Sensing Target Detection\",\"authors\":\"Ruyi Feng;Zhixin Zhao;Tao Zhao;Lizhe Wang\",\"doi\":\"10.1109/LGRS.2025.3602896\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Remote sensing image target detection plays a pivotal role in Earth observation, offering substantial value for applications such as urban planning and environmental monitoring. Due to the significant scale variations among targets, complex backgrounds with dense small object distributions, and strong intertarget scene correlations, existing target detection methods usually fail to effectively model target relationships and contextual information for remote sensing imagery. To address these limitations, we proposed YOLO-ALS, a novel remote sensing target detection network that integrates adaptive local scene context. The proposed framework introduces three key points. First, a full-dimensional dynamic convolution reconstruction C2f module enhances target feature representation by overcoming local context extraction limitations and target co-occurrence prior deficiencies. Second, an adaptive local scene context module (ALSCM) dynamically integrates multiscale receptive field features through spatial attention, enabling background window adaptive selection and cross-scale feature alignment. Finally, a co-occurrence matrix-integrated classification auxiliary module mines target association rules through data-driven learning, correcting classification probabilities in low-confidence areas by combining high-confidence areas’ co-occurrence information with an optimal threshold, which can significantly reduce missed detection rates. Comprehensive experiments on multiple public remote sensing datasets demonstrate the superiority of the proposed method through extensive ablation studies and comparative analyses. The proposed method has achieved state-of-the-art performance while addressing the unique challenges of remote sensing target detection.\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":4.4000,\"publicationDate\":\"2025-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11141785/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11141785/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
YOLO-ALS: Dynamic Convolution With Adaptive Local Context for Remote Sensing Target Detection
Remote sensing image target detection plays a pivotal role in Earth observation, offering substantial value for applications such as urban planning and environmental monitoring. Due to the significant scale variations among targets, complex backgrounds with dense small object distributions, and strong intertarget scene correlations, existing target detection methods usually fail to effectively model target relationships and contextual information for remote sensing imagery. To address these limitations, we proposed YOLO-ALS, a novel remote sensing target detection network that integrates adaptive local scene context. The proposed framework introduces three key points. First, a full-dimensional dynamic convolution reconstruction C2f module enhances target feature representation by overcoming local context extraction limitations and target co-occurrence prior deficiencies. Second, an adaptive local scene context module (ALSCM) dynamically integrates multiscale receptive field features through spatial attention, enabling background window adaptive selection and cross-scale feature alignment. Finally, a co-occurrence matrix-integrated classification auxiliary module mines target association rules through data-driven learning, correcting classification probabilities in low-confidence areas by combining high-confidence areas’ co-occurrence information with an optimal threshold, which can significantly reduce missed detection rates. Comprehensive experiments on multiple public remote sensing datasets demonstrate the superiority of the proposed method through extensive ablation studies and comparative analyses. The proposed method has achieved state-of-the-art performance while addressing the unique challenges of remote sensing target detection.