{"title":"A multiscale and cross-level feature fusion method for remote sensing image target detection","authors":"Shan Wenchao, Yang Shuwen, Li Yikun, Kou Ruixiong","doi":"10.1016/j.asr.2025.03.044","DOIUrl":null,"url":null,"abstract":"<div><div>Multiscale feature extraction and multilevel feature fusion are essential in the multiscale target detection of remote sensing images under complex backgrounds. However, mainstream feature extraction networks exhibit limited adaptability in their receptive fields and cannot accurately detect small targets with sparse features and large targets with blurred boundaries. Additionally, the feature fusion networks cannot effectively assess the importance of feature maps when integrating different depths of feature maps, resulting in an inaccurate representation of target area features. For this reason, we proposes a multiscale and cross-level feature fusion method for remote sensing image target detection. First, we construct a Multiscale Feature Extraction Network (MSFENet) to achieve comprehensive extraction of multiscale target features. We design a Scale-Adaptive Feature Extractor (SAFE) that encourages the model to realize multilevel synergistic cooperation within the sensing field, enabling accurate capture and efficient utilization of target feature information across different scales, thereby significantly enhancing the network’s capability to extract multiscale features. Additionally, in the deeper layers of the extraction network, we introduce a Cascading Context-Aware Module (CCAM) to enhance further the network’s ability to acquire deeper semantic richness while mitigating interference from background information. To fully exploit the latent information in feature maps at all levels of the backbone network, this study proposes a novel Cross-Level Feature Fusion Network (CLFFNet). This network incorporates a Sparse Feature Extraction Block (SFEB) to improve feature extraction quality. Through a carefully designed feature-adaptive cross-level interaction mechanism, the SFEB dynamically integrates the rich and critical detailed information with deep semantic information of the backbone network, significantly enhancing the integrity and discriminability of target feature information. The proposed module effectively improves the detection performance of small targets. The experimental results confirm the effectiveness and generalizability of the proposed method. Compared to existing target detection models, the proposed method achieves mAP values of 77.6 % and 90.1 % on the DIOR and HRRSD datasets, significantly outperforming the competitive models.</div></div>","PeriodicalId":50850,"journal":{"name":"Advances in Space Research","volume":"75 11","pages":"Pages 7944-7959"},"PeriodicalIF":2.8000,"publicationDate":"2025-03-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advances in Space Research","FirstCategoryId":"89","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0273117725002820","RegionNum":3,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ASTRONOMY & ASTROPHYSICS","Score":null,"Total":0}
引用次数: 0
Abstract
Multiscale feature extraction and multilevel feature fusion are essential in the multiscale target detection of remote sensing images under complex backgrounds. However, mainstream feature extraction networks exhibit limited adaptability in their receptive fields and cannot accurately detect small targets with sparse features and large targets with blurred boundaries. Additionally, the feature fusion networks cannot effectively assess the importance of feature maps when integrating different depths of feature maps, resulting in an inaccurate representation of target area features. For this reason, we proposes a multiscale and cross-level feature fusion method for remote sensing image target detection. First, we construct a Multiscale Feature Extraction Network (MSFENet) to achieve comprehensive extraction of multiscale target features. We design a Scale-Adaptive Feature Extractor (SAFE) that encourages the model to realize multilevel synergistic cooperation within the sensing field, enabling accurate capture and efficient utilization of target feature information across different scales, thereby significantly enhancing the network’s capability to extract multiscale features. Additionally, in the deeper layers of the extraction network, we introduce a Cascading Context-Aware Module (CCAM) to enhance further the network’s ability to acquire deeper semantic richness while mitigating interference from background information. To fully exploit the latent information in feature maps at all levels of the backbone network, this study proposes a novel Cross-Level Feature Fusion Network (CLFFNet). This network incorporates a Sparse Feature Extraction Block (SFEB) to improve feature extraction quality. Through a carefully designed feature-adaptive cross-level interaction mechanism, the SFEB dynamically integrates the rich and critical detailed information with deep semantic information of the backbone network, significantly enhancing the integrity and discriminability of target feature information. The proposed module effectively improves the detection performance of small targets. The experimental results confirm the effectiveness and generalizability of the proposed method. Compared to existing target detection models, the proposed method achieves mAP values of 77.6 % and 90.1 % on the DIOR and HRRSD datasets, significantly outperforming the competitive models.
期刊介绍:
The COSPAR publication Advances in Space Research (ASR) is an open journal covering all areas of space research including: space studies of the Earth''s surface, meteorology, climate, the Earth-Moon system, planets and small bodies of the solar system, upper atmospheres, ionospheres and magnetospheres of the Earth and planets including reference atmospheres, space plasmas in the solar system, astrophysics from space, materials sciences in space, fundamental physics in space, space debris, space weather, Earth observations of space phenomena, etc.
NB: Please note that manuscripts related to life sciences as related to space are no more accepted for submission to Advances in Space Research. Such manuscripts should now be submitted to the new COSPAR Journal Life Sciences in Space Research (LSSR).
All submissions are reviewed by two scientists in the field. COSPAR is an interdisciplinary scientific organization concerned with the progress of space research on an international scale. Operating under the rules of ICSU, COSPAR ignores political considerations and considers all questions solely from the scientific viewpoint.