{"title":"EMS-Net: Efficient Multiscale Perceptual Enhancement Tiny Object Detector for Remote Sensing Images","authors":"Pinwei Chen;Wentao Lyu;Qing Guo;Zhijiang Deng;Weiqiang Xu","doi":"10.1109/LGRS.2025.3565583","DOIUrl":null,"url":null,"abstract":"Detecting tiny objects in remote sensing images has always been a challenging and intensive research area. This problem has not been well-solved due to the fact that object detection (OD) in remote sensing images is characterized by large-scale variations and complex backgrounds. On this basis, we propose the efficient multi-scale semantic-aware network (EMS-Net) constructed based on YOLOv8s for tiny OD network in remote sensing images. First, a new module multibranch context aggregation (MCA) is proposed to improve deep feature extraction and deep feature fusion of the model. In addition, we use our self-designed multiscale feature communication module (MFCM) aimed at reducing the loss of semantic information of object and mitigating the obstruction of foreground object by complex background. Finally, Wise IoU-Normalized Wasserstein distance (WIoU-NWD) is used as the bounding box regression loss to adapt the model to different object scale while improving the ability to localize tiny object. Comprehensive experiments on three popular datasets demonstrate that our method outperforms existing detectors, particularly in detecting tiny objects. Specifically, our approach achieves the mean average precision (mAP) of 77.2% on the DIOR dataset, 96.7% on the Remote Sensing Object Detection (RSOD) dataset, and 75.1% on the DOTA-v1.5 dataset.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10980106/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Detecting tiny objects in remote sensing images has always been a challenging and intensive research area. This problem has not been well-solved due to the fact that object detection (OD) in remote sensing images is characterized by large-scale variations and complex backgrounds. On this basis, we propose the efficient multi-scale semantic-aware network (EMS-Net) constructed based on YOLOv8s for tiny OD network in remote sensing images. First, a new module multibranch context aggregation (MCA) is proposed to improve deep feature extraction and deep feature fusion of the model. In addition, we use our self-designed multiscale feature communication module (MFCM) aimed at reducing the loss of semantic information of object and mitigating the obstruction of foreground object by complex background. Finally, Wise IoU-Normalized Wasserstein distance (WIoU-NWD) is used as the bounding box regression loss to adapt the model to different object scale while improving the ability to localize tiny object. Comprehensive experiments on three popular datasets demonstrate that our method outperforms existing detectors, particularly in detecting tiny objects. Specifically, our approach achieves the mean average precision (mAP) of 77.2% on the DIOR dataset, 96.7% on the Remote Sensing Object Detection (RSOD) dataset, and 75.1% on the DOTA-v1.5 dataset.