{"title":"Underwater Sonar Image Targets Detection Based on Improved RT-DETR","authors":"Ang Li;Raseeda Hamzah;Yousheng Gao","doi":"10.1109/LGRS.2025.3560769","DOIUrl":null,"url":null,"abstract":"Underwater sonar imagery is characterized by small target sizes and low resolution, which can result in detection failures or false positives. To counteract these challenges, we introduce the underwater sonar detection transformer (US-DETR), an underwater sonar object detection model derived from the real-time detection transformer (RT-DETR) framework, incorporating attention-based feature fusion. US-DETR includes a novel enhanced feature interaction (EFI) module, which enhances the feature extraction network’s ability to perceive global information of the detected target. In addition, we propose a novel nonlocal attention feature fusion (NAFF) module to heighten the network’s sensitivity to the spatial relationships between feature channels across different scales, thereby enhancing its channel position and global information awareness. Experiments are conducted on a benchmark underwater sonar image dataset. Experimental results show that compared with RT-DETR, US-DETR achieves a 2.2% higher mean average precision (mAP) and a 2.1% higher <inline-formula> <tex-math>$F1$ </tex-math></inline-formula> score compared with RT-DETR. The model also strikes an effective balance between detection speed and accuracy, achieving real-time performance of 126 FPS, which can meet the real-time requirements in industrial production.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2025-04-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10965724/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Underwater sonar imagery is characterized by small target sizes and low resolution, which can result in detection failures or false positives. To counteract these challenges, we introduce the underwater sonar detection transformer (US-DETR), an underwater sonar object detection model derived from the real-time detection transformer (RT-DETR) framework, incorporating attention-based feature fusion. US-DETR includes a novel enhanced feature interaction (EFI) module, which enhances the feature extraction network’s ability to perceive global information of the detected target. In addition, we propose a novel nonlocal attention feature fusion (NAFF) module to heighten the network’s sensitivity to the spatial relationships between feature channels across different scales, thereby enhancing its channel position and global information awareness. Experiments are conducted on a benchmark underwater sonar image dataset. Experimental results show that compared with RT-DETR, US-DETR achieves a 2.2% higher mean average precision (mAP) and a 2.1% higher $F1$ score compared with RT-DETR. The model also strikes an effective balance between detection speed and accuracy, achieving real-time performance of 126 FPS, which can meet the real-time requirements in industrial production.