Junzhe Wang;Xinke Chen;Anbang Dai;Yan Liu;Guanying Huo
{"title":"LS-DETR:用于前视声纳图像中目标检测的轻型变压器","authors":"Junzhe Wang;Xinke Chen;Anbang Dai;Yan Liu;Guanying Huo","doi":"10.1109/LGRS.2025.3575615","DOIUrl":null,"url":null,"abstract":"A transformer-based end-to-end detection method lightweight sonar detection transformer (LS-DETR) is proposed, which is specifically tailored for enhancing detection accuracy in forward-looking sonar images while significantly reducing the computational load. Despite the challenges posed by the complexity of underwater environments that have led to suboptimal detection performance and the lack of lightweight optimization for underwater devices, LS-DETR addresses these issues effectively. In LS-DETR, the backbone employs a newly proposed lightweight-gated attention block (LGABlock), which reduces computational redundancy through low-complexity convolutions and gated attention. A lightweight hybrid encoder (LHE) is designed to facilitate scale-internal feature interaction and optimize the feature fusion approach. Furthermore, wise complete IoU (WCIoU)-aware query selection is proposed and integrated with NWDLoss in the decoder, enabling the scores to integrate classification and positional information while focusing on the small targets. Results demonstrate that on the multibeam forward-looking sonar dataset UATD, LS-DETR achieved a 2.8% increase in accuracy and a 31.5% reduction in parameter count, proving the effectiveness and superiority.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LS-DETR: Lightweight Transformer for Object Detection in Forward-Looking Sonar Images\",\"authors\":\"Junzhe Wang;Xinke Chen;Anbang Dai;Yan Liu;Guanying Huo\",\"doi\":\"10.1109/LGRS.2025.3575615\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A transformer-based end-to-end detection method lightweight sonar detection transformer (LS-DETR) is proposed, which is specifically tailored for enhancing detection accuracy in forward-looking sonar images while significantly reducing the computational load. Despite the challenges posed by the complexity of underwater environments that have led to suboptimal detection performance and the lack of lightweight optimization for underwater devices, LS-DETR addresses these issues effectively. In LS-DETR, the backbone employs a newly proposed lightweight-gated attention block (LGABlock), which reduces computational redundancy through low-complexity convolutions and gated attention. A lightweight hybrid encoder (LHE) is designed to facilitate scale-internal feature interaction and optimize the feature fusion approach. Furthermore, wise complete IoU (WCIoU)-aware query selection is proposed and integrated with NWDLoss in the decoder, enabling the scores to integrate classification and positional information while focusing on the small targets. Results demonstrate that on the multibeam forward-looking sonar dataset UATD, LS-DETR achieved a 2.8% increase in accuracy and a 31.5% reduction in parameter count, proving the effectiveness and superiority.\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11020690/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11020690/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
LS-DETR: Lightweight Transformer for Object Detection in Forward-Looking Sonar Images
A transformer-based end-to-end detection method lightweight sonar detection transformer (LS-DETR) is proposed, which is specifically tailored for enhancing detection accuracy in forward-looking sonar images while significantly reducing the computational load. Despite the challenges posed by the complexity of underwater environments that have led to suboptimal detection performance and the lack of lightweight optimization for underwater devices, LS-DETR addresses these issues effectively. In LS-DETR, the backbone employs a newly proposed lightweight-gated attention block (LGABlock), which reduces computational redundancy through low-complexity convolutions and gated attention. A lightweight hybrid encoder (LHE) is designed to facilitate scale-internal feature interaction and optimize the feature fusion approach. Furthermore, wise complete IoU (WCIoU)-aware query selection is proposed and integrated with NWDLoss in the decoder, enabling the scores to integrate classification and positional information while focusing on the small targets. Results demonstrate that on the multibeam forward-looking sonar dataset UATD, LS-DETR achieved a 2.8% increase in accuracy and a 31.5% reduction in parameter count, proving the effectiveness and superiority.