LS-DETR:用于前视声纳图像中目标检测的轻型变压器

Junzhe Wang;Xinke Chen;Anbang Dai;Yan Liu;Guanying Huo
{"title":"LS-DETR:用于前视声纳图像中目标检测的轻型变压器","authors":"Junzhe Wang;Xinke Chen;Anbang Dai;Yan Liu;Guanying Huo","doi":"10.1109/LGRS.2025.3575615","DOIUrl":null,"url":null,"abstract":"A transformer-based end-to-end detection method lightweight sonar detection transformer (LS-DETR) is proposed, which is specifically tailored for enhancing detection accuracy in forward-looking sonar images while significantly reducing the computational load. Despite the challenges posed by the complexity of underwater environments that have led to suboptimal detection performance and the lack of lightweight optimization for underwater devices, LS-DETR addresses these issues effectively. In LS-DETR, the backbone employs a newly proposed lightweight-gated attention block (LGABlock), which reduces computational redundancy through low-complexity convolutions and gated attention. A lightweight hybrid encoder (LHE) is designed to facilitate scale-internal feature interaction and optimize the feature fusion approach. Furthermore, wise complete IoU (WCIoU)-aware query selection is proposed and integrated with NWDLoss in the decoder, enabling the scores to integrate classification and positional information while focusing on the small targets. Results demonstrate that on the multibeam forward-looking sonar dataset UATD, LS-DETR achieved a 2.8% increase in accuracy and a 31.5% reduction in parameter count, proving the effectiveness and superiority.","PeriodicalId":91017,"journal":{"name":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","volume":"22 ","pages":"1-5"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"LS-DETR: Lightweight Transformer for Object Detection in Forward-Looking Sonar Images\",\"authors\":\"Junzhe Wang;Xinke Chen;Anbang Dai;Yan Liu;Guanying Huo\",\"doi\":\"10.1109/LGRS.2025.3575615\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A transformer-based end-to-end detection method lightweight sonar detection transformer (LS-DETR) is proposed, which is specifically tailored for enhancing detection accuracy in forward-looking sonar images while significantly reducing the computational load. Despite the challenges posed by the complexity of underwater environments that have led to suboptimal detection performance and the lack of lightweight optimization for underwater devices, LS-DETR addresses these issues effectively. In LS-DETR, the backbone employs a newly proposed lightweight-gated attention block (LGABlock), which reduces computational redundancy through low-complexity convolutions and gated attention. A lightweight hybrid encoder (LHE) is designed to facilitate scale-internal feature interaction and optimize the feature fusion approach. Furthermore, wise complete IoU (WCIoU)-aware query selection is proposed and integrated with NWDLoss in the decoder, enabling the scores to integrate classification and positional information while focusing on the small targets. Results demonstrate that on the multibeam forward-looking sonar dataset UATD, LS-DETR achieved a 2.8% increase in accuracy and a 31.5% reduction in parameter count, proving the effectiveness and superiority.\",\"PeriodicalId\":91017,\"journal\":{\"name\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"volume\":\"22 \",\"pages\":\"1-5\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-06-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/11020690/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE geoscience and remote sensing letters : a publication of the IEEE Geoscience and Remote Sensing Society","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/11020690/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

提出了一种基于变压器的端到端检测方法——轻型声纳检测变压器(LS-DETR),该方法是专门为提高前视声纳图像的检测精度而量身定制的,同时显著降低了计算量。尽管水下环境的复杂性带来了挑战,导致水下设备的检测性能不理想,并且缺乏轻量级优化,但LS-DETR有效地解决了这些问题。在LS-DETR中,主干采用了一种新提出的轻量级门控注意块(LGABlock),通过低复杂度卷积和门控注意减少了计算冗余。设计了一种轻型混合编码器(LHE),以促进尺度-内部特征交互,并优化特征融合方法。在此基础上,提出了WCIoU感知查询选择方法,并将其与解码器中的NWDLoss相结合,使分数能够在关注小目标的同时,整合分类和位置信息。结果表明,在多波束前视声呐数据集UATD上,LS-DETR的精度提高了2.8%,参数数量减少了31.5%,证明了LS-DETR的有效性和优越性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
LS-DETR: Lightweight Transformer for Object Detection in Forward-Looking Sonar Images
A transformer-based end-to-end detection method lightweight sonar detection transformer (LS-DETR) is proposed, which is specifically tailored for enhancing detection accuracy in forward-looking sonar images while significantly reducing the computational load. Despite the challenges posed by the complexity of underwater environments that have led to suboptimal detection performance and the lack of lightweight optimization for underwater devices, LS-DETR addresses these issues effectively. In LS-DETR, the backbone employs a newly proposed lightweight-gated attention block (LGABlock), which reduces computational redundancy through low-complexity convolutions and gated attention. A lightweight hybrid encoder (LHE) is designed to facilitate scale-internal feature interaction and optimize the feature fusion approach. Furthermore, wise complete IoU (WCIoU)-aware query selection is proposed and integrated with NWDLoss in the decoder, enabling the scores to integrate classification and positional information while focusing on the small targets. Results demonstrate that on the multibeam forward-looking sonar dataset UATD, LS-DETR achieved a 2.8% increase in accuracy and a 31.5% reduction in parameter count, proving the effectiveness and superiority.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信