基于改进变压器和注意监督融合的水下目标检测

IF 2 4区 计算机科学 Q3 AUTOMATION & CONTROL SYSTEMS
Zhi Li, Chaofeng Li, Tuxin Guan, Shaopeng Shang
{"title":"基于改进变压器和注意监督融合的水下目标检测","authors":"Zhi Li, Chaofeng Li, Tuxin Guan, Shaopeng Shang","doi":"10.5755/j01.itc.52.2.33214","DOIUrl":null,"url":null,"abstract":"Underwater object detection is one of the important technologies for improving the efficiency of underwater inspection, but the existing methods still suffer from the problems of missed detection and insufficient target localization capability of targets. To address these problems, an improved Transformer and multi-scale attentional supervised feature fusion-based underwater object detection method is proposed. In our method, the underwater objects are preprocessed by prior knowledge first. Then, a new coordinate decomposition window-based (CDW) Transformer block is proposed to extract spatial location information more accurately, and scaling factors are introduced to reduce the intermediate computation. Finally, an attentional supervised fusion (ASF) method is proposed to strengthen the link between feature extraction and feature fusion, and further improve the detected performance by using compound attention weights. The cascade detection head is improved, where the information flow is reversed to enhance the prediction of coordinates. The average accuracy of the proposed method on the URPC and DUO datasets is 3.7% and 3.8% higher than that of the baseline network through the cross-test, and outperforms the state-of-the-art methods. This study can provide a reference for engineering applications such as automated marine operations and biodetected fishing techniques.","PeriodicalId":54982,"journal":{"name":"Information Technology and Control","volume":"34 1","pages":"397-415"},"PeriodicalIF":2.0000,"publicationDate":"2023-07-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Underwater Object Detection Based on Improved Transformer and Attentional Supervised Fusion\",\"authors\":\"Zhi Li, Chaofeng Li, Tuxin Guan, Shaopeng Shang\",\"doi\":\"10.5755/j01.itc.52.2.33214\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Underwater object detection is one of the important technologies for improving the efficiency of underwater inspection, but the existing methods still suffer from the problems of missed detection and insufficient target localization capability of targets. To address these problems, an improved Transformer and multi-scale attentional supervised feature fusion-based underwater object detection method is proposed. In our method, the underwater objects are preprocessed by prior knowledge first. Then, a new coordinate decomposition window-based (CDW) Transformer block is proposed to extract spatial location information more accurately, and scaling factors are introduced to reduce the intermediate computation. Finally, an attentional supervised fusion (ASF) method is proposed to strengthen the link between feature extraction and feature fusion, and further improve the detected performance by using compound attention weights. The cascade detection head is improved, where the information flow is reversed to enhance the prediction of coordinates. The average accuracy of the proposed method on the URPC and DUO datasets is 3.7% and 3.8% higher than that of the baseline network through the cross-test, and outperforms the state-of-the-art methods. This study can provide a reference for engineering applications such as automated marine operations and biodetected fishing techniques.\",\"PeriodicalId\":54982,\"journal\":{\"name\":\"Information Technology and Control\",\"volume\":\"34 1\",\"pages\":\"397-415\"},\"PeriodicalIF\":2.0000,\"publicationDate\":\"2023-07-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Information Technology and Control\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://doi.org/10.5755/j01.itc.52.2.33214\",\"RegionNum\":4,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Information Technology and Control","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.5755/j01.itc.52.2.33214","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
引用次数: 0

摘要

水下目标检测是提高水下检测效率的重要技术之一,但现有方法仍然存在漏检和目标定位能力不足的问题。针对这些问题,提出了一种改进的基于Transformer和多尺度注意监督特征融合的水下目标检测方法。该方法首先利用先验知识对水下目标进行预处理。在此基础上,提出了一种新的基于坐标分解窗口(CDW)的Transformer块来更准确地提取空间位置信息,并引入比例因子来减少中间计算量。最后,提出了一种注意监督融合(attention supervised fusion, ASF)方法,加强特征提取和特征融合之间的联系,并利用复合注意权值进一步提高检测性能。改进了级联检测头,将信息流反向,增强了对坐标的预测。交叉检验表明,该方法在URPC和DUO数据集上的平均准确率分别比基线网络高3.7%和3.8%,优于现有方法。该研究可为海洋自动化作业和生物探测捕鱼技术等工程应用提供参考。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Underwater Object Detection Based on Improved Transformer and Attentional Supervised Fusion
Underwater object detection is one of the important technologies for improving the efficiency of underwater inspection, but the existing methods still suffer from the problems of missed detection and insufficient target localization capability of targets. To address these problems, an improved Transformer and multi-scale attentional supervised feature fusion-based underwater object detection method is proposed. In our method, the underwater objects are preprocessed by prior knowledge first. Then, a new coordinate decomposition window-based (CDW) Transformer block is proposed to extract spatial location information more accurately, and scaling factors are introduced to reduce the intermediate computation. Finally, an attentional supervised fusion (ASF) method is proposed to strengthen the link between feature extraction and feature fusion, and further improve the detected performance by using compound attention weights. The cascade detection head is improved, where the information flow is reversed to enhance the prediction of coordinates. The average accuracy of the proposed method on the URPC and DUO datasets is 3.7% and 3.8% higher than that of the baseline network through the cross-test, and outperforms the state-of-the-art methods. This study can provide a reference for engineering applications such as automated marine operations and biodetected fishing techniques.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Information Technology and Control
Information Technology and Control 工程技术-计算机:人工智能
CiteScore
2.70
自引率
9.10%
发文量
36
审稿时长
12 months
期刊介绍: Periodical journal covers a wide field of computer science and control systems related problems including: -Software and hardware engineering; -Management systems engineering; -Information systems and databases; -Embedded systems; -Physical systems modelling and application; -Computer networks and cloud computing; -Data visualization; -Human-computer interface; -Computer graphics, visual analytics, and multimedia systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信