Pas: a scale-invariant approach to maritime search and rescue object detection using preprocessing and attention scaling

IF 2.3 4区 计算机科学 Q3 ROBOTICS
Shibao Li, Chen Li, Zhaoyu Wang, Zekun Jia, Jinze Zhu, Xuerong Cui, Jianhang Liu
{"title":"Pas: a scale-invariant approach to maritime search and rescue object detection using preprocessing and attention scaling","authors":"Shibao Li, Chen Li, Zhaoyu Wang, Zekun Jia, Jinze Zhu, Xuerong Cui, Jianhang Liu","doi":"10.1007/s11370-024-00526-5","DOIUrl":null,"url":null,"abstract":"<p>Object detection is a primary means of unmanned aerial vehicle (UAV) maritime search and rescue. The problem of scale variation caused by UAV flight height changes, shooting angle changes, and giant waves seriously affects the detection performance. However, most work does not explicitly consider the effects of these factors. In this work, we propose an algorithm called Preprocessing and Attention Scaling, which explicitly considers the scale variation problem caused by height, angle changes, and giant waves for the first time and solves it through Preprocessing Scaling and Attention Scaling. The Preprocessing Scaling module scales and perspective changes the images according to each photograph’s recorded flight altitude and shooting angle and crops them to the appropriate size, significantly improving the detection accuracy and shortening the inference time. At the same time, the scale variation caused by the up and down of the object due to the vast swells cannot be solved by the Preprocessing Scaling module, so we designed the Attention Scaling module again to quickly capture the area that needs further scale change by fusing the horizontal attention and vertical attention, and then transform it to the appropriate scale by the affine transformation, further improving detection accuracy. We extensively tested PAS on the well-known SeaDronesSee-DET and the SeaDronesSee-DET v2 (S-ODv2) datasets, significantly improving the detection accuracy. In addition, we successfully tested our method on a height-angle transfer task, where we trained on some height-angle intervals and tested on different height-angle intervals, achieving good results.</p>","PeriodicalId":48813,"journal":{"name":"Intelligent Service Robotics","volume":"102 1","pages":""},"PeriodicalIF":2.3000,"publicationDate":"2024-03-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Intelligent Service Robotics","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11370-024-00526-5","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"ROBOTICS","Score":null,"Total":0}
引用次数: 0

Abstract

Object detection is a primary means of unmanned aerial vehicle (UAV) maritime search and rescue. The problem of scale variation caused by UAV flight height changes, shooting angle changes, and giant waves seriously affects the detection performance. However, most work does not explicitly consider the effects of these factors. In this work, we propose an algorithm called Preprocessing and Attention Scaling, which explicitly considers the scale variation problem caused by height, angle changes, and giant waves for the first time and solves it through Preprocessing Scaling and Attention Scaling. The Preprocessing Scaling module scales and perspective changes the images according to each photograph’s recorded flight altitude and shooting angle and crops them to the appropriate size, significantly improving the detection accuracy and shortening the inference time. At the same time, the scale variation caused by the up and down of the object due to the vast swells cannot be solved by the Preprocessing Scaling module, so we designed the Attention Scaling module again to quickly capture the area that needs further scale change by fusing the horizontal attention and vertical attention, and then transform it to the appropriate scale by the affine transformation, further improving detection accuracy. We extensively tested PAS on the well-known SeaDronesSee-DET and the SeaDronesSee-DET v2 (S-ODv2) datasets, significantly improving the detection accuracy. In addition, we successfully tested our method on a height-angle transfer task, where we trained on some height-angle intervals and tested on different height-angle intervals, achieving good results.

Abstract Image

Pas:利用预处理和注意力缩放进行海上搜救目标检测的尺度不变方法
物体探测是无人机(UAV)海上搜救的主要手段。无人机飞行高度变化、拍摄角度变化和巨浪造成的尺度变化问题严重影响了探测性能。然而,大多数工作并没有明确考虑这些因素的影响。在这项工作中,我们提出了一种名为 "预处理和注意力缩放 "的算法,首次明确考虑了高度、角度变化和巨浪引起的尺度变化问题,并通过 "预处理缩放 "和 "注意力缩放 "解决了这一问题。预处理缩放模块根据每张照片记录的飞行高度和拍摄角度对图像进行缩放和透视变化,并裁剪成合适的大小,从而显著提高了检测精度,缩短了推理时间。同时,预处理缩放模块无法解决物体因巨大的海浪而上下起伏造成的尺度变化,因此我们又设计了注意力缩放模块,通过融合水平注意力和垂直注意力,快速捕捉到需要进一步改变尺度的区域,然后通过仿射变换将其变换到合适的尺度,进一步提高了检测精度。我们在著名的 SeaDronesSee-DET 和 SeaDronesSee-DET v2(S-ODv2)数据集上对 PAS 进行了广泛测试,显著提高了检测精度。此外,我们还成功地在高度角转移任务中测试了我们的方法,即在某些高度角区间进行训练,在不同高度角区间进行测试,取得了良好的效果。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.70
自引率
4.00%
发文量
46
期刊介绍: The journal directs special attention to the emerging significance of integrating robotics with information technology and cognitive science (such as ubiquitous and adaptive computing,information integration in a distributed environment, and cognitive modelling for human-robot interaction), which spurs innovation toward a new multi-dimensional robotic service to humans. The journal intends to capture and archive this emerging yet significant advancement in the field of intelligent service robotics. The journal will publish original papers of innovative ideas and concepts, new discoveries and improvements, as well as novel applications and business models which are related to the field of intelligent service robotics described above and are proven to be of high quality. The areas that the Journal will cover include, but are not limited to: Intelligent robots serving humans in daily life or in a hazardous environment, such as home or personal service robots, entertainment robots, education robots, medical robots, healthcare and rehabilitation robots, and rescue robots (Service Robotics); Intelligent robotic functions in the form of embedded systems for applications to, for example, intelligent space, intelligent vehicles and transportation systems, intelligent manufacturing systems, and intelligent medical facilities (Embedded Robotics); The integration of robotics with network technologies, generating such services and solutions as distributed robots, distance robotic education-aides, and virtual laboratories or museums (Networked Robotics).
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信