DIR-YOLOv5:基于 YOLOv5 的实时无人机透视红外物体探测方法

Yuexing Wang, Chuangang Xu, Jianzhong Su, Jinwen Tian
{"title":"DIR-YOLOv5:基于 YOLOv5 的实时无人机透视红外物体探测方法","authors":"Yuexing Wang, Chuangang Xu, Jianzhong Su, Jinwen Tian","doi":"10.1117/12.2692731","DOIUrl":null,"url":null,"abstract":"With the advancement of drone technology, object detection from the perspective of drones has found extensive applications in various fields, including surveillance, search operations, and reconnaissance tasks. Currently, most drones in the market are equipped with visible light imagers, while some high-end drones are equipped with infrared imaging detectors capable of performing infrared object detection tasks. Infrared imaging utilizes a passive imaging mode, enabling it to detect thermal radiation emitted by objects. As a result, it offers the distinct advantage of continuous operation without being restricted by daylight conditions. In comparison to visible imaging, infrared imaging uses longer wavelengths and possesses a certain level of penetration capability through clouds and smoke. Consequently, infrared object detection represents a significant research area within the field of object detection. However, detecting infrared objects, especially small ones, remains challenging due to the complexity of background information, lower resolution compared to visible images, and the lack of shape and texture information in infrared images. In response to these challenges, this study proposes a real-time drone-perspective infrared (IR) object detection method based on the YOLOv5 framework, known as DIR-YOLOv5. To effectively address the challenge of infrared vehicles occupying fewer pixels in the drone’s perspective image and making objects difficult to detect, the coordinate attention (CA) for feature enhancement is introduced. we also introduce a Spatial-Channel dynamic and query-aware sparse attention mechanism (SCBiFormer), which is optimized based on BiFormer. Additionally, we redefine the loss function as the Repulsion Loss function to tackle the problem of infrared vehicle objects gathering and overlapping occlusion in scenarios like parking lots. Furthermore, we expand the ISVD infrared image object detection dataset to include multiple scenarios and conduct experiments using this dataset. The experimental results demonstrate the excellent performance of the proposed method in infrared image object detection tasks, showing improved object detection accuracy and reduced false detection rate compared to current mainstream methods.","PeriodicalId":298662,"journal":{"name":"Applied Optics and Photonics China","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"DIR-YOLOv5: a real-time drone-perspective infrared object detection method based on YOLOv5\",\"authors\":\"Yuexing Wang, Chuangang Xu, Jianzhong Su, Jinwen Tian\",\"doi\":\"10.1117/12.2692731\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the advancement of drone technology, object detection from the perspective of drones has found extensive applications in various fields, including surveillance, search operations, and reconnaissance tasks. Currently, most drones in the market are equipped with visible light imagers, while some high-end drones are equipped with infrared imaging detectors capable of performing infrared object detection tasks. Infrared imaging utilizes a passive imaging mode, enabling it to detect thermal radiation emitted by objects. As a result, it offers the distinct advantage of continuous operation without being restricted by daylight conditions. In comparison to visible imaging, infrared imaging uses longer wavelengths and possesses a certain level of penetration capability through clouds and smoke. Consequently, infrared object detection represents a significant research area within the field of object detection. However, detecting infrared objects, especially small ones, remains challenging due to the complexity of background information, lower resolution compared to visible images, and the lack of shape and texture information in infrared images. In response to these challenges, this study proposes a real-time drone-perspective infrared (IR) object detection method based on the YOLOv5 framework, known as DIR-YOLOv5. To effectively address the challenge of infrared vehicles occupying fewer pixels in the drone’s perspective image and making objects difficult to detect, the coordinate attention (CA) for feature enhancement is introduced. we also introduce a Spatial-Channel dynamic and query-aware sparse attention mechanism (SCBiFormer), which is optimized based on BiFormer. Additionally, we redefine the loss function as the Repulsion Loss function to tackle the problem of infrared vehicle objects gathering and overlapping occlusion in scenarios like parking lots. Furthermore, we expand the ISVD infrared image object detection dataset to include multiple scenarios and conduct experiments using this dataset. The experimental results demonstrate the excellent performance of the proposed method in infrared image object detection tasks, showing improved object detection accuracy and reduced false detection rate compared to current mainstream methods.\",\"PeriodicalId\":298662,\"journal\":{\"name\":\"Applied Optics and Photonics China\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Optics and Photonics China\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2692731\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Optics and Photonics China","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2692731","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

随着无人机技术的发展,从无人机的角度进行物体探测已广泛应用于各个领域,包括监视、搜索行动和侦察任务。目前,市场上大多数无人机都配备了可见光成像仪,而一些高端无人机则配备了红外成像探测器,能够执行红外物体探测任务。红外成像采用被动成像模式,能够探测物体发出的热辐射。因此,它具有不受日光条件限制,可连续工作的显著优势。与可见光成像相比,红外成像使用的波长更长,具有一定的穿透云层和烟雾的能力。因此,红外物体检测是物体检测领域的一个重要研究领域。然而,由于背景信息复杂、分辨率低于可见光图像以及红外图像中缺乏形状和纹理信息,检测红外物体(尤其是小物体)仍然具有挑战性。针对这些挑战,本研究提出了一种基于 YOLOv5 框架的无人机实时透视红外(IR)物体检测方法,即 DIR-YOLOv5。为了有效解决红外飞行器在无人机透视图像中占据较少像素而导致物体难以检测的难题,我们引入了用于特征增强的坐标注意(CA)。我们还引入了一种基于 BiFormer 优化的空间通道动态和查询感知稀疏注意机制(SCBiFormer)。此外,我们将损失函数重新定义为斥力损失函数,以解决停车场等场景中红外车辆物体聚集和重叠遮挡的问题。此外,我们将 ISVD 红外图像物体检测数据集扩展到多个场景,并使用该数据集进行了实验。实验结果证明了所提出的方法在红外图像物体检测任务中的优异性能,与当前主流方法相比,提高了物体检测精度,降低了误检率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
DIR-YOLOv5: a real-time drone-perspective infrared object detection method based on YOLOv5
With the advancement of drone technology, object detection from the perspective of drones has found extensive applications in various fields, including surveillance, search operations, and reconnaissance tasks. Currently, most drones in the market are equipped with visible light imagers, while some high-end drones are equipped with infrared imaging detectors capable of performing infrared object detection tasks. Infrared imaging utilizes a passive imaging mode, enabling it to detect thermal radiation emitted by objects. As a result, it offers the distinct advantage of continuous operation without being restricted by daylight conditions. In comparison to visible imaging, infrared imaging uses longer wavelengths and possesses a certain level of penetration capability through clouds and smoke. Consequently, infrared object detection represents a significant research area within the field of object detection. However, detecting infrared objects, especially small ones, remains challenging due to the complexity of background information, lower resolution compared to visible images, and the lack of shape and texture information in infrared images. In response to these challenges, this study proposes a real-time drone-perspective infrared (IR) object detection method based on the YOLOv5 framework, known as DIR-YOLOv5. To effectively address the challenge of infrared vehicles occupying fewer pixels in the drone’s perspective image and making objects difficult to detect, the coordinate attention (CA) for feature enhancement is introduced. we also introduce a Spatial-Channel dynamic and query-aware sparse attention mechanism (SCBiFormer), which is optimized based on BiFormer. Additionally, we redefine the loss function as the Repulsion Loss function to tackle the problem of infrared vehicle objects gathering and overlapping occlusion in scenarios like parking lots. Furthermore, we expand the ISVD infrared image object detection dataset to include multiple scenarios and conduct experiments using this dataset. The experimental results demonstrate the excellent performance of the proposed method in infrared image object detection tasks, showing improved object detection accuracy and reduced false detection rate compared to current mainstream methods.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信