CBAM-YOLOv5用于红外图像目标检测

Security and Defence Quarterly Pub Date : 2022-11-07 DOI:10.1117/12.2640690

Viet Pham Hoang, Huong Ninh, Tran Tien Hai

{"title":"CBAM-YOLOv5用于红外图像目标检测","authors":"Viet Pham Hoang, Huong Ninh, Tran Tien Hai","doi":"10.1117/12.2640690","DOIUrl":null,"url":null,"abstract":"Identifying an object of interest in thermal images plays a vital role in several military and civilian applications. The deep learning approach has shown its superiority in object detection in various RGB datasets. However, regarding to thermal images, their low resolution and shortage of detail properties impose a huge challenge that hinders the accuracy. In this paper, we propose an improved version of YOLOv5 model to tackle this problem. Convolution Block Attention Module (CBAM) is integrated into traditional YOLOv5 for better representation of objects by focusing on important features and neglecting unnecessary ones. The Selective Kernel Network(SENet) is added to maximize the shallow features usage. Furthermore, the multiscale detection mechanism is utilized to improve small object detection accuracy. We train our model on the mixed visible-thermal images collected from LSOTB-TIR, LLVIP, and COCO datasets. We evaluate the performance of our method on 8 classes of objects: person, bicycle, airplane, helicopter, car, motorbike, boat, and tank. Experiment results show that our approach can achieve mAP up to 90.2%, which outperforms the original YOLOv5 and other popular methods.","PeriodicalId":52940,"journal":{"name":"Security and Defence Quarterly","volume":"13 1","pages":"122760E - 122760E-12"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"CBAM-YOLOv5 for infrared image object detection\",\"authors\":\"Viet Pham Hoang, Huong Ninh, Tran Tien Hai\",\"doi\":\"10.1117/12.2640690\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Identifying an object of interest in thermal images plays a vital role in several military and civilian applications. The deep learning approach has shown its superiority in object detection in various RGB datasets. However, regarding to thermal images, their low resolution and shortage of detail properties impose a huge challenge that hinders the accuracy. In this paper, we propose an improved version of YOLOv5 model to tackle this problem. Convolution Block Attention Module (CBAM) is integrated into traditional YOLOv5 for better representation of objects by focusing on important features and neglecting unnecessary ones. The Selective Kernel Network(SENet) is added to maximize the shallow features usage. Furthermore, the multiscale detection mechanism is utilized to improve small object detection accuracy. We train our model on the mixed visible-thermal images collected from LSOTB-TIR, LLVIP, and COCO datasets. We evaluate the performance of our method on 8 classes of objects: person, bicycle, airplane, helicopter, car, motorbike, boat, and tank. Experiment results show that our approach can achieve mAP up to 90.2%, which outperforms the original YOLOv5 and other popular methods.\",\"PeriodicalId\":52940,\"journal\":{\"name\":\"Security and Defence Quarterly\",\"volume\":\"13 1\",\"pages\":\"122760E - 122760E-12\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-11-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Security and Defence Quarterly\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1117/12.2640690\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Security and Defence Quarterly","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1117/12.2640690","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

摘要

在热图像中识别感兴趣的目标在一些军事和民用应用中起着至关重要的作用。深度学习方法在各种RGB数据集的目标检测中显示出其优越性。然而，对于热图像来说，其低分辨率和缺乏细节特性给精度带来了巨大的挑战。在本文中，我们提出了一个改进版本的YOLOv5模型来解决这个问题。卷积块注意模块(CBAM)被集成到传统的YOLOv5中，通过关注重要的特征而忽略不必要的特征来更好地表征对象。加入选择性内核网络(SENet)来最大化浅层特征的使用。此外，利用多尺度检测机制提高了小目标的检测精度。我们在LSOTB-TIR, LLVIP和COCO数据集收集的混合可见光-热图像上训练我们的模型。我们评估了我们的方法在8类对象上的性能:人、自行车、飞机、直升机、汽车、摩托车、船和坦克。实验结果表明，我们的方法可以实现高达90.2%的mAP，优于原来的YOLOv5和其他流行的方法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

CBAM-YOLOv5 for infrared image object detection

Identifying an object of interest in thermal images plays a vital role in several military and civilian applications. The deep learning approach has shown its superiority in object detection in various RGB datasets. However, regarding to thermal images, their low resolution and shortage of detail properties impose a huge challenge that hinders the accuracy. In this paper, we propose an improved version of YOLOv5 model to tackle this problem. Convolution Block Attention Module (CBAM) is integrated into traditional YOLOv5 for better representation of objects by focusing on important features and neglecting unnecessary ones. The Selective Kernel Network(SENet) is added to maximize the shallow features usage. Furthermore, the multiscale detection mechanism is utilized to improve small object detection accuracy. We train our model on the mixed visible-thermal images collected from LSOTB-TIR, LLVIP, and COCO datasets. We evaluate the performance of our method on 8 classes of objects: person, bicycle, airplane, helicopter, car, motorbike, boat, and tank. Experiment results show that our approach can achieve mAP up to 90.2%, which outperforms the original YOLOv5 and other popular methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Security and Defence Quarterly

自引率

0.00%

发文量

审稿时长

9 weeks