YOFIR: High precise infrared object detection algorithm based on YOLO and FasterNet

IF 3.1 3区 物理与天体物理 Q2 INSTRUMENTS & INSTRUMENTATION
Mi Wen , ChenYang Li , YunSheng Xue , Man Xu , ZengHui Xi , WeiDong Qiu
{"title":"YOFIR: High precise infrared object detection algorithm based on YOLO and FasterNet","authors":"Mi Wen ,&nbsp;ChenYang Li ,&nbsp;YunSheng Xue ,&nbsp;Man Xu ,&nbsp;ZengHui Xi ,&nbsp;WeiDong Qiu","doi":"10.1016/j.infrared.2024.105627","DOIUrl":null,"url":null,"abstract":"<div><div>Infrared images often suffer from issues such as blurriness and unclear object boundaries, and existing object detection algorithms are developed based on visible light images, which makes infrared object detection more challenging. Therefore, this paper proposes an infrared image enhancement method and an infrared object detection algorithm based on YOLO and FasterNet, named YOFIR. Specifically, we apply CHALE, Auto Gamma, histogram equalization, and bilateral filtering to process images individually, then fuse the results with different weights to address the poor imaging quality of infrared images. Moreover, we utilize the FasterNet network for multi-scale feature extraction to adapt to low-resolution infrared images. We also reduce model parameters through GSConv and propose a novel Efficient Multi-Scale Group Convolution module, EMSGC, which enhances feature fusion by processing feature maps from different channels, effectively improving detection accuracy. Finally, the DyHead Block is incorporated into the head to enhance the capability of infrared object detection. Experimental results on the HIT-UAV infrared remote sensing dataset show that the proposed algorithm achieves a 4% improvement in <span><math><mrow><mi>m</mi><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>0</mn><mo>.</mo><mn>5</mn></mrow></msub></mrow></math></span> compared to YOLOv8. Moreover, on the FLIR dataset, the algorithm shows a 1.6% improvement in <span><math><mrow><mi>m</mi><mi>A</mi><msub><mrow><mi>P</mi></mrow><mrow><mn>0</mn><mo>.</mo><mn>95</mn></mrow></msub></mrow></math></span> over YOLOv8, with significant advantages in terms of model parameters and FLOPs.</div></div>","PeriodicalId":13549,"journal":{"name":"Infrared Physics & Technology","volume":"144 ","pages":"Article 105627"},"PeriodicalIF":3.1000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Infrared Physics & Technology","FirstCategoryId":"101","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1350449524005115","RegionNum":3,"RegionCategory":"物理与天体物理","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"INSTRUMENTS & INSTRUMENTATION","Score":null,"Total":0}
引用次数: 0

Abstract

Infrared images often suffer from issues such as blurriness and unclear object boundaries, and existing object detection algorithms are developed based on visible light images, which makes infrared object detection more challenging. Therefore, this paper proposes an infrared image enhancement method and an infrared object detection algorithm based on YOLO and FasterNet, named YOFIR. Specifically, we apply CHALE, Auto Gamma, histogram equalization, and bilateral filtering to process images individually, then fuse the results with different weights to address the poor imaging quality of infrared images. Moreover, we utilize the FasterNet network for multi-scale feature extraction to adapt to low-resolution infrared images. We also reduce model parameters through GSConv and propose a novel Efficient Multi-Scale Group Convolution module, EMSGC, which enhances feature fusion by processing feature maps from different channels, effectively improving detection accuracy. Finally, the DyHead Block is incorporated into the head to enhance the capability of infrared object detection. Experimental results on the HIT-UAV infrared remote sensing dataset show that the proposed algorithm achieves a 4% improvement in mAP0.5 compared to YOLOv8. Moreover, on the FLIR dataset, the algorithm shows a 1.6% improvement in mAP0.95 over YOLOv8, with significant advantages in terms of model parameters and FLOPs.
YOFIR:基于 YOLO 和 FasterNet 的高精度红外物体检测算法
红外图像通常存在模糊和物体边界不清晰等问题,而现有的物体检测算法都是基于可见光图像开发的,这使得红外物体检测更具挑战性。因此,本文提出了一种基于 YOLO 和 FasterNet 的红外图像增强方法和红外物体检测算法,命名为 YOFIR。具体来说,我们应用 CHALE、Auto Gamma、直方图均衡化和双边滤波对图像进行单独处理,然后将处理结果与不同权重进行融合,以解决红外图像成像质量差的问题。此外,我们还利用 FasterNet 网络进行多尺度特征提取,以适应低分辨率的红外图像。我们还通过 GSConv 减少了模型参数,并提出了新颖的高效多尺度群卷积模块(EMSGC),通过处理不同通道的特征图来增强特征融合,从而有效提高了检测精度。最后,DyHead Block 被集成到云台中,以增强红外物体检测能力。在 HIT-UAV 红外遥感数据集上的实验结果表明,与 YOLOv8 相比,拟议算法的 mAP0.5 提高了 4%。此外,在 FLIR 数据集上,该算法的 mAP0.95 比 YOLOv8 提高了 1.6%,在模型参数和 FLOP 方面具有显著优势。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
5.70
自引率
12.10%
发文量
400
审稿时长
67 days
期刊介绍: The Journal covers the entire field of infrared physics and technology: theory, experiment, application, devices and instrumentation. Infrared'' is defined as covering the near, mid and far infrared (terahertz) regions from 0.75um (750nm) to 1mm (300GHz.) Submissions in the 300GHz to 100GHz region may be accepted at the editors discretion if their content is relevant to shorter wavelengths. Submissions must be primarily concerned with and directly relevant to this spectral region. Its core topics can be summarized as the generation, propagation and detection, of infrared radiation; the associated optics, materials and devices; and its use in all fields of science, industry, engineering and medicine. Infrared techniques occur in many different fields, notably spectroscopy and interferometry; material characterization and processing; atmospheric physics, astronomy and space research. Scientific aspects include lasers, quantum optics, quantum electronics, image processing and semiconductor physics. Some important applications are medical diagnostics and treatment, industrial inspection and environmental monitoring.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信