YOLO-MIF：利用多信息融合技术改进 YOLOv8，用于灰度图像中的物体检测

IF 8 1区工程技术 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Advanced Engineering Informatics Pub Date : 2024-07-25 DOI:10.1016/j.aei.2024.102709

Dahang Wan, Rongsheng Lu, Bingtao Hu, Jiajie Yin, Siyuan Shen, Ting xu, Xianli Lang

{"title":"YOLO-MIF：利用多信息融合技术改进 YOLOv8，用于灰度图像中的物体检测","authors":"Dahang Wan, Rongsheng Lu, Bingtao Hu, Jiajie Yin, Siyuan Shen, Ting xu, Xianli Lang","doi":"10.1016/j.aei.2024.102709","DOIUrl":null,"url":null,"abstract":"<div><p>Grayscale images, characterized by their single-channel grayscale information, offer a simplified representation that reduces the complexity and cost associated with processing and storage. They find widespread application in various detection tasks across domains such as medical imaging, industrial inspection, autonomous driving, remote sensing imaging, and surveillance security. However, grayscale images face challenges such as limited object discrimination, susceptibility to noise, and uneven brightness due to the absence of color information. Moreover, the real-time constraints in these fields present additional hurdles for object detection using grayscale images. This article tackles these challenges by introducing an enhanced object detection network, YOLO-MIF, which integrates diverse multi-information fusion strategies to enhance the YOLOv8 network. Initially, a technique was introduced to generate pseudo-multi-channel grayscale images, enhancing the network’s channel information and reducing potential image noise and defocus blur issues. Subsequently, network structure reparameterization technology was employed to boost the network’s detection performance without increasing the inference time. Additionally, a novel decoupled detection head was introduced to amplify the model’s expressive power when dealing with grayscale images. The efficacy of the proposed algorithm was evaluated on two open-source grayscale image detection datasets (NEU-DET and FLIR-ADAS). Results indicate that, at similar inference speeds, the proposed algorithm outperformed YOLOv8 by 2.1% and Faster R-CNN by 4.8% in terms of mAP(mean Average Precision), achieving a superior balance between detection efficiency and effectiveness. The code for the algorithm will be made available at <span><span>https://github.com/wandahangFY/YOLO-MIF</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102709"},"PeriodicalIF":8.0000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"YOLO-MIF: Improved YOLOv8 with Multi-Information fusion for object detection in Gray-Scale images\",\"authors\":\"Dahang Wan, Rongsheng Lu, Bingtao Hu, Jiajie Yin, Siyuan Shen, Ting xu, Xianli Lang\",\"doi\":\"10.1016/j.aei.2024.102709\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Grayscale images, characterized by their single-channel grayscale information, offer a simplified representation that reduces the complexity and cost associated with processing and storage. They find widespread application in various detection tasks across domains such as medical imaging, industrial inspection, autonomous driving, remote sensing imaging, and surveillance security. However, grayscale images face challenges such as limited object discrimination, susceptibility to noise, and uneven brightness due to the absence of color information. Moreover, the real-time constraints in these fields present additional hurdles for object detection using grayscale images. This article tackles these challenges by introducing an enhanced object detection network, YOLO-MIF, which integrates diverse multi-information fusion strategies to enhance the YOLOv8 network. Initially, a technique was introduced to generate pseudo-multi-channel grayscale images, enhancing the network’s channel information and reducing potential image noise and defocus blur issues. Subsequently, network structure reparameterization technology was employed to boost the network’s detection performance without increasing the inference time. Additionally, a novel decoupled detection head was introduced to amplify the model’s expressive power when dealing with grayscale images. The efficacy of the proposed algorithm was evaluated on two open-source grayscale image detection datasets (NEU-DET and FLIR-ADAS). Results indicate that, at similar inference speeds, the proposed algorithm outperformed YOLOv8 by 2.1% and Faster R-CNN by 4.8% in terms of mAP(mean Average Precision), achieving a superior balance between detection efficiency and effectiveness. The code for the algorithm will be made available at <span><span>https://github.com/wandahangFY/YOLO-MIF</span><svg><path></path></svg></span>.</p></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"62 \",\"pages\":\"Article 102709\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2024-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034624003574\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034624003574","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

摘要

灰度图像的特点是具有单通道灰度信息，它提供了一种简化的表示方法，可降低处理和存储的复杂性和成本。灰度图像广泛应用于医疗成像、工业检测、自动驾驶、遥感成像和安全监控等领域的各种检测任务中。然而，灰度图像面临着一些挑战，如物体辨别能力有限、易受噪声影响，以及由于缺乏颜色信息而导致的亮度不均匀。此外，这些领域的实时性限制也为使用灰度图像进行物体检测带来了额外的障碍。本文通过引入增强型物体检测网络 YOLO-MIF 来应对这些挑战，该网络集成了多种多信息融合策略，以增强 YOLOv8 网络。首先，引入了一种生成伪多信道灰度图像的技术，以增强网络的信道信息，减少潜在的图像噪声和散焦模糊问题。随后，采用了网络结构重参数化技术，在不增加推理时间的情况下提高了网络的检测性能。此外，还引入了一种新型解耦检测头，以增强模型在处理灰度图像时的表现力。我们在两个开源灰度图像检测数据集（NEU-DET 和 FLIR-ADAS）上评估了所提算法的功效。结果表明，在推理速度相近的情况下，所提出的算法在 mAP（平均精度）方面比 YOLOv8 高出 2.1%，比 Faster R-CNN 高出 4.8%，在检测效率和效果之间实现了出色的平衡。该算法的代码将公布在 https://github.com/wandahangFY/YOLO-MIF 网站上。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

YOLO-MIF: Improved YOLOv8 with Multi-Information fusion for object detection in Gray-Scale images

Grayscale images, characterized by their single-channel grayscale information, offer a simplified representation that reduces the complexity and cost associated with processing and storage. They find widespread application in various detection tasks across domains such as medical imaging, industrial inspection, autonomous driving, remote sensing imaging, and surveillance security. However, grayscale images face challenges such as limited object discrimination, susceptibility to noise, and uneven brightness due to the absence of color information. Moreover, the real-time constraints in these fields present additional hurdles for object detection using grayscale images. This article tackles these challenges by introducing an enhanced object detection network, YOLO-MIF, which integrates diverse multi-information fusion strategies to enhance the YOLOv8 network. Initially, a technique was introduced to generate pseudo-multi-channel grayscale images, enhancing the network’s channel information and reducing potential image noise and defocus blur issues. Subsequently, network structure reparameterization technology was employed to boost the network’s detection performance without increasing the inference time. Additionally, a novel decoupled detection head was introduced to amplify the model’s expressive power when dealing with grayscale images. The efficacy of the proposed algorithm was evaluated on two open-source grayscale image detection datasets (NEU-DET and FLIR-ADAS). Results indicate that, at similar inference speeds, the proposed algorithm outperformed YOLOv8 by 2.1% and Faster R-CNN by 4.8% in terms of mAP(mean Average Precision), achieving a superior balance between detection efficiency and effectiveness. The code for the algorithm will be made available at https://github.com/wandahangFY/YOLO-MIF.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Advanced Engineering Informatics 工程技术-工程：综合

CiteScore

12.40

自引率

18.20%

发文量

292

审稿时长

45 days

期刊介绍： Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.