{"title":"YOLO-MIF:利用多信息融合技术改进 YOLOv8,用于灰度图像中的物体检测","authors":"Dahang Wan, Rongsheng Lu, Bingtao Hu, Jiajie Yin, Siyuan Shen, Ting xu, Xianli Lang","doi":"10.1016/j.aei.2024.102709","DOIUrl":null,"url":null,"abstract":"<div><p>Grayscale images, characterized by their single-channel grayscale information, offer a simplified representation that reduces the complexity and cost associated with processing and storage. They find widespread application in various detection tasks across domains such as medical imaging, industrial inspection, autonomous driving, remote sensing imaging, and surveillance security. However, grayscale images face challenges such as limited object discrimination, susceptibility to noise, and uneven brightness due to the absence of color information. Moreover, the real-time constraints in these fields present additional hurdles for object detection using grayscale images. This article tackles these challenges by introducing an enhanced object detection network, YOLO-MIF, which integrates diverse multi-information fusion strategies to enhance the YOLOv8 network. Initially, a technique was introduced to generate pseudo-multi-channel grayscale images, enhancing the network’s channel information and reducing potential image noise and defocus blur issues. Subsequently, network structure reparameterization technology was employed to boost the network’s detection performance without increasing the inference time. Additionally, a novel decoupled detection head was introduced to amplify the model’s expressive power when dealing with grayscale images. The efficacy of the proposed algorithm was evaluated on two open-source grayscale image detection datasets (NEU-DET and FLIR-ADAS). Results indicate that, at similar inference speeds, the proposed algorithm outperformed YOLOv8 by 2.1% and Faster R-CNN by 4.8% in terms of mAP(mean Average Precision), achieving a superior balance between detection efficiency and effectiveness. The code for the algorithm will be made available at <span><span>https://github.com/wandahangFY/YOLO-MIF</span><svg><path></path></svg></span>.</p></div>","PeriodicalId":50941,"journal":{"name":"Advanced Engineering Informatics","volume":"62 ","pages":"Article 102709"},"PeriodicalIF":8.0000,"publicationDate":"2024-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"YOLO-MIF: Improved YOLOv8 with Multi-Information fusion for object detection in Gray-Scale images\",\"authors\":\"Dahang Wan, Rongsheng Lu, Bingtao Hu, Jiajie Yin, Siyuan Shen, Ting xu, Xianli Lang\",\"doi\":\"10.1016/j.aei.2024.102709\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Grayscale images, characterized by their single-channel grayscale information, offer a simplified representation that reduces the complexity and cost associated with processing and storage. They find widespread application in various detection tasks across domains such as medical imaging, industrial inspection, autonomous driving, remote sensing imaging, and surveillance security. However, grayscale images face challenges such as limited object discrimination, susceptibility to noise, and uneven brightness due to the absence of color information. Moreover, the real-time constraints in these fields present additional hurdles for object detection using grayscale images. This article tackles these challenges by introducing an enhanced object detection network, YOLO-MIF, which integrates diverse multi-information fusion strategies to enhance the YOLOv8 network. Initially, a technique was introduced to generate pseudo-multi-channel grayscale images, enhancing the network’s channel information and reducing potential image noise and defocus blur issues. Subsequently, network structure reparameterization technology was employed to boost the network’s detection performance without increasing the inference time. Additionally, a novel decoupled detection head was introduced to amplify the model’s expressive power when dealing with grayscale images. The efficacy of the proposed algorithm was evaluated on two open-source grayscale image detection datasets (NEU-DET and FLIR-ADAS). Results indicate that, at similar inference speeds, the proposed algorithm outperformed YOLOv8 by 2.1% and Faster R-CNN by 4.8% in terms of mAP(mean Average Precision), achieving a superior balance between detection efficiency and effectiveness. The code for the algorithm will be made available at <span><span>https://github.com/wandahangFY/YOLO-MIF</span><svg><path></path></svg></span>.</p></div>\",\"PeriodicalId\":50941,\"journal\":{\"name\":\"Advanced Engineering Informatics\",\"volume\":\"62 \",\"pages\":\"Article 102709\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2024-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Advanced Engineering Informatics\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1474034624003574\",\"RegionNum\":1,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Advanced Engineering Informatics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1474034624003574","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
YOLO-MIF: Improved YOLOv8 with Multi-Information fusion for object detection in Gray-Scale images
Grayscale images, characterized by their single-channel grayscale information, offer a simplified representation that reduces the complexity and cost associated with processing and storage. They find widespread application in various detection tasks across domains such as medical imaging, industrial inspection, autonomous driving, remote sensing imaging, and surveillance security. However, grayscale images face challenges such as limited object discrimination, susceptibility to noise, and uneven brightness due to the absence of color information. Moreover, the real-time constraints in these fields present additional hurdles for object detection using grayscale images. This article tackles these challenges by introducing an enhanced object detection network, YOLO-MIF, which integrates diverse multi-information fusion strategies to enhance the YOLOv8 network. Initially, a technique was introduced to generate pseudo-multi-channel grayscale images, enhancing the network’s channel information and reducing potential image noise and defocus blur issues. Subsequently, network structure reparameterization technology was employed to boost the network’s detection performance without increasing the inference time. Additionally, a novel decoupled detection head was introduced to amplify the model’s expressive power when dealing with grayscale images. The efficacy of the proposed algorithm was evaluated on two open-source grayscale image detection datasets (NEU-DET and FLIR-ADAS). Results indicate that, at similar inference speeds, the proposed algorithm outperformed YOLOv8 by 2.1% and Faster R-CNN by 4.8% in terms of mAP(mean Average Precision), achieving a superior balance between detection efficiency and effectiveness. The code for the algorithm will be made available at https://github.com/wandahangFY/YOLO-MIF.
期刊介绍:
Advanced Engineering Informatics is an international Journal that solicits research papers with an emphasis on 'knowledge' and 'engineering applications'. The Journal seeks original papers that report progress in applying methods of engineering informatics. These papers should have engineering relevance and help provide a scientific base for more reliable, spontaneous, and creative engineering decision-making. Additionally, papers should demonstrate the science of supporting knowledge-intensive engineering tasks and validate the generality, power, and scalability of new methods through rigorous evaluation, preferably both qualitatively and quantitatively. Abstracting and indexing for Advanced Engineering Informatics include Science Citation Index Expanded, Scopus and INSPEC.