Title: FS-YOLO: fire-smoke detection based on improved YOLOv7
Authors: Dongmei Wang, Ying Qian, Jingyi Lu, Peng Wang, Zhongrui Hu, Yongkang Chai
DOI: 10.1007/s00530-024-01359-z
Publication date: 2024-07-24
Publication type: Journal Article
Citations: 0
Abstract
Fire has emerged as a major threat to the Earth’s ecological balance and to human well-being, making fire detection and alert systems essential. Public fire datasets with examples of fire and smoke in real-world situations are scarce, and existing techniques for recognizing objects in fire and smoke are imprecise and unreliable when identifying small objects. We developed a dual dataset to evaluate a model’s ability to handle these difficulties, and we introduce FS-YOLO, a new fire detection model with improved accuracy. Training YOLOv7 can lead to overfitting because of its large number of parameters and the limited number of fire detection object categories, and YOLOv7 struggles to recognize small, dense objects during feature extraction, resulting in missed detections. We enhance the Swin Transformer module to reduce local feature interdependence, obtain a wider range of parameters, and handle features at several levels; these improvements strengthen the model’s robustness and the network’s ability to recognize dense, tiny objects. Efficient channel attention is incorporated to reduce false fire detections: by localizing the region of interest and extracting meaningful information, it helps the model identify pertinent areas and minimize false alarms. The proposal also contributes the fire-smoke and real-fire-smoke datasets; the latter simulates real-world conditions with occlusions, lens blur, and motion blur, testing the model’s robustness and adaptability in complex situations. On the two datasets, FS-YOLO improves mAP by 6.4% and 5.4%, respectively, over YOLOv7. In the robustness-check experiments, the mAP of FS-YOLO is 4.1% and 3.1% higher than that of the current SOTA models YOLOv8s and DINO.
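The efficient channel attention mentioned in the abstract is, in the standard ECA formulation, a very lightweight gate: global average pooling squeezes each channel to a scalar, a small 1-D convolution models local cross-channel interaction, and a sigmoid produces per-channel weights that rescale the feature map. A minimal NumPy sketch under that assumption (the convolution weights here are untrained placeholders; in the actual network they are learned):

```python
import numpy as np

def eca(x, k=3):
    """Efficient-channel-attention sketch for a feature map x of shape (C, H, W).

    k is the 1-D convolution kernel size (odd, chosen adaptively from C in
    the original ECA formulation; fixed here for simplicity).
    """
    c = x.shape[0]
    # 1) Squeeze: global average pooling over the spatial dimensions -> (C,)
    y = x.mean(axis=(1, 2))
    # 2) Local cross-channel interaction: 1-D convolution of size k over
    #    the channel dimension (edge padding keeps the length at C).
    pad = k // 2
    y_pad = np.pad(y, pad, mode="edge")
    w = np.full(k, 1.0 / k)  # placeholder weights; learned in practice
    z = np.array([np.dot(y_pad[i:i + k], w) for i in range(c)])
    # 3) Gate: sigmoid yields per-channel weights in (0, 1), which then
    #    rescale every spatial position of the corresponding channel.
    a = 1.0 / (1.0 + np.exp(-z))
    return x * a[:, None, None]
```

Because the gate only adds C sigmoid weights and a k-tap convolution, the module is nearly free in parameters and FLOPs, which is why it can be dropped into a detector like YOLOv7 without a meaningful speed penalty.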