A Study on Flame Detection Method Combining Visible Light and Thermal Infrared Multimodal Images

Weining Sun, Yuanhao Liu, Feng Wang, Le Hua, Jianzhong Fu, Songyu Hu

Fire Technology 61(4): 2167–2188 (published online 2024-11-12). DOI: 10.1007/s10694-024-01676-9
Abstract
Fire disasters pose a significant threat to human safety, so timely and effective fire detection is crucial for mitigating them. Combining visible light and thermal infrared images for multimodal flame detection can fully exploit both the visual appearance and the temperature distribution of flames, potentially enhancing the accuracy and robustness of flame detection considerably, which makes it a highly promising approach. However, the visible light and thermal infrared modalities differ fundamentally in imaging principles, pixel resolution, and texture information, so fusing them effectively is challenging. To address this issue, we introduce a novel flame detection method that integrates visible light and thermal infrared images. For the visible light modality, we design an overall model based on Mask R-CNN, with ConvNeXt as the backbone, an FPN as the neck, and a cascade structure as the detection head. For the thermal infrared modality, whose images carry weak semantic but strong texture features, we replace the model's neck with a PAFPN structure to better extract the low-level texture information of the image. Furthermore, we design a multimodal fusion algorithm based on GIoU that fuses the detection results of the two modalities, compensating for their weak alignment caused by these differences in imaging principles, pixel resolution, and texture information. Experimental results on both public and self-collected datasets demonstrate that our method outperforms other mainstream object detection networks in flame detection, and ablation experiments show that multimodal fusion significantly improves the overall performance of the algorithm. Specifically, our method achieves an accuracy of 85.33%, a recall of 99.33%, and an F1 score of 90.03%.
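The abstract does not give implementation details of the GIoU-based fusion, but GIoU (generalized intersection over union) itself has a standard definition: IoU minus the fraction of the smallest enclosing box not covered by the union of the two boxes. The sketch below illustrates that definition and one plausible way detections from the two modalities could be paired with it; `fuse_detections`, its threshold, and the box format are illustrative assumptions, not the authors' published method.

```python
# Minimal sketch of the standard GIoU measure between two axis-aligned boxes.
# Boxes are (x1, y1, x2, y2) in a shared coordinate frame (i.e., after the
# visible and thermal images have been registered to each other).

def giou(box_a, box_b):
    ax1, ay1, ax2, ay2 = box_a
    bx1, by1, bx2, by2 = box_b

    # Intersection area
    ix1, iy1 = max(ax1, bx1), max(ay1, by1)
    ix2, iy2 = min(ax2, bx2), min(ay2, by2)
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)

    # Union area
    area_a = (ax2 - ax1) * (ay2 - ay1)
    area_b = (bx2 - bx1) * (by2 - by1)
    union = area_a + area_b - inter
    iou = inter / union if union > 0 else 0.0

    # Smallest enclosing box C
    cx1, cy1 = min(ax1, bx1), min(ay1, by1)
    cx2, cy2 = max(ax2, bx2), max(ay2, by2)
    area_c = (cx2 - cx1) * (cy2 - cy1)

    # GIoU = IoU - |C \ (A ∪ B)| / |C|, in the range (-1, 1]
    return iou - (area_c - union) / area_c if area_c > 0 else iou


def fuse_detections(vis_boxes, ir_boxes, giou_thresh=0.0):
    """Hypothetical pairing step: keep a visible-light detection as a
    confirmed flame if some thermal detection overlaps it with GIoU above
    a threshold. Unlike plain IoU, GIoU remains informative even when two
    boxes are disjoint, which makes it more tolerant of the weak
    cross-modal alignment described in the abstract."""
    return [vb for vb in vis_boxes
            if any(giou(vb, ib) > giou_thresh for ib in ir_boxes)]
```

Because GIoU degrades gracefully toward -1 as boxes move apart rather than saturating at 0, a threshold on it can still match slightly misaligned visible and thermal detections of the same flame, which plain IoU matching would discard.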
Journal Introduction
Fire Technology publishes original contributions, both theoretical and empirical, that contribute to the solution of problems in fire safety science and engineering. It is the leading journal in the field, publishing applied research dealing with the full range of actual and potential fire hazards facing humans and the environment. It covers the entire domain of fire safety science and engineering problems relevant in industrial, operational, cultural, and environmental applications, including modeling, testing, detection, suppression, human behavior, wildfires, structures, and risk analysis.
The aim of Fire Technology is to push forward the frontiers of knowledge and technology by encouraging interdisciplinary communication of significant technical developments in fire protection and subjects of scientific interest to the fire protection community at large.
It is published in conjunction with the National Fire Protection Association (NFPA) and the Society of Fire Protection Engineers (SFPE). The mission of NFPA is to help save lives and reduce loss with information, knowledge, and passion. The mission of SFPE is to advance the science and practice of fire protection engineering internationally.