EGNL-FAT: An Edge-Guided Non-Local network with Frequency-Aware transformer for smoke segmentation

IF 7.5 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yitong Fu , Haiyan Li , Yujiao Wang , Wenbing Lv , Bingbing He , Pengfei Yu
{"title":"EGNL-FAT: An Edge-Guided Non-Local network with Frequency-Aware transformer for smoke segmentation","authors":"Yitong Fu ,&nbsp;Haiyan Li ,&nbsp;Yujiao Wang ,&nbsp;Wenbing Lv ,&nbsp;Bingbing He ,&nbsp;Pengfei Yu","doi":"10.1016/j.eswa.2025.127621","DOIUrl":null,"url":null,"abstract":"<div><div>Smoke semantic segmentation presents enormous challenges compared to other objects due to the special characteristics of smoke, such as its non-rigid structure, translucency, and variable modes. Therefore, we design an Edge-Guided Non-Local Network with Frequency-Aware Transformer (EGNL-FAT) for smoke segmentation in real-world scenarios. Unlike traditional smoke segmentation methods, this paper first considers the different feature extraction capabilities of CNNs and transformers due to their varying receptive fields. By constructing a non-local network, the model’s ability to establish long-range dependencies is enhanced. Secondly, edge information is utilized to alleviate the interference caused by intraclass inconsistency and interclass similarity during segmentation. To address the scarcity of real-scene datasets, we introduce the Forest-Scene Smoke Segmentation (FSS) dataset, which contains diverse smoke types and complex backgrounds. We also propose a frequency-aware transformer combining Fast Fourier Transform (FFT) and frequency domain decoupling to reduce computational complexity. A Cross-Domain Fusion Module (CDFM) is introduced for collaborative learning of information from various sources using weighted fusion and Coordinate Attention (CA). Additionally, we develop an Edge Feature Extraction Module (EFEM) that efficiently extracts detailed edge information at full resolution using a shallow structure. Finally, a Multi-Directional Cross Attention (MDCA) mechanism is proposed to compute similarities between edge and decoder feature maps, guiding accurate segmentation. Experimental results show that EGNL-FAT achieves mean Intersection over Unions (mIoUs) of 59.66 % (FS01) and 64.68 % (FS02) on FSS, and 79.04 % on SMOKE5K. It demonstrates excellent performance while maintaining satisfactory model complexity and processing efficiency. Our code and dataset are publicly available at: <span><span>https://github.com/yitccc/smoke-segmentation/tree/master</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50461,"journal":{"name":"Expert Systems with Applications","volume":"280 ","pages":"Article 127621"},"PeriodicalIF":7.5000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Expert Systems with Applications","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0957417425012436","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Smoke semantic segmentation presents enormous challenges compared to other objects due to the special characteristics of smoke, such as its non-rigid structure, translucency, and variable modes. Therefore, we design an Edge-Guided Non-Local Network with Frequency-Aware Transformer (EGNL-FAT) for smoke segmentation in real-world scenarios. Unlike traditional smoke segmentation methods, this paper first considers the different feature extraction capabilities of CNNs and transformers due to their varying receptive fields. By constructing a non-local network, the model’s ability to establish long-range dependencies is enhanced. Secondly, edge information is utilized to alleviate the interference caused by intraclass inconsistency and interclass similarity during segmentation. To address the scarcity of real-scene datasets, we introduce the Forest-Scene Smoke Segmentation (FSS) dataset, which contains diverse smoke types and complex backgrounds. We also propose a frequency-aware transformer combining Fast Fourier Transform (FFT) and frequency domain decoupling to reduce computational complexity. A Cross-Domain Fusion Module (CDFM) is introduced for collaborative learning of information from various sources using weighted fusion and Coordinate Attention (CA). Additionally, we develop an Edge Feature Extraction Module (EFEM) that efficiently extracts detailed edge information at full resolution using a shallow structure. Finally, a Multi-Directional Cross Attention (MDCA) mechanism is proposed to compute similarities between edge and decoder feature maps, guiding accurate segmentation. Experimental results show that EGNL-FAT achieves mean Intersection over Unions (mIoUs) of 59.66 % (FS01) and 64.68 % (FS02) on FSS, and 79.04 % on SMOKE5K. It demonstrates excellent performance while maintaining satisfactory model complexity and processing efficiency. Our code and dataset are publicly available at: https://github.com/yitccc/smoke-segmentation/tree/master.
EGNL-FAT:一种带频率感知变压器的边缘引导非局部网络烟雾分割方法
由于烟雾具有非刚性结构、半透明和模式可变等特点,因此与其他物体相比,烟雾语义分割面临着巨大的挑战。因此,我们设计了一种带有频率感知变压器(EGNL-FAT)的边缘引导非局部网络,用于现实场景中的烟雾分割。与传统的烟雾分割方法不同,本文首先考虑了cnn和变压器由于接收域的不同而具有不同的特征提取能力。通过构建非局部网络,增强了模型建立远程依赖关系的能力。其次,利用边缘信息缓解分割过程中类内不一致和类间相似造成的干扰;为了解决真实场景数据集的稀缺性,我们引入了森林场景烟雾分割(Forest-Scene Smoke Segmentation, FSS)数据集,该数据集包含多种烟雾类型和复杂背景。我们还提出了一种结合快速傅里叶变换(FFT)和频域解耦的频率感知变压器,以降低计算复杂度。提出了一种跨域融合模块(CDFM),利用加权融合和坐标注意(CA)对不同来源的信息进行协同学习。此外,我们开发了一个边缘特征提取模块(EFEM),该模块使用浅结构在全分辨率下有效地提取详细的边缘信息。最后,提出了一种多方向交叉注意(Multi-Directional Cross Attention, MDCA)机制来计算边缘和解码器特征映射之间的相似度,指导精确分割。实验结果表明,EGNL-FAT在FSS和smok5k上分别达到59.66% (FS01)和64.68% (FS02)和79.04% (miou)。在保持良好的模型复杂度和处理效率的同时,该模型表现出了优异的性能。我们的代码和数据集可以在:https://github.com/yitccc/smoke-segmentation/tree/master上公开获取。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Expert Systems with Applications
Expert Systems with Applications 工程技术-工程:电子与电气
CiteScore
13.80
自引率
10.60%
发文量
2045
审稿时长
8.7 months
期刊介绍: Expert Systems With Applications is an international journal dedicated to the exchange of information on expert and intelligent systems used globally in industry, government, and universities. The journal emphasizes original papers covering the design, development, testing, implementation, and management of these systems, offering practical guidelines. It spans various sectors such as finance, engineering, marketing, law, project management, information management, medicine, and more. The journal also welcomes papers on multi-agent systems, knowledge management, neural networks, knowledge discovery, data mining, and other related areas, excluding applications to military/defense systems.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信