基于混合窗口变压器和双支路融合的裂纹分割网络

IF 3.5 2区 计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Jianxin Wang, Jin Wang, Yi Li, Dongdong Ge, Siyuan Zhou
{"title":"基于混合窗口变压器和双支路融合的裂纹分割网络","authors":"Jianxin Wang,&nbsp;Jin Wang,&nbsp;Yi Li,&nbsp;Dongdong Ge,&nbsp;Siyuan Zhou","doi":"10.1007/s10489-025-06822-6","DOIUrl":null,"url":null,"abstract":"<div><p>Cracks are external manifestations of infrastructure damage, and routine inspections are crucial for assessing their structural safety. However, due to factors such as crack diversity, background noise interference, and information loss, high-precision crack segmentation still faces numerous challenges. To alleviate the influence of these factors, a crack segmentation network based on hybrid-window Transformer and dual-branch fusion (CSHD) is proposed. The CSHD network can effectively capture local texture details and global context modeling to achieve high-precision crack segmentation. First, a hybrid-window attention mechanism(HWA) is designed as the core component, which employs a dual-branch parallel architecture to integrate channel attention and multi-scale depth-wise convolution modules on the value path of window attention, achieving spatial receptive field expansion and cross-window feature interaction. Second, to enhance feature processing capabilities, a locally enhanced gated FeedForward network (LeGN) is proposed, which achieves adaptive feature aggregation through overlapping multi-scale deformable convolution, and a gated unit is designed to optimize the information flow. Thirdly, a dual-branch fusion module (DBF) is introduced in skip-connections of encoder-decoder layers to enhance cross-level feature interaction while effectively mitigating information loss during the downsampling process. Finally, comparative experimental results on three benchmark datasets (CrackLS315, DeepCrack537, and YCD776) with seven advanced networks demonstrate that the proposed network achieves excellent performance, obtaining mean intersection over union (mIoU) scores of 71.26%, 86.58%, and 83.93%, respectively. Code is available at: https://github.com/wjxcsust2024/CSHD.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 13","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2025-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Crack segmentation network based on hybrid-window transformer and dual-branch fusion\",\"authors\":\"Jianxin Wang,&nbsp;Jin Wang,&nbsp;Yi Li,&nbsp;Dongdong Ge,&nbsp;Siyuan Zhou\",\"doi\":\"10.1007/s10489-025-06822-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Cracks are external manifestations of infrastructure damage, and routine inspections are crucial for assessing their structural safety. However, due to factors such as crack diversity, background noise interference, and information loss, high-precision crack segmentation still faces numerous challenges. To alleviate the influence of these factors, a crack segmentation network based on hybrid-window Transformer and dual-branch fusion (CSHD) is proposed. The CSHD network can effectively capture local texture details and global context modeling to achieve high-precision crack segmentation. First, a hybrid-window attention mechanism(HWA) is designed as the core component, which employs a dual-branch parallel architecture to integrate channel attention and multi-scale depth-wise convolution modules on the value path of window attention, achieving spatial receptive field expansion and cross-window feature interaction. Second, to enhance feature processing capabilities, a locally enhanced gated FeedForward network (LeGN) is proposed, which achieves adaptive feature aggregation through overlapping multi-scale deformable convolution, and a gated unit is designed to optimize the information flow. Thirdly, a dual-branch fusion module (DBF) is introduced in skip-connections of encoder-decoder layers to enhance cross-level feature interaction while effectively mitigating information loss during the downsampling process. Finally, comparative experimental results on three benchmark datasets (CrackLS315, DeepCrack537, and YCD776) with seven advanced networks demonstrate that the proposed network achieves excellent performance, obtaining mean intersection over union (mIoU) scores of 71.26%, 86.58%, and 83.93%, respectively. Code is available at: https://github.com/wjxcsust2024/CSHD.</p></div>\",\"PeriodicalId\":8041,\"journal\":{\"name\":\"Applied Intelligence\",\"volume\":\"55 13\",\"pages\":\"\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10489-025-06822-6\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-025-06822-6","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

裂缝是基础设施损伤的外在表现,日常检查是评估其结构安全性的关键。然而,由于裂纹多样性、背景噪声干扰和信息丢失等因素,高精度的裂纹分割仍然面临诸多挑战。为了减轻这些因素的影响,提出了一种基于混合窗口变压器和双分支融合(CSHD)的裂纹分割网络。CSHD网络可以有效地捕获局部纹理细节和全局上下文建模,实现高精度的裂纹分割。首先,设计了混合窗口注意机制(HWA)作为核心组件,该机制采用双分支并行架构,在窗口注意的值路径上集成通道注意和多尺度深度卷积模块,实现空间感受野扩展和跨窗口特征交互;其次,为了增强特征处理能力,提出了局部增强的门控前馈网络(LeGN),该网络通过重叠的多尺度变形卷积实现自适应特征聚合,并设计了门控单元对信息流进行优化。第三,在编解码器层的跳接中引入双分支融合模块(DBF),增强跨层特征交互,同时有效减轻下采样过程中的信息丢失。最后,在3个基准数据集(CrackLS315、DeepCrack537和YCD776)上与7个高级网络进行对比实验,结果表明,本文提出的网络取得了优异的性能,平均mIoU得分分别为71.26%、86.58%和83.93%。代码可从https://github.com/wjxcsust2024/CSHD获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Crack segmentation network based on hybrid-window transformer and dual-branch fusion

Crack segmentation network based on hybrid-window transformer and dual-branch fusion

Crack segmentation network based on hybrid-window transformer and dual-branch fusion

Cracks are external manifestations of infrastructure damage, and routine inspections are crucial for assessing their structural safety. However, due to factors such as crack diversity, background noise interference, and information loss, high-precision crack segmentation still faces numerous challenges. To alleviate the influence of these factors, a crack segmentation network based on hybrid-window Transformer and dual-branch fusion (CSHD) is proposed. The CSHD network can effectively capture local texture details and global context modeling to achieve high-precision crack segmentation. First, a hybrid-window attention mechanism(HWA) is designed as the core component, which employs a dual-branch parallel architecture to integrate channel attention and multi-scale depth-wise convolution modules on the value path of window attention, achieving spatial receptive field expansion and cross-window feature interaction. Second, to enhance feature processing capabilities, a locally enhanced gated FeedForward network (LeGN) is proposed, which achieves adaptive feature aggregation through overlapping multi-scale deformable convolution, and a gated unit is designed to optimize the information flow. Thirdly, a dual-branch fusion module (DBF) is introduced in skip-connections of encoder-decoder layers to enhance cross-level feature interaction while effectively mitigating information loss during the downsampling process. Finally, comparative experimental results on three benchmark datasets (CrackLS315, DeepCrack537, and YCD776) with seven advanced networks demonstrate that the proposed network achieves excellent performance, obtaining mean intersection over union (mIoU) scores of 71.26%, 86.58%, and 83.93%, respectively. Code is available at: https://github.com/wjxcsust2024/CSHD.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Applied Intelligence
Applied Intelligence 工程技术-计算机:人工智能
CiteScore
6.60
自引率
20.80%
发文量
1361
审稿时长
5.9 months
期刊介绍: With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance. The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信