Jianxin Wang, Jin Wang, Yi Li, Dongdong Ge, Siyuan Zhou
{"title":"基于混合窗口变压器和双支路融合的裂纹分割网络","authors":"Jianxin Wang, Jin Wang, Yi Li, Dongdong Ge, Siyuan Zhou","doi":"10.1007/s10489-025-06822-6","DOIUrl":null,"url":null,"abstract":"<div><p>Cracks are external manifestations of infrastructure damage, and routine inspections are crucial for assessing their structural safety. However, due to factors such as crack diversity, background noise interference, and information loss, high-precision crack segmentation still faces numerous challenges. To alleviate the influence of these factors, a crack segmentation network based on hybrid-window Transformer and dual-branch fusion (CSHD) is proposed. The CSHD network can effectively capture local texture details and global context modeling to achieve high-precision crack segmentation. First, a hybrid-window attention mechanism(HWA) is designed as the core component, which employs a dual-branch parallel architecture to integrate channel attention and multi-scale depth-wise convolution modules on the value path of window attention, achieving spatial receptive field expansion and cross-window feature interaction. Second, to enhance feature processing capabilities, a locally enhanced gated FeedForward network (LeGN) is proposed, which achieves adaptive feature aggregation through overlapping multi-scale deformable convolution, and a gated unit is designed to optimize the information flow. Thirdly, a dual-branch fusion module (DBF) is introduced in skip-connections of encoder-decoder layers to enhance cross-level feature interaction while effectively mitigating information loss during the downsampling process. Finally, comparative experimental results on three benchmark datasets (CrackLS315, DeepCrack537, and YCD776) with seven advanced networks demonstrate that the proposed network achieves excellent performance, obtaining mean intersection over union (mIoU) scores of 71.26%, 86.58%, and 83.93%, respectively. Code is available at: https://github.com/wjxcsust2024/CSHD.</p></div>","PeriodicalId":8041,"journal":{"name":"Applied Intelligence","volume":"55 13","pages":""},"PeriodicalIF":3.5000,"publicationDate":"2025-08-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Crack segmentation network based on hybrid-window transformer and dual-branch fusion\",\"authors\":\"Jianxin Wang, Jin Wang, Yi Li, Dongdong Ge, Siyuan Zhou\",\"doi\":\"10.1007/s10489-025-06822-6\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p>Cracks are external manifestations of infrastructure damage, and routine inspections are crucial for assessing their structural safety. However, due to factors such as crack diversity, background noise interference, and information loss, high-precision crack segmentation still faces numerous challenges. To alleviate the influence of these factors, a crack segmentation network based on hybrid-window Transformer and dual-branch fusion (CSHD) is proposed. The CSHD network can effectively capture local texture details and global context modeling to achieve high-precision crack segmentation. First, a hybrid-window attention mechanism(HWA) is designed as the core component, which employs a dual-branch parallel architecture to integrate channel attention and multi-scale depth-wise convolution modules on the value path of window attention, achieving spatial receptive field expansion and cross-window feature interaction. Second, to enhance feature processing capabilities, a locally enhanced gated FeedForward network (LeGN) is proposed, which achieves adaptive feature aggregation through overlapping multi-scale deformable convolution, and a gated unit is designed to optimize the information flow. Thirdly, a dual-branch fusion module (DBF) is introduced in skip-connections of encoder-decoder layers to enhance cross-level feature interaction while effectively mitigating information loss during the downsampling process. Finally, comparative experimental results on three benchmark datasets (CrackLS315, DeepCrack537, and YCD776) with seven advanced networks demonstrate that the proposed network achieves excellent performance, obtaining mean intersection over union (mIoU) scores of 71.26%, 86.58%, and 83.93%, respectively. Code is available at: https://github.com/wjxcsust2024/CSHD.</p></div>\",\"PeriodicalId\":8041,\"journal\":{\"name\":\"Applied Intelligence\",\"volume\":\"55 13\",\"pages\":\"\"},\"PeriodicalIF\":3.5000,\"publicationDate\":\"2025-08-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Applied Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://link.springer.com/article/10.1007/s10489-025-06822-6\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Applied Intelligence","FirstCategoryId":"94","ListUrlMain":"https://link.springer.com/article/10.1007/s10489-025-06822-6","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Crack segmentation network based on hybrid-window transformer and dual-branch fusion
Cracks are external manifestations of infrastructure damage, and routine inspections are crucial for assessing their structural safety. However, due to factors such as crack diversity, background noise interference, and information loss, high-precision crack segmentation still faces numerous challenges. To alleviate the influence of these factors, a crack segmentation network based on hybrid-window Transformer and dual-branch fusion (CSHD) is proposed. The CSHD network can effectively capture local texture details and global context modeling to achieve high-precision crack segmentation. First, a hybrid-window attention mechanism(HWA) is designed as the core component, which employs a dual-branch parallel architecture to integrate channel attention and multi-scale depth-wise convolution modules on the value path of window attention, achieving spatial receptive field expansion and cross-window feature interaction. Second, to enhance feature processing capabilities, a locally enhanced gated FeedForward network (LeGN) is proposed, which achieves adaptive feature aggregation through overlapping multi-scale deformable convolution, and a gated unit is designed to optimize the information flow. Thirdly, a dual-branch fusion module (DBF) is introduced in skip-connections of encoder-decoder layers to enhance cross-level feature interaction while effectively mitigating information loss during the downsampling process. Finally, comparative experimental results on three benchmark datasets (CrackLS315, DeepCrack537, and YCD776) with seven advanced networks demonstrate that the proposed network achieves excellent performance, obtaining mean intersection over union (mIoU) scores of 71.26%, 86.58%, and 83.93%, respectively. Code is available at: https://github.com/wjxcsust2024/CSHD.
期刊介绍:
With a focus on research in artificial intelligence and neural networks, this journal addresses issues involving solutions of real-life manufacturing, defense, management, government and industrial problems which are too complex to be solved through conventional approaches and require the simulation of intelligent thought processes, heuristics, applications of knowledge, and distributed and parallel processing. The integration of these multiple approaches in solving complex problems is of particular importance.
The journal presents new and original research and technological developments, addressing real and complex issues applicable to difficult problems. It provides a medium for exchanging scientific research and technological achievements accomplished by the international community.