An interactive cross-multi-feature fusion approach for salient object detection in crack segmentation

IF 8.5 1区工程技术 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Computer-Aided Civil and Infrastructure Engineering Pub Date : 2025-02-09 DOI:10.1111/mice.13437

Jian Liu, Pei Niu, Lei Kou, Yalin Zhang, Honglei Chang, Feng Guo

{"title":"An interactive cross-multi-feature fusion approach for salient object detection in crack segmentation","authors":"Jian Liu, Pei Niu, Lei Kou, Yalin Zhang, Honglei Chang, Feng Guo","doi":"10.1111/mice.13437","DOIUrl":null,"url":null,"abstract":"<p>Salient object detection (SOD) is a crucial preprocessing technique in visual computing, which identifies the salient regions in an image by simulating the human visual perception system. It achieves remarkable results in tasks such as image quality assessment, editing, and object recognition. However, due to the particularity of pavement crack detection in terms of scale and feature requirements, the SOD model is rarely applied in pavement surface crack detection at present. In order to break the existing dilemma, this paper proposes a new SOD model (iU2Net) specialized for crack detection, which is based on the encoder–decoder structure of U2Net and incorporates the developed interactive cross-multi-feature fusion module (ICMFM). Compared with the existing models, the main contributions of iU2Net are reflected in two aspects. On the one hand, current models are difficult to comprehensively extract the complex features of cracks while iU2Net achieves a breakthrough in feature extraction by efficiently aggregating multiscale crack features and accurately reconstructing them through its unique architecture. On the other hand, iU2Net focuses on infrastructure crack detection, breaking the limitation of independent processing of traditional feature channels and facilitating information exchange. To validate the model's effectiveness, comprehensive experiments are conducted on a public benchmark dataset. iU2Net is compared with eight existing SOD models (EGNet, PoolNet, MINet, F3Net, U2Net, SegNet, BASNet, and DeepCrack). Training and detection performance is evaluated using average mean absolute error (AveMAE), maximum F1 score (MaxF1), mean F1 score (MeanF1), precision–recall curves, and visualizations. Experimental the results indicate that iU2Net exceeds the behavior of other networks during both the training and testing phases, with MaxF1 and MeanF1 achieving maximum values of 0.912 and 0.730, respectively; and AveMAE of 0.048, which is only 0.005 higher than the minimum value, which demonstrates its effectiveness for pavement surface crack detection and indicating potential future applications involving interactive feature fusion.</p>","PeriodicalId":156,"journal":{"name":"Computer-Aided Civil and Infrastructure Engineering","volume":"40 8","pages":"1080-1099"},"PeriodicalIF":8.5000,"publicationDate":"2025-02-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computer-Aided Civil and Infrastructure Engineering","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/mice.13437","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Salient object detection (SOD) is a crucial preprocessing technique in visual computing, which identifies the salient regions in an image by simulating the human visual perception system. It achieves remarkable results in tasks such as image quality assessment, editing, and object recognition. However, due to the particularity of pavement crack detection in terms of scale and feature requirements, the SOD model is rarely applied in pavement surface crack detection at present. In order to break the existing dilemma, this paper proposes a new SOD model (iU2Net) specialized for crack detection, which is based on the encoder–decoder structure of U2Net and incorporates the developed interactive cross-multi-feature fusion module (ICMFM). Compared with the existing models, the main contributions of iU2Net are reflected in two aspects. On the one hand, current models are difficult to comprehensively extract the complex features of cracks while iU2Net achieves a breakthrough in feature extraction by efficiently aggregating multiscale crack features and accurately reconstructing them through its unique architecture. On the other hand, iU2Net focuses on infrastructure crack detection, breaking the limitation of independent processing of traditional feature channels and facilitating information exchange. To validate the model's effectiveness, comprehensive experiments are conducted on a public benchmark dataset. iU2Net is compared with eight existing SOD models (EGNet, PoolNet, MINet, F3Net, U2Net, SegNet, BASNet, and DeepCrack). Training and detection performance is evaluated using average mean absolute error (AveMAE), maximum F1 score (MaxF1), mean F1 score (MeanF1), precision–recall curves, and visualizations. Experimental the results indicate that iU2Net exceeds the behavior of other networks during both the training and testing phases, with MaxF1 and MeanF1 achieving maximum values of 0.912 and 0.730, respectively; and AveMAE of 0.048, which is only 0.005 higher than the minimum value, which demonstrates its effectiveness for pavement surface crack detection and indicating potential future applications involving interactive feature fusion.

查看原文本刊更多论文

裂纹分割中显著目标检测的交互式交叉多特征融合方法

显著目标检测（SOD）是视觉计算中一项重要的预处理技术，它通过模拟人类视觉感知系统来识别图像中的显著区域。它在图像质量评估、编辑和目标识别等任务中取得了显著的效果。然而，由于路面裂缝检测在尺度和特征要求上的特殊性，目前路面表面裂缝检测中很少应用SOD模型。为了打破现有的困境，本文提出了一种新的专门用于裂纹检测的SOD模型（iU2Net），该模型基于U2Net的编码器-解码器结构，并融合了已开发的交互式跨多特征融合模块（ICMFM）。与现有模型相比，iU2Net的主要贡献体现在两个方面。一方面，现有模型难以全面提取裂缝的复杂特征，而iU2Net通过其独特的架构，高效地聚合多尺度裂缝特征并精确重建，在特征提取方面取得了突破。另一方面，iU2Net侧重于基础设施的裂缝检测，打破了传统特征通道独立处理的限制，促进了信息的交换。为了验证模型的有效性，在一个公开的基准数据集上进行了全面的实验。iU2Net与现有的8种SOD模型（EGNet、PoolNet、MINet、F3Net、U2Net、SegNet、BASNet和DeepCrack）进行了比较。使用平均绝对误差（AveMAE）、最大F1分数（MaxF1）、平均F1分数（MeanF1）、查准率-查全率曲线和可视化来评估训练和检测性能。实验结果表明，iU2Net在训练和测试阶段的行为都优于其他网络，MaxF1和MeanF1分别达到最大值0.912和0.730；AveMAE为0.048，仅比最小值高0.005，表明了该方法在路面表面裂缝检测中的有效性，并预示了交互式特征融合的潜在应用前景。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computer-Aided Civil and Infrastructure Engineering 工程技术-工程：土木

CiteScore

17.60

自引率

19.80%

发文量

146

审稿时长

1 months

期刊介绍： Computer-Aided Civil and Infrastructure Engineering stands as a scholarly, peer-reviewed archival journal, serving as a vital link between advancements in computer technology and civil and infrastructure engineering. The journal serves as a distinctive platform for the publication of original articles, spotlighting novel computational techniques and inventive applications of computers. Specifically, it concentrates on recent progress in computer and information technologies, fostering the development and application of emerging computing paradigms. Encompassing a broad scope, the journal addresses bridge, construction, environmental, highway, geotechnical, structural, transportation, and water resources engineering. It extends its reach to the management of infrastructure systems, covering domains such as highways, bridges, pavements, airports, and utilities. The journal delves into areas like artificial intelligence, cognitive modeling, concurrent engineering, database management, distributed computing, evolutionary computing, fuzzy logic, genetic algorithms, geometric modeling, internet-based technologies, knowledge discovery and engineering, machine learning, mobile computing, multimedia technologies, networking, neural network computing, optimization and search, parallel processing, robotics, smart structures, software engineering, virtual reality, and visualization techniques.