Mingchen Wei , Gengkun Wu , Letian Wang , Mengqian Li
{"title":"裂纹van:一种基于多尺度特征细化和大核关注的裂纹危险图像检测方法","authors":"Mingchen Wei , Gengkun Wu , Letian Wang , Mengqian Li","doi":"10.1016/j.measurement.2025.117766","DOIUrl":null,"url":null,"abstract":"<div><div>Detection and segmentation of mine cracks are a crucial step in the process of eliminating the safety hazards of mine caused by large-scale mining on the floor of mines. However, the effectiveness of the current segmentation network still needs to be improved due to various reasons, especially the interference of shadows and the edge details of the cracks. In order to enhance the efficacy of crack detection, a multi-scale dilated larger kernel attention model has been developed. Furthermore, utilizing the visual attention network (VAN) architecture, we propose a multiscale dilated visual attention network (MD-VAN) for the early-stage screening of images containing cracks. In the crack detection stage, we propose a segmentation network, CrackVAN, which is based on multiscale feature refinement using MD-VAN as an encoder. In the feature processing part, we propose a multiscale feature refinement module (MSRF) for processing deep features, which enhances the information of cracks in the deep features by multiscale DWConv. Additionally, a convolutional pyramid compressed attention (CPSA) is proposed to further optimize the shallow features, enhancing the crack boundary information by increasing the attention at both the spatial and channel levels. Experimental results on the mine dataset show that MD-VAN achieves accuracy (94.12%) compared to other networks, while CrackVAN achieves mIoU (92.13%) and F1-score (95.76%). In addition, an evaluation of the efficacy of the proposed methodology was conducted on two publicly available datasets, namely CFD and DeepCrack, and our method gets better performance.</div></div>","PeriodicalId":18349,"journal":{"name":"Measurement","volume":"253 ","pages":"Article 117766"},"PeriodicalIF":5.2000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"CrackVAN: A detection method of crack hazard images based on multi-scale feature refinement and large kernel attention\",\"authors\":\"Mingchen Wei , Gengkun Wu , Letian Wang , Mengqian Li\",\"doi\":\"10.1016/j.measurement.2025.117766\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Detection and segmentation of mine cracks are a crucial step in the process of eliminating the safety hazards of mine caused by large-scale mining on the floor of mines. However, the effectiveness of the current segmentation network still needs to be improved due to various reasons, especially the interference of shadows and the edge details of the cracks. In order to enhance the efficacy of crack detection, a multi-scale dilated larger kernel attention model has been developed. Furthermore, utilizing the visual attention network (VAN) architecture, we propose a multiscale dilated visual attention network (MD-VAN) for the early-stage screening of images containing cracks. In the crack detection stage, we propose a segmentation network, CrackVAN, which is based on multiscale feature refinement using MD-VAN as an encoder. In the feature processing part, we propose a multiscale feature refinement module (MSRF) for processing deep features, which enhances the information of cracks in the deep features by multiscale DWConv. Additionally, a convolutional pyramid compressed attention (CPSA) is proposed to further optimize the shallow features, enhancing the crack boundary information by increasing the attention at both the spatial and channel levels. Experimental results on the mine dataset show that MD-VAN achieves accuracy (94.12%) compared to other networks, while CrackVAN achieves mIoU (92.13%) and F1-score (95.76%). In addition, an evaluation of the efficacy of the proposed methodology was conducted on two publicly available datasets, namely CFD and DeepCrack, and our method gets better performance.</div></div>\",\"PeriodicalId\":18349,\"journal\":{\"name\":\"Measurement\",\"volume\":\"253 \",\"pages\":\"Article 117766\"},\"PeriodicalIF\":5.2000,\"publicationDate\":\"2025-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Measurement\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S026322412501125X\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Measurement","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S026322412501125X","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
CrackVAN: A detection method of crack hazard images based on multi-scale feature refinement and large kernel attention
Detection and segmentation of mine cracks are a crucial step in the process of eliminating the safety hazards of mine caused by large-scale mining on the floor of mines. However, the effectiveness of the current segmentation network still needs to be improved due to various reasons, especially the interference of shadows and the edge details of the cracks. In order to enhance the efficacy of crack detection, a multi-scale dilated larger kernel attention model has been developed. Furthermore, utilizing the visual attention network (VAN) architecture, we propose a multiscale dilated visual attention network (MD-VAN) for the early-stage screening of images containing cracks. In the crack detection stage, we propose a segmentation network, CrackVAN, which is based on multiscale feature refinement using MD-VAN as an encoder. In the feature processing part, we propose a multiscale feature refinement module (MSRF) for processing deep features, which enhances the information of cracks in the deep features by multiscale DWConv. Additionally, a convolutional pyramid compressed attention (CPSA) is proposed to further optimize the shallow features, enhancing the crack boundary information by increasing the attention at both the spatial and channel levels. Experimental results on the mine dataset show that MD-VAN achieves accuracy (94.12%) compared to other networks, while CrackVAN achieves mIoU (92.13%) and F1-score (95.76%). In addition, an evaluation of the efficacy of the proposed methodology was conducted on two publicly available datasets, namely CFD and DeepCrack, and our method gets better performance.
期刊介绍:
Contributions are invited on novel achievements in all fields of measurement and instrumentation science and technology. Authors are encouraged to submit novel material, whose ultimate goal is an advancement in the state of the art of: measurement and metrology fundamentals, sensors, measurement instruments, measurement and estimation techniques, measurement data processing and fusion algorithms, evaluation procedures and methodologies for plants and industrial processes, performance analysis of systems, processes and algorithms, mathematical models for measurement-oriented purposes, distributed measurement systems in a connected world.