{"title":"MCF-Net:用于织物缺陷实时检测的多尺度上下文融合网络","authors":"Yayong Wang, Zhong Xiang, Weitao Wu, Qiang Wu","doi":"10.1016/j.dsp.2025.105425","DOIUrl":null,"url":null,"abstract":"<div><div>Fabric defect detection plays a critical role in textile manufacturing to ensure product quality. However, challenges such as significant variations in defect sizes result in poor detection performance of existing models. To address these challenges, this paper proposes a Multi-scale Context Fusion Network (MCF-Net). First, The Multi-scale Context Diffusion Fusion Pyramid Network (MCD-FPN) was proposed, which reconfigures the traditional feature fusion path and designed a key component, the Multi-scale Context Aggregation Module (MCAM). MCAM takes multi-scale features extracted from the backbone network as input and establishes long-range dependencies between features and their surrounding background through a series of deep convolutional operations. The output of MCAM is then used to enhance both shallow and deep features, ensuring that each detection scale contains comprehensive multi-scale context information. Furthermore, to improve the network’s feature extraction capability, the Latent Feature Transformer (LFT) was proposed, which maps input features into a high-dimensional space and extracts depth features through high-dimensional information compression. Second, to improve the network’s multi-scale perception, the Local Cross Attention Mechanism (LCA) was proposed, which models the spatial information of the features to improve the network’s understanding of global context information. Experimental results on the self-built FD6052 dataset, as well as the publicly available TianChi and DAGM2007 datasets, demonstrate the effectiveness of MCF-Net, which outperforms existing methods in fabric defect detection. In addition, MCF-Net achieves an inference speed of 85.6 FPS, enabling real-time detection at industrial scale.</div></div>","PeriodicalId":51011,"journal":{"name":"Digital Signal Processing","volume":"167 ","pages":"Article 105425"},"PeriodicalIF":2.9000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MCF-Net: a multi-scale context fusion network for real-time fabric defect detection\",\"authors\":\"Yayong Wang, Zhong Xiang, Weitao Wu, Qiang Wu\",\"doi\":\"10.1016/j.dsp.2025.105425\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Fabric defect detection plays a critical role in textile manufacturing to ensure product quality. However, challenges such as significant variations in defect sizes result in poor detection performance of existing models. To address these challenges, this paper proposes a Multi-scale Context Fusion Network (MCF-Net). First, The Multi-scale Context Diffusion Fusion Pyramid Network (MCD-FPN) was proposed, which reconfigures the traditional feature fusion path and designed a key component, the Multi-scale Context Aggregation Module (MCAM). MCAM takes multi-scale features extracted from the backbone network as input and establishes long-range dependencies between features and their surrounding background through a series of deep convolutional operations. The output of MCAM is then used to enhance both shallow and deep features, ensuring that each detection scale contains comprehensive multi-scale context information. Furthermore, to improve the network’s feature extraction capability, the Latent Feature Transformer (LFT) was proposed, which maps input features into a high-dimensional space and extracts depth features through high-dimensional information compression. Second, to improve the network’s multi-scale perception, the Local Cross Attention Mechanism (LCA) was proposed, which models the spatial information of the features to improve the network’s understanding of global context information. Experimental results on the self-built FD6052 dataset, as well as the publicly available TianChi and DAGM2007 datasets, demonstrate the effectiveness of MCF-Net, which outperforms existing methods in fabric defect detection. In addition, MCF-Net achieves an inference speed of 85.6 FPS, enabling real-time detection at industrial scale.</div></div>\",\"PeriodicalId\":51011,\"journal\":{\"name\":\"Digital Signal Processing\",\"volume\":\"167 \",\"pages\":\"Article 105425\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Digital Signal Processing\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S1051200425004476\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Digital Signal Processing","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1051200425004476","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
MCF-Net: a multi-scale context fusion network for real-time fabric defect detection
Fabric defect detection plays a critical role in textile manufacturing to ensure product quality. However, challenges such as significant variations in defect sizes result in poor detection performance of existing models. To address these challenges, this paper proposes a Multi-scale Context Fusion Network (MCF-Net). First, The Multi-scale Context Diffusion Fusion Pyramid Network (MCD-FPN) was proposed, which reconfigures the traditional feature fusion path and designed a key component, the Multi-scale Context Aggregation Module (MCAM). MCAM takes multi-scale features extracted from the backbone network as input and establishes long-range dependencies between features and their surrounding background through a series of deep convolutional operations. The output of MCAM is then used to enhance both shallow and deep features, ensuring that each detection scale contains comprehensive multi-scale context information. Furthermore, to improve the network’s feature extraction capability, the Latent Feature Transformer (LFT) was proposed, which maps input features into a high-dimensional space and extracts depth features through high-dimensional information compression. Second, to improve the network’s multi-scale perception, the Local Cross Attention Mechanism (LCA) was proposed, which models the spatial information of the features to improve the network’s understanding of global context information. Experimental results on the self-built FD6052 dataset, as well as the publicly available TianChi and DAGM2007 datasets, demonstrate the effectiveness of MCF-Net, which outperforms existing methods in fabric defect detection. In addition, MCF-Net achieves an inference speed of 85.6 FPS, enabling real-time detection at industrial scale.
期刊介绍:
Digital Signal Processing: A Review Journal is one of the oldest and most established journals in the field of signal processing yet it aims to be the most innovative. The Journal invites top quality research articles at the frontiers of research in all aspects of signal processing. Our objective is to provide a platform for the publication of ground-breaking research in signal processing with both academic and industrial appeal.
The journal has a special emphasis on statistical signal processing methodology such as Bayesian signal processing, and encourages articles on emerging applications of signal processing such as:
• big data• machine learning• internet of things• information security• systems biology and computational biology,• financial time series analysis,• autonomous vehicles,• quantum computing,• neuromorphic engineering,• human-computer interaction and intelligent user interfaces,• environmental signal processing,• geophysical signal processing including seismic signal processing,• chemioinformatics and bioinformatics,• audio, visual and performance arts,• disaster management and prevention,• renewable energy,