{"title":"基于作物膏体和扩散的金属缺陷半监督分割网络","authors":"Lixiang Zhao, Jianbo Yu","doi":"10.1016/j.knosys.2025.113573","DOIUrl":null,"url":null,"abstract":"<div><div>Metal defect semantic segmentation is a crucial process for classifying and locating defects during the industrial production process, which holds paramount importance in elevating the quality of metal products. Recently, deep learning has exhibited impressive capabilities in identifying and segmenting defects on metal surfaces. However, the prevalent use of fully supervised segmentation techniques demands a substantial amount of annotated data for effective model training, which is hard to obtain in real scenarios. Additionally, most defects of metal products exhibit indistinct edge details, which hinders precise defect localization. In this study, a Crop-Paste and diffusion-based semi-supervised segmentation network (CPDNet) is proposed to identify pixel-level defects on metal surfaces by utilizing data that are both labeled and unlabeled. Firstly, a semi-supervised training method Crop-Paste is proposed to facilitate the learning of comprehensive semantic features from an extensive of unlabeled images and a restricted set of labeled images. Secondly, a frequency-directed diffusion model is proposed to recover high frequency features of defects to generate more accurate segmentation results. Lastly, an edge aware module is proposed in Sobel mean-teacher (M-T) UNet to improve the boundary information representation associated with defects. The experimental results on four datasets related to metal surface defects and a multimodal dataset show that CPDNet achieves a better performance in comparison with those state-of-the-art methods.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"318 ","pages":"Article 113573"},"PeriodicalIF":7.2000,"publicationDate":"2025-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Crop-paste and diffusion-based semi-supervised segmentation network for metal defect detection\",\"authors\":\"Lixiang Zhao, Jianbo Yu\",\"doi\":\"10.1016/j.knosys.2025.113573\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Metal defect semantic segmentation is a crucial process for classifying and locating defects during the industrial production process, which holds paramount importance in elevating the quality of metal products. Recently, deep learning has exhibited impressive capabilities in identifying and segmenting defects on metal surfaces. However, the prevalent use of fully supervised segmentation techniques demands a substantial amount of annotated data for effective model training, which is hard to obtain in real scenarios. Additionally, most defects of metal products exhibit indistinct edge details, which hinders precise defect localization. In this study, a Crop-Paste and diffusion-based semi-supervised segmentation network (CPDNet) is proposed to identify pixel-level defects on metal surfaces by utilizing data that are both labeled and unlabeled. Firstly, a semi-supervised training method Crop-Paste is proposed to facilitate the learning of comprehensive semantic features from an extensive of unlabeled images and a restricted set of labeled images. Secondly, a frequency-directed diffusion model is proposed to recover high frequency features of defects to generate more accurate segmentation results. Lastly, an edge aware module is proposed in Sobel mean-teacher (M-T) UNet to improve the boundary information representation associated with defects. The experimental results on four datasets related to metal surface defects and a multimodal dataset show that CPDNet achieves a better performance in comparison with those state-of-the-art methods.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"318 \",\"pages\":\"Article 113573\"},\"PeriodicalIF\":7.2000,\"publicationDate\":\"2025-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705125006197\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125006197","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Crop-paste and diffusion-based semi-supervised segmentation network for metal defect detection
Metal defect semantic segmentation is a crucial process for classifying and locating defects during the industrial production process, which holds paramount importance in elevating the quality of metal products. Recently, deep learning has exhibited impressive capabilities in identifying and segmenting defects on metal surfaces. However, the prevalent use of fully supervised segmentation techniques demands a substantial amount of annotated data for effective model training, which is hard to obtain in real scenarios. Additionally, most defects of metal products exhibit indistinct edge details, which hinders precise defect localization. In this study, a Crop-Paste and diffusion-based semi-supervised segmentation network (CPDNet) is proposed to identify pixel-level defects on metal surfaces by utilizing data that are both labeled and unlabeled. Firstly, a semi-supervised training method Crop-Paste is proposed to facilitate the learning of comprehensive semantic features from an extensive of unlabeled images and a restricted set of labeled images. Secondly, a frequency-directed diffusion model is proposed to recover high frequency features of defects to generate more accurate segmentation results. Lastly, an edge aware module is proposed in Sobel mean-teacher (M-T) UNet to improve the boundary information representation associated with defects. The experimental results on four datasets related to metal surface defects and a multimodal dataset show that CPDNet achieves a better performance in comparison with those state-of-the-art methods.
期刊介绍:
Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.