Hao Shi , Yifeng Pan , Ruoxiang Gao , Zhengchuan Guo , Chengqian Zhang , Peng Zhao
{"title":"DAE-SWnet: Unsupervised internal defect segmentation through infrared thermography with scarce samples","authors":"Hao Shi , Yifeng Pan , Ruoxiang Gao , Zhengchuan Guo , Chengqian Zhang , Peng Zhao","doi":"10.1016/j.jmsy.2025.07.016","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate defect detection is essential for manufacturing reliability, yet identifying internal defects remains challenging. While infrared thermography offers distinct advantages for internal inspection, its effectiveness is hindered by noise and uneven heating. Traditional image processing algorithms struggle with these nonlinearities, while supervised deep learning methods require large annotated datasets, usually impractical in industrial settings. To overcome data scarcity, we propose a novel infrared data amplification strategy leveraging the dynamic temperature evolution. By varying defect depth and heating duration, we generate 16000 thermal images from merely two defective samples using pulsed thermography. Furthermore, we introduce an unsupervised defect segmentation framework, Deep Autoencoder with Swin Transformer Wnet (DAE-SWnet). First, a deep autoencoder denoises thermal images, during which we discover a non-monotonic relationship between reconstruction loss and denoising performance. Next, Swin Transformer and Wnet are cohesively integrated with optimized encoder-decoder channels, to extract latent defect features from denoised images. These latent representations are postprocessed to obtain final segmentation results. Trained solely on artificially designed defects, our model exhibits exceptional performance across samples with varying materials and defect shapes. Moreover, comparative experiments demonstrate that the model achieves higher precision, stronger stability, and better generalization to real-world manufacturing processes. Specifically, it achieves 74.0 % IoU, 84.3 % F1-score, and 97.7 % accuracy, with an average inference time of 0.278 s, highlighting its superiority and practical potential in industrial scenarios where defective products are difficult to obtain and annotate.</div></div>","PeriodicalId":16227,"journal":{"name":"Journal of Manufacturing Systems","volume":"82 ","pages":"Pages 766-785"},"PeriodicalIF":14.2000,"publicationDate":"2025-08-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Manufacturing Systems","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0278612525001918","RegionNum":1,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, INDUSTRIAL","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate defect detection is essential for manufacturing reliability, yet identifying internal defects remains challenging. While infrared thermography offers distinct advantages for internal inspection, its effectiveness is hindered by noise and uneven heating. Traditional image processing algorithms struggle with these nonlinearities, while supervised deep learning methods require large annotated datasets, usually impractical in industrial settings. To overcome data scarcity, we propose a novel infrared data amplification strategy leveraging the dynamic temperature evolution. By varying defect depth and heating duration, we generate 16000 thermal images from merely two defective samples using pulsed thermography. Furthermore, we introduce an unsupervised defect segmentation framework, Deep Autoencoder with Swin Transformer Wnet (DAE-SWnet). First, a deep autoencoder denoises thermal images, during which we discover a non-monotonic relationship between reconstruction loss and denoising performance. Next, Swin Transformer and Wnet are cohesively integrated with optimized encoder-decoder channels, to extract latent defect features from denoised images. These latent representations are postprocessed to obtain final segmentation results. Trained solely on artificially designed defects, our model exhibits exceptional performance across samples with varying materials and defect shapes. Moreover, comparative experiments demonstrate that the model achieves higher precision, stronger stability, and better generalization to real-world manufacturing processes. Specifically, it achieves 74.0 % IoU, 84.3 % F1-score, and 97.7 % accuracy, with an average inference time of 0.278 s, highlighting its superiority and practical potential in industrial scenarios where defective products are difficult to obtain and annotate.
期刊介绍:
The Journal of Manufacturing Systems is dedicated to showcasing cutting-edge fundamental and applied research in manufacturing at the systems level. Encompassing products, equipment, people, information, control, and support functions, manufacturing systems play a pivotal role in the economical and competitive development, production, delivery, and total lifecycle of products, meeting market and societal needs.
With a commitment to publishing archival scholarly literature, the journal strives to advance the state of the art in manufacturing systems and foster innovation in crafting efficient, robust, and sustainable manufacturing systems. The focus extends from equipment-level considerations to the broader scope of the extended enterprise. The Journal welcomes research addressing challenges across various scales, including nano, micro, and macro-scale manufacturing, and spanning diverse sectors such as aerospace, automotive, energy, and medical device manufacturing.