Multi-position industrial defect inspection using self-training siamese networks with mix strategies

IF 10.4 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Industrial Information Integration Pub Date : 2024-04-18 DOI:10.1016/j.jii.2024.100615

Fangjun Wang , Xurong Chi , Liangwu Wei , Yanzhi Song , Zhouwang Yang

{"title":"Multi-position industrial defect inspection using self-training siamese networks with mix strategies","authors":"Fangjun Wang , Xurong Chi , Liangwu Wei , Yanzhi Song , Zhouwang Yang","doi":"10.1016/j.jii.2024.100615","DOIUrl":null,"url":null,"abstract":"<div><p>Structural defects account for a large proportion of defects, and acquiring large batches of high-quality labels is labor-intensive and time-consuming for industrial visual defect inspection tasks. This paper addresses the above problem by exploiting sufficient unlabeled samples, and aims to achieve superior model performance with some labeled data by using self-training methods that incorporate positional information. Specifically, this paper proposes a novel self-training architecture, MixSiam, which uses a Multi-Position-based Mix strategy (MPMix) and Siamese network structure for defect classification. Furthermore, considering the prediction noise problem in unlabeled data during training, we propose a progressive MPMix (MPMix+) strategy to reduce the negative impacts of noise on model training. Finally, we validate the effectiveness of our architecture on industrial datasets. For example, our method achieves 71.40% and 87.01% accuracy on the SMT (Surface Mounting Technology) dataset and MBH (Motor Brush Holder) dataset with only 100 labeled samples, which are 2.40% and 5.86% higher than the state-of-the-art FixMatch method, respectively. Compared with the supervised algorithm with 3,600 labels, our method achieves comparable accuracy on the SMT and MBH datasets, respectively, while saving 2/3 the amount of labeled data. In conclusion, MixSiam effectively utilizes unlabeled industrial data and improves model accuracy with fewer labeled samples, thus reducing the burden of data annotation in industrial production.</p></div>","PeriodicalId":55975,"journal":{"name":"Journal of Industrial Information Integration","volume":"40 ","pages":"Article 100615"},"PeriodicalIF":10.4000,"publicationDate":"2024-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Industrial Information Integration","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2452414X24000591","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

Structural defects account for a large proportion of defects, and acquiring large batches of high-quality labels is labor-intensive and time-consuming for industrial visual defect inspection tasks. This paper addresses the above problem by exploiting sufficient unlabeled samples, and aims to achieve superior model performance with some labeled data by using self-training methods that incorporate positional information. Specifically, this paper proposes a novel self-training architecture, MixSiam, which uses a Multi-Position-based Mix strategy (MPMix) and Siamese network structure for defect classification. Furthermore, considering the prediction noise problem in unlabeled data during training, we propose a progressive MPMix (MPMix+) strategy to reduce the negative impacts of noise on model training. Finally, we validate the effectiveness of our architecture on industrial datasets. For example, our method achieves 71.40% and 87.01% accuracy on the SMT (Surface Mounting Technology) dataset and MBH (Motor Brush Holder) dataset with only 100 labeled samples, which are 2.40% and 5.86% higher than the state-of-the-art FixMatch method, respectively. Compared with the supervised algorithm with 3,600 labels, our method achieves comparable accuracy on the SMT and MBH datasets, respectively, while saving 2/3 the amount of labeled data. In conclusion, MixSiam effectively utilizes unlabeled industrial data and improves model accuracy with fewer labeled samples, thus reducing the burden of data annotation in industrial production.

Abstract Image

查看原文本刊更多论文

使用混合策略自训练连体网络进行多位置工业缺陷检测

结构性缺陷在缺陷中占很大比例，而对于工业视觉缺陷检测任务来说，获取大批量高质量的标签既耗费人力又耗费时间。本文通过利用充足的未标记样本来解决上述问题，并通过使用包含位置信息的自我训练方法，旨在利用部分标记数据实现卓越的模型性能。具体来说，本文提出了一种新颖的自训练架构 MixSiam，它使用基于多位置的混合策略（MPMix）和连体网络结构进行缺陷分类。此外，考虑到训练过程中未标记数据的预测噪声问题，我们提出了渐进式 MPMix（MPMix+）策略，以减少噪声对模型训练的负面影响。最后，我们在工业数据集上验证了我们架构的有效性。例如，在只有 100 个标注样本的 SMT（表面安装技术）数据集和 MBH（电机电刷座）数据集上，我们的方法分别实现了 71.40% 和 87.01% 的准确率，比最先进的 FixMatch 方法分别高出 2.40% 和 5.86%。与使用 3,600 个标签的监督算法相比，我们的方法在 SMT 和 MBH 数据集上分别达到了相当的准确率，同时节省了 2/3 的标签数据量。总之，MixSiam 有效地利用了未标注的工业数据，以较少的标注样本提高了模型的准确性，从而减轻了工业生产中数据标注的负担。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Journal of Industrial Information Integration Decision Sciences-Information Systems and Management

CiteScore

22.30

自引率

13.40%

发文量

100

期刊介绍： The Journal of Industrial Information Integration focuses on the industry's transition towards industrial integration and informatization, covering not only hardware and software but also information integration. It serves as a platform for promoting advances in industrial information integration, addressing challenges, issues, and solutions in an interdisciplinary forum for researchers, practitioners, and policy makers. The Journal of Industrial Information Integration welcomes papers on foundational, technical, and practical aspects of industrial information integration, emphasizing the complex and cross-disciplinary topics that arise in industrial integration. Techniques from mathematical science, computer science, computer engineering, electrical and electronic engineering, manufacturing engineering, and engineering management are crucial in this context.