Self-supervised representation learning anomaly detection methodology based on boosting algorithms enhanced by data augmentation using StyleGAN for manufacturing imbalanced data

IF 8.2 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Computers in Industry Pub Date : 2023-10-08 DOI:10.1016/j.compind.2023.104024

Yoonseok Kim , Taeheon Lee , Youngjoo Hyun , Eric Coatanea , Siren Mika , Jeonghoon Mo , YoungJun Yoo

{"title":"Self-supervised representation learning anomaly detection methodology based on boosting algorithms enhanced by data augmentation using StyleGAN for manufacturing imbalanced data","authors":"Yoonseok Kim , Taeheon Lee , Youngjoo Hyun , Eric Coatanea , Siren Mika , Jeonghoon Mo , YoungJun Yoo","doi":"10.1016/j.compind.2023.104024","DOIUrl":null,"url":null,"abstract":"<div><p>This study proposes a methodology for detecting anomalies in the manufacturing industry using a self-supervised representation learning approach based on deep generative models. The challenge arises from the limited availability of data on defective products compared with normal data, leading to degradation in the performance of deep learning models owing to data imbalances. To address this limitation, we propose a process that leverages the Gramian angular field to transform time-series data into images, applies StyleGAN for image augmentation of anomalous data, and utilizes a boosting algorithm for classifier selection in supervised learning. Additionally, we compared the accuracy of the classifier before and after data augmentation. In experimental cases involving CNC milling machine data and wire arc additive manufacturing data, the proposed approach outperformed the approach before augmentation, resulting in improved precision, recall, and F1-score for anomaly detection. Furthermore, Bayesian optimization of the hyperparameters of the boosting algorithm further enhanced the performance metrics. The proposed process effectively addresses the data imbalance problem, and demonstrates its applicability to various manufacturing industries.</p></div>","PeriodicalId":55219,"journal":{"name":"Computers in Industry","volume":null,"pages":null},"PeriodicalIF":8.2000,"publicationDate":"2023-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Industry","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166361523001744","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

Abstract

This study proposes a methodology for detecting anomalies in the manufacturing industry using a self-supervised representation learning approach based on deep generative models. The challenge arises from the limited availability of data on defective products compared with normal data, leading to degradation in the performance of deep learning models owing to data imbalances. To address this limitation, we propose a process that leverages the Gramian angular field to transform time-series data into images, applies StyleGAN for image augmentation of anomalous data, and utilizes a boosting algorithm for classifier selection in supervised learning. Additionally, we compared the accuracy of the classifier before and after data augmentation. In experimental cases involving CNC milling machine data and wire arc additive manufacturing data, the proposed approach outperformed the approach before augmentation, resulting in improved precision, recall, and F1-score for anomaly detection. Furthermore, Bayesian optimization of the hyperparameters of the boosting algorithm further enhanced the performance metrics. The proposed process effectively addresses the data imbalance problem, and demonstrates its applicability to various manufacturing industries.

查看原文本刊更多论文

基于StyleGAN数据增强增强算法的自监督表示学习异常检测方法

本研究提出了一种使用基于深度生成模型的自监督表示学习方法来检测制造业异常的方法。与正常数据相比，缺陷产品的数据可用性有限，导致数据失衡导致深度学习模型的性能下降。为了解决这一限制，我们提出了一种利用Gramian角场将时间序列数据转换为图像的过程，将StyleGAN应用于异常数据的图像增强，并在监督学习中使用boosting算法进行分类器选择。此外，我们还比较了数据增强前后分类器的准确性。在涉及数控铣床数据和线弧增材制造数据的实验案例中，所提出的方法优于增强前的方法，从而提高了异常检测的精度、召回率和F1分数。此外，增强算法的超参数的贝叶斯优化进一步增强了性能度量。所提出的过程有效地解决了数据不平衡问题，并证明了其适用于各种制造业。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Computers in Industry 工程技术-计算机：跨学科应用

CiteScore

18.90

自引率

8.00%

发文量

152

审稿时长

22 days

期刊介绍： The objective of Computers in Industry is to present original, high-quality, application-oriented research papers that: • Illuminate emerging trends and possibilities in the utilization of Information and Communication Technology in industry; • Establish connections or integrations across various technology domains within the expansive realm of computer applications for industry; • Foster connections or integrations across diverse application areas of ICT in industry.