Self-supervised representation learning anomaly detection methodology based on boosting algorithms enhanced by data augmentation using StyleGAN for manufacturing imbalanced data

IF 8.2 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Yoonseok Kim , Taeheon Lee , Youngjoo Hyun , Eric Coatanea , Siren Mika , Jeonghoon Mo , YoungJun Yoo
{"title":"Self-supervised representation learning anomaly detection methodology based on boosting algorithms enhanced by data augmentation using StyleGAN for manufacturing imbalanced data","authors":"Yoonseok Kim ,&nbsp;Taeheon Lee ,&nbsp;Youngjoo Hyun ,&nbsp;Eric Coatanea ,&nbsp;Siren Mika ,&nbsp;Jeonghoon Mo ,&nbsp;YoungJun Yoo","doi":"10.1016/j.compind.2023.104024","DOIUrl":null,"url":null,"abstract":"<div><p>This study proposes a methodology for detecting anomalies in the manufacturing industry using a self-supervised representation learning approach based on deep generative models. The challenge arises from the limited availability of data on defective products compared with normal data, leading to degradation in the performance of deep learning models owing to data imbalances. To address this limitation, we propose a process that leverages the Gramian angular field to transform time-series data into images, applies StyleGAN for image augmentation of anomalous data, and utilizes a boosting algorithm for classifier selection in supervised learning. Additionally, we compared the accuracy of the classifier before and after data augmentation. In experimental cases involving CNC milling machine data and wire arc additive manufacturing data, the proposed approach outperformed the approach before augmentation, resulting in improved precision, recall, and F1-score for anomaly detection. Furthermore, Bayesian optimization of the hyperparameters of the boosting algorithm further enhanced the performance metrics. The proposed process effectively addresses the data imbalance problem, and demonstrates its applicability to various manufacturing industries.</p></div>","PeriodicalId":55219,"journal":{"name":"Computers in Industry","volume":null,"pages":null},"PeriodicalIF":8.2000,"publicationDate":"2023-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in Industry","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0166361523001744","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

Abstract

This study proposes a methodology for detecting anomalies in the manufacturing industry using a self-supervised representation learning approach based on deep generative models. The challenge arises from the limited availability of data on defective products compared with normal data, leading to degradation in the performance of deep learning models owing to data imbalances. To address this limitation, we propose a process that leverages the Gramian angular field to transform time-series data into images, applies StyleGAN for image augmentation of anomalous data, and utilizes a boosting algorithm for classifier selection in supervised learning. Additionally, we compared the accuracy of the classifier before and after data augmentation. In experimental cases involving CNC milling machine data and wire arc additive manufacturing data, the proposed approach outperformed the approach before augmentation, resulting in improved precision, recall, and F1-score for anomaly detection. Furthermore, Bayesian optimization of the hyperparameters of the boosting algorithm further enhanced the performance metrics. The proposed process effectively addresses the data imbalance problem, and demonstrates its applicability to various manufacturing industries.

基于StyleGAN数据增强增强算法的自监督表示学习异常检测方法
本研究提出了一种使用基于深度生成模型的自监督表示学习方法来检测制造业异常的方法。与正常数据相比,缺陷产品的数据可用性有限,导致数据失衡导致深度学习模型的性能下降。为了解决这一限制,我们提出了一种利用Gramian角场将时间序列数据转换为图像的过程,将StyleGAN应用于异常数据的图像增强,并在监督学习中使用boosting算法进行分类器选择。此外,我们还比较了数据增强前后分类器的准确性。在涉及数控铣床数据和线弧增材制造数据的实验案例中,所提出的方法优于增强前的方法,从而提高了异常检测的精度、召回率和F1分数。此外,增强算法的超参数的贝叶斯优化进一步增强了性能度量。所提出的过程有效地解决了数据不平衡问题,并证明了其适用于各种制造业。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers in Industry
Computers in Industry 工程技术-计算机:跨学科应用
CiteScore
18.90
自引率
8.00%
发文量
152
审稿时长
22 days
期刊介绍: The objective of Computers in Industry is to present original, high-quality, application-oriented research papers that: • Illuminate emerging trends and possibilities in the utilization of Information and Communication Technology in industry; • Establish connections or integrations across various technology domains within the expansive realm of computer applications for industry; • Foster connections or integrations across diverse application areas of ICT in industry.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信