铸造表面数据集和基准,用于在复杂环境中检测细微和易混淆缺陷

IF 4.3 2区 综合性期刊 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC
Qishan Wang;Shuyong Gao;Li Xiong;Aili Liang;Kaidong Jiang;Wenqiang Zhang
{"title":"铸造表面数据集和基准,用于在复杂环境中检测细微和易混淆缺陷","authors":"Qishan Wang;Shuyong Gao;Li Xiong;Aili Liang;Kaidong Jiang;Wenqiang Zhang","doi":"10.1109/JSEN.2024.3387082","DOIUrl":null,"url":null,"abstract":"Industrial anomaly detection (IAD) algorithms are essential for implementing automated quality inspection. Dataset diversity serves as the foundation for developing comprehensive detection algorithms. Existing IAD datasets focus on the diversity of objects and defects, overlooking the diversity of domains within the real data. To bridge this gap, this study proposes the casting surface defect detection (CSDD) dataset, containing 12647 high-resolution gray images and pixel-precise ground truth (GT) labels for all defect samples. Compared to existing datasets, CSDD has the following two characteristics: 1) the target samples are unaligned and have complex and variable context information and 2) the defects in the CSDD dataset samples are subtle and confusable by factors such as oil contamination, processing features, and machining marks, illustrating the challenge of detecting real casting defects in an industrial context. Based on this dataset, we observe that current state-of-the-art (SOTA) IAD methods face challenges when there is considerable variation in sample context information. Furthermore, these methods encounter difficulties when abnormal samples are scarce, particularly those samples with subtle and confusable defects. To address this issue, we propose a novel method called realistic synthetic anomalies (RSAs), which enhances the model’s capacity to construct a normal sample distribution by generating a large number of RSAs. Experimental results demonstrate that the model trained to classify synthetic anomalies from normal samples achieves the highest accuracy for CSDD and significantly improves detection accuracy for subtle and confusable defects. The CSDD dataset and code of RSA are available at \n<uri>https://github.com/18894269590/RSA</uri>\n.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"24 10","pages":"16721-16733"},"PeriodicalIF":4.3000,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Casting Surface Dataset and Benchmark for Subtle and Confusable Defect Detection in Complex Contexts\",\"authors\":\"Qishan Wang;Shuyong Gao;Li Xiong;Aili Liang;Kaidong Jiang;Wenqiang Zhang\",\"doi\":\"10.1109/JSEN.2024.3387082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Industrial anomaly detection (IAD) algorithms are essential for implementing automated quality inspection. Dataset diversity serves as the foundation for developing comprehensive detection algorithms. Existing IAD datasets focus on the diversity of objects and defects, overlooking the diversity of domains within the real data. To bridge this gap, this study proposes the casting surface defect detection (CSDD) dataset, containing 12647 high-resolution gray images and pixel-precise ground truth (GT) labels for all defect samples. Compared to existing datasets, CSDD has the following two characteristics: 1) the target samples are unaligned and have complex and variable context information and 2) the defects in the CSDD dataset samples are subtle and confusable by factors such as oil contamination, processing features, and machining marks, illustrating the challenge of detecting real casting defects in an industrial context. Based on this dataset, we observe that current state-of-the-art (SOTA) IAD methods face challenges when there is considerable variation in sample context information. Furthermore, these methods encounter difficulties when abnormal samples are scarce, particularly those samples with subtle and confusable defects. To address this issue, we propose a novel method called realistic synthetic anomalies (RSAs), which enhances the model’s capacity to construct a normal sample distribution by generating a large number of RSAs. Experimental results demonstrate that the model trained to classify synthetic anomalies from normal samples achieves the highest accuracy for CSDD and significantly improves detection accuracy for subtle and confusable defects. The CSDD dataset and code of RSA are available at \\n<uri>https://github.com/18894269590/RSA</uri>\\n.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"24 10\",\"pages\":\"16721-16733\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10502267/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10502267/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
引用次数: 0

摘要

工业异常检测(IAD)算法对于实现自动质量检测至关重要。数据集的多样性是开发综合检测算法的基础。现有的 IAD 数据集侧重于对象和缺陷的多样性,忽略了真实数据中领域的多样性。为了弥补这一差距,本研究提出了铸造表面缺陷检测(CSDD)数据集,其中包含 12647 张高分辨率灰度图像和所有缺陷样本的像素精确地面实况(GT)标签。与现有数据集相比,CSDD 具有以下两个特点:1) 目标样本不对齐,具有复杂多变的上下文信息;2) CSDD 数据集样本中的缺陷很微妙,容易被油污、加工特征和加工痕迹等因素混淆,这说明了在工业环境中检测真实铸造缺陷所面临的挑战。基于该数据集,我们发现,当样本上下文信息存在相当大的差异时,当前最先进的(SOTA)IAD 方法就会面临挑战。此外,当异常样本稀少时,这些方法也会遇到困难,尤其是那些存在细微和易混淆缺陷的样本。为了解决这个问题,我们提出了一种称为现实合成异常(RSAs)的新方法,通过生成大量的 RSAs 来增强模型构建正常样本分布的能力。实验结果表明,从正常样本中对合成异常进行分类的训练模型在 CSDD 方面达到了最高的准确率,并显著提高了对细微和易混淆缺陷的检测准确率。CSDD 数据集和 RSA 代码见 https://github.com/18894269590/RSA。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Casting Surface Dataset and Benchmark for Subtle and Confusable Defect Detection in Complex Contexts
Industrial anomaly detection (IAD) algorithms are essential for implementing automated quality inspection. Dataset diversity serves as the foundation for developing comprehensive detection algorithms. Existing IAD datasets focus on the diversity of objects and defects, overlooking the diversity of domains within the real data. To bridge this gap, this study proposes the casting surface defect detection (CSDD) dataset, containing 12647 high-resolution gray images and pixel-precise ground truth (GT) labels for all defect samples. Compared to existing datasets, CSDD has the following two characteristics: 1) the target samples are unaligned and have complex and variable context information and 2) the defects in the CSDD dataset samples are subtle and confusable by factors such as oil contamination, processing features, and machining marks, illustrating the challenge of detecting real casting defects in an industrial context. Based on this dataset, we observe that current state-of-the-art (SOTA) IAD methods face challenges when there is considerable variation in sample context information. Furthermore, these methods encounter difficulties when abnormal samples are scarce, particularly those samples with subtle and confusable defects. To address this issue, we propose a novel method called realistic synthetic anomalies (RSAs), which enhances the model’s capacity to construct a normal sample distribution by generating a large number of RSAs. Experimental results demonstrate that the model trained to classify synthetic anomalies from normal samples achieves the highest accuracy for CSDD and significantly improves detection accuracy for subtle and confusable defects. The CSDD dataset and code of RSA are available at https://github.com/18894269590/RSA .
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
IEEE Sensors Journal
IEEE Sensors Journal 工程技术-工程:电子与电气
CiteScore
7.70
自引率
14.00%
发文量
2058
审稿时长
5.2 months
期刊介绍: The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following: -Sensor Phenomenology, Modelling, and Evaluation -Sensor Materials, Processing, and Fabrication -Chemical and Gas Sensors -Microfluidics and Biosensors -Optical Sensors -Physical Sensors: Temperature, Mechanical, Magnetic, and others -Acoustic and Ultrasonic Sensors -Sensor Packaging -Sensor Networks -Sensor Applications -Sensor Systems: Signals, Processing, and Interfaces -Actuators and Sensor Power Systems -Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting -Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data) -Sensors in Industrial Practice
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信