{"title":"铸造表面数据集和基准,用于在复杂环境中检测细微和易混淆缺陷","authors":"Qishan Wang;Shuyong Gao;Li Xiong;Aili Liang;Kaidong Jiang;Wenqiang Zhang","doi":"10.1109/JSEN.2024.3387082","DOIUrl":null,"url":null,"abstract":"Industrial anomaly detection (IAD) algorithms are essential for implementing automated quality inspection. Dataset diversity serves as the foundation for developing comprehensive detection algorithms. Existing IAD datasets focus on the diversity of objects and defects, overlooking the diversity of domains within the real data. To bridge this gap, this study proposes the casting surface defect detection (CSDD) dataset, containing 12647 high-resolution gray images and pixel-precise ground truth (GT) labels for all defect samples. Compared to existing datasets, CSDD has the following two characteristics: 1) the target samples are unaligned and have complex and variable context information and 2) the defects in the CSDD dataset samples are subtle and confusable by factors such as oil contamination, processing features, and machining marks, illustrating the challenge of detecting real casting defects in an industrial context. Based on this dataset, we observe that current state-of-the-art (SOTA) IAD methods face challenges when there is considerable variation in sample context information. Furthermore, these methods encounter difficulties when abnormal samples are scarce, particularly those samples with subtle and confusable defects. To address this issue, we propose a novel method called realistic synthetic anomalies (RSAs), which enhances the model’s capacity to construct a normal sample distribution by generating a large number of RSAs. Experimental results demonstrate that the model trained to classify synthetic anomalies from normal samples achieves the highest accuracy for CSDD and significantly improves detection accuracy for subtle and confusable defects. The CSDD dataset and code of RSA are available at \n<uri>https://github.com/18894269590/RSA</uri>\n.","PeriodicalId":447,"journal":{"name":"IEEE Sensors Journal","volume":"24 10","pages":"16721-16733"},"PeriodicalIF":4.3000,"publicationDate":"2024-04-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Casting Surface Dataset and Benchmark for Subtle and Confusable Defect Detection in Complex Contexts\",\"authors\":\"Qishan Wang;Shuyong Gao;Li Xiong;Aili Liang;Kaidong Jiang;Wenqiang Zhang\",\"doi\":\"10.1109/JSEN.2024.3387082\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Industrial anomaly detection (IAD) algorithms are essential for implementing automated quality inspection. Dataset diversity serves as the foundation for developing comprehensive detection algorithms. Existing IAD datasets focus on the diversity of objects and defects, overlooking the diversity of domains within the real data. To bridge this gap, this study proposes the casting surface defect detection (CSDD) dataset, containing 12647 high-resolution gray images and pixel-precise ground truth (GT) labels for all defect samples. Compared to existing datasets, CSDD has the following two characteristics: 1) the target samples are unaligned and have complex and variable context information and 2) the defects in the CSDD dataset samples are subtle and confusable by factors such as oil contamination, processing features, and machining marks, illustrating the challenge of detecting real casting defects in an industrial context. Based on this dataset, we observe that current state-of-the-art (SOTA) IAD methods face challenges when there is considerable variation in sample context information. Furthermore, these methods encounter difficulties when abnormal samples are scarce, particularly those samples with subtle and confusable defects. To address this issue, we propose a novel method called realistic synthetic anomalies (RSAs), which enhances the model’s capacity to construct a normal sample distribution by generating a large number of RSAs. Experimental results demonstrate that the model trained to classify synthetic anomalies from normal samples achieves the highest accuracy for CSDD and significantly improves detection accuracy for subtle and confusable defects. The CSDD dataset and code of RSA are available at \\n<uri>https://github.com/18894269590/RSA</uri>\\n.\",\"PeriodicalId\":447,\"journal\":{\"name\":\"IEEE Sensors Journal\",\"volume\":\"24 10\",\"pages\":\"16721-16733\"},\"PeriodicalIF\":4.3000,\"publicationDate\":\"2024-04-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Sensors Journal\",\"FirstCategoryId\":\"103\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10502267/\",\"RegionNum\":2,\"RegionCategory\":\"综合性期刊\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Sensors Journal","FirstCategoryId":"103","ListUrlMain":"https://ieeexplore.ieee.org/document/10502267/","RegionNum":2,"RegionCategory":"综合性期刊","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
A Casting Surface Dataset and Benchmark for Subtle and Confusable Defect Detection in Complex Contexts
Industrial anomaly detection (IAD) algorithms are essential for implementing automated quality inspection. Dataset diversity serves as the foundation for developing comprehensive detection algorithms. Existing IAD datasets focus on the diversity of objects and defects, overlooking the diversity of domains within the real data. To bridge this gap, this study proposes the casting surface defect detection (CSDD) dataset, containing 12647 high-resolution gray images and pixel-precise ground truth (GT) labels for all defect samples. Compared to existing datasets, CSDD has the following two characteristics: 1) the target samples are unaligned and have complex and variable context information and 2) the defects in the CSDD dataset samples are subtle and confusable by factors such as oil contamination, processing features, and machining marks, illustrating the challenge of detecting real casting defects in an industrial context. Based on this dataset, we observe that current state-of-the-art (SOTA) IAD methods face challenges when there is considerable variation in sample context information. Furthermore, these methods encounter difficulties when abnormal samples are scarce, particularly those samples with subtle and confusable defects. To address this issue, we propose a novel method called realistic synthetic anomalies (RSAs), which enhances the model’s capacity to construct a normal sample distribution by generating a large number of RSAs. Experimental results demonstrate that the model trained to classify synthetic anomalies from normal samples achieves the highest accuracy for CSDD and significantly improves detection accuracy for subtle and confusable defects. The CSDD dataset and code of RSA are available at
https://github.com/18894269590/RSA
.
期刊介绍:
The fields of interest of the IEEE Sensors Journal are the theory, design , fabrication, manufacturing and applications of devices for sensing and transducing physical, chemical and biological phenomena, with emphasis on the electronics and physics aspect of sensors and integrated sensors-actuators. IEEE Sensors Journal deals with the following:
-Sensor Phenomenology, Modelling, and Evaluation
-Sensor Materials, Processing, and Fabrication
-Chemical and Gas Sensors
-Microfluidics and Biosensors
-Optical Sensors
-Physical Sensors: Temperature, Mechanical, Magnetic, and others
-Acoustic and Ultrasonic Sensors
-Sensor Packaging
-Sensor Networks
-Sensor Applications
-Sensor Systems: Signals, Processing, and Interfaces
-Actuators and Sensor Power Systems
-Sensor Signal Processing for high precision and stability (amplification, filtering, linearization, modulation/demodulation) and under harsh conditions (EMC, radiation, humidity, temperature); energy consumption/harvesting
-Sensor Data Processing (soft computing with sensor data, e.g., pattern recognition, machine learning, evolutionary computation; sensor data fusion, processing of wave e.g., electromagnetic and acoustic; and non-wave, e.g., chemical, gravity, particle, thermal, radiative and non-radiative sensor data, detection, estimation and classification based on sensor data)
-Sensors in Industrial Practice