Minjie Du , Siqi Gu , Zihan Qin , Lizhe Xie , Zheng Wang , Yining Hu
{"title":"一种基于图像重构和异常检测的通用无缺陷数据缺陷检测方法","authors":"Minjie Du , Siqi Gu , Zihan Qin , Lizhe Xie , Zheng Wang , Yining Hu","doi":"10.1016/j.neunet.2025.107662","DOIUrl":null,"url":null,"abstract":"<div><div>This paper presents a novel framework based on hierarchical image reconstruction, employing image reconstruction and anomaly detection techniques. Unlike traditional supervised methods, our approach operates without the need for defect-specific training data, enabling generalization across diverse product types. Using hierarchical reconstruction modules and a self-attention mechanism, our method achieves an average precision of 97.83% on the MVTec AD 2D dataset, surpassing the U-Net model by 11.1% and the U-Transformer by 12.9%. Furthermore, the model inference speed reaches 24.1 FPS, representing a 48.1% increase over U-Transformer models. These results demonstrate the framework’s effectiveness in enhancing both detection accuracy and speed, providing a robust solution for real-time industrial defect inspection.</div></div>","PeriodicalId":49763,"journal":{"name":"Neural Networks","volume":"190 ","pages":"Article 107662"},"PeriodicalIF":6.0000,"publicationDate":"2025-06-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A generalized defect-data-free defect inspection method based on image reconstruction and anomaly detection\",\"authors\":\"Minjie Du , Siqi Gu , Zihan Qin , Lizhe Xie , Zheng Wang , Yining Hu\",\"doi\":\"10.1016/j.neunet.2025.107662\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>This paper presents a novel framework based on hierarchical image reconstruction, employing image reconstruction and anomaly detection techniques. Unlike traditional supervised methods, our approach operates without the need for defect-specific training data, enabling generalization across diverse product types. Using hierarchical reconstruction modules and a self-attention mechanism, our method achieves an average precision of 97.83% on the MVTec AD 2D dataset, surpassing the U-Net model by 11.1% and the U-Transformer by 12.9%. Furthermore, the model inference speed reaches 24.1 FPS, representing a 48.1% increase over U-Transformer models. These results demonstrate the framework’s effectiveness in enhancing both detection accuracy and speed, providing a robust solution for real-time industrial defect inspection.</div></div>\",\"PeriodicalId\":49763,\"journal\":{\"name\":\"Neural Networks\",\"volume\":\"190 \",\"pages\":\"Article 107662\"},\"PeriodicalIF\":6.0000,\"publicationDate\":\"2025-06-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Neural Networks\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0893608025005428\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Networks","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0893608025005428","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
摘要
本文提出了一种基于分层图像重构的新框架,该框架采用了图像重构和异常检测技术。与传统的监督方法不同,我们的方法不需要缺陷特定的训练数据,可以跨不同的产品类型进行泛化。利用分层重构模块和自关注机制,该方法在MVTec AD 2D数据集上的平均精度达到97.83%,比U-Net模型高11.1%,比U-Transformer模型高12.9%。模型推理速度达到24.1 FPS,比U-Transformer模型提高48.1%。这些结果证明了该框架在提高检测精度和速度方面的有效性,为实时工业缺陷检测提供了一个强大的解决方案。
A generalized defect-data-free defect inspection method based on image reconstruction and anomaly detection
This paper presents a novel framework based on hierarchical image reconstruction, employing image reconstruction and anomaly detection techniques. Unlike traditional supervised methods, our approach operates without the need for defect-specific training data, enabling generalization across diverse product types. Using hierarchical reconstruction modules and a self-attention mechanism, our method achieves an average precision of 97.83% on the MVTec AD 2D dataset, surpassing the U-Net model by 11.1% and the U-Transformer by 12.9%. Furthermore, the model inference speed reaches 24.1 FPS, representing a 48.1% increase over U-Transformer models. These results demonstrate the framework’s effectiveness in enhancing both detection accuracy and speed, providing a robust solution for real-time industrial defect inspection.
期刊介绍:
Neural Networks is a platform that aims to foster an international community of scholars and practitioners interested in neural networks, deep learning, and other approaches to artificial intelligence and machine learning. Our journal invites submissions covering various aspects of neural networks research, from computational neuroscience and cognitive modeling to mathematical analyses and engineering applications. By providing a forum for interdisciplinary discussions between biology and technology, we aim to encourage the development of biologically-inspired artificial intelligence.