基于双特征的统一变压器模型图像异常检测

IF 10.4 1区计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS

Journal of Industrial Information Integration Pub Date : 2025-06-23 DOI:10.1016/j.jii.2025.100892

Yuanbo Wang, Junfeng Jing, Xin Zhang

{"title":"基于双特征的统一变压器模型图像异常检测","authors":"Yuanbo Wang, Junfeng Jing, Xin Zhang","doi":"10.1016/j.jii.2025.100892","DOIUrl":null,"url":null,"abstract":"<div><div>Unsupervised industrial image anomaly detection has been extensively investigated by reconstruction-based frameworks. While demonstrating promising performance benchmarks, existing reconstruction networks frequently degenerate into identity mapping behavior: they are prone to directly copying inputs as outputs, which fails to truly learn the deep structural information and statistical distribution of the data, and often require training distinct models for individual object categories (one-class-one-model). In this paper, we propose a powerful multi-class unified model based on the Dual-Feature Guided Reconstruction Network (DFGR) for multi-class anomaly detection. One of DFGR strengthens the low-level feature to realize the guidance function of the model to reconstruct important normal features and significantly reducing the reliance on prior knowledge, and the other multi-layer fusion feature provides rich semantic features of the image. We utilize the main structural features to guide the reconstruction, realizes the interaction between the global information of the main structural features and the local information of the reconstructed feature map. Our method better balances the contribution of the low-level spatial structure information to the overall reconstruction process, and also effectively reduces the sharp response of the reconstructed network to small background noises. Experimental results on the MVTec dataset demonstrate an image-level area under the receiver operating characteristic (AUROC) of 98.0% and a pixel-level AUROC of 97.1%, and further validations on the DAGM2007, Rollei, and red and blue datasets confirm the feasibility of the dual-feature structure. The code: <span><span>https://github.com/wyanb/DFGR</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55975,"journal":{"name":"Journal of Industrial Information Integration","volume":"47 ","pages":"Article 100892"},"PeriodicalIF":10.4000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Image anomaly detection with a unified transformer model guided by dual-feature\",\"authors\":\"Yuanbo Wang, Junfeng Jing, Xin Zhang\",\"doi\":\"10.1016/j.jii.2025.100892\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Unsupervised industrial image anomaly detection has been extensively investigated by reconstruction-based frameworks. While demonstrating promising performance benchmarks, existing reconstruction networks frequently degenerate into identity mapping behavior: they are prone to directly copying inputs as outputs, which fails to truly learn the deep structural information and statistical distribution of the data, and often require training distinct models for individual object categories (one-class-one-model). In this paper, we propose a powerful multi-class unified model based on the Dual-Feature Guided Reconstruction Network (DFGR) for multi-class anomaly detection. One of DFGR strengthens the low-level feature to realize the guidance function of the model to reconstruct important normal features and significantly reducing the reliance on prior knowledge, and the other multi-layer fusion feature provides rich semantic features of the image. We utilize the main structural features to guide the reconstruction, realizes the interaction between the global information of the main structural features and the local information of the reconstructed feature map. Our method better balances the contribution of the low-level spatial structure information to the overall reconstruction process, and also effectively reduces the sharp response of the reconstructed network to small background noises. Experimental results on the MVTec dataset demonstrate an image-level area under the receiver operating characteristic (AUROC) of 98.0% and a pixel-level AUROC of 97.1%, and further validations on the DAGM2007, Rollei, and red and blue datasets confirm the feasibility of the dual-feature structure. The code: <span><span>https://github.com/wyanb/DFGR</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":55975,\"journal\":{\"name\":\"Journal of Industrial Information Integration\",\"volume\":\"47 \",\"pages\":\"Article 100892\"},\"PeriodicalIF\":10.4000,\"publicationDate\":\"2025-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Industrial Information Integration\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2452414X25001153\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Industrial Information Integration","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2452414X25001153","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}

引用次数: 0

摘要

基于重构的框架对无监督工业图像异常检测进行了广泛的研究。虽然展示了有希望的性能基准，但现有的重构网络经常退化为身份映射行为，它们容易直接将输入复制为输出，并且无法真正学习数据的深层结构信息和统计分布，通常需要为单个对象类别（一类模型）训练不同的模型。本文提出了一种基于双特征引导重构网络（DFGR）的多类异常检测统一模型。其中一种是对底层特征的强化，实现了模型对重要正常特征重构的引导功能，显著降低了对先验知识的依赖；另一种是多层融合特征，提供了丰富的图像语义特征。我们利用主结构特征来指导重构，实现了主结构特征的全局信息与重构特征图的局部信息的交互。该方法较好地平衡了低层空间结构信息对整个重建过程的贡献，并有效地降低了重建网络对小背景噪声的剧烈响应。在MVTec数据集上的实验结果表明，接收机工作特征（AUROC）下的图像级面积为98.0%，像素级AUROC为97.1%，在DAGM2007、Rollei和红蓝数据集上的进一步验证证实了双特征结构的可行性。代码：https://github.com/wyanb/DFGR。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Image anomaly detection with a unified transformer model guided by dual-feature

查看原文本刊更多论文

Image anomaly detection with a unified transformer model guided by dual-feature

Unsupervised industrial image anomaly detection has been extensively investigated by reconstruction-based frameworks. While demonstrating promising performance benchmarks, existing reconstruction networks frequently degenerate into identity mapping behavior: they are prone to directly copying inputs as outputs, which fails to truly learn the deep structural information and statistical distribution of the data, and often require training distinct models for individual object categories (one-class-one-model). In this paper, we propose a powerful multi-class unified model based on the Dual-Feature Guided Reconstruction Network (DFGR) for multi-class anomaly detection. One of DFGR strengthens the low-level feature to realize the guidance function of the model to reconstruct important normal features and significantly reducing the reliance on prior knowledge, and the other multi-layer fusion feature provides rich semantic features of the image. We utilize the main structural features to guide the reconstruction, realizes the interaction between the global information of the main structural features and the local information of the reconstructed feature map. Our method better balances the contribution of the low-level spatial structure information to the overall reconstruction process, and also effectively reduces the sharp response of the reconstructed network to small background noises. Experimental results on the MVTec dataset demonstrate an image-level area under the receiver operating characteristic (AUROC) of 98.0% and a pixel-level AUROC of 97.1%, and further validations on the DAGM2007, Rollei, and red and blue datasets confirm the feasibility of the dual-feature structure. The code: https://github.com/wyanb/DFGR.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Journal of Industrial Information Integration Decision Sciences-Information Systems and Management

CiteScore

22.30

自引率

13.40%

发文量

100

期刊介绍： The Journal of Industrial Information Integration focuses on the industry's transition towards industrial integration and informatization, covering not only hardware and software but also information integration. It serves as a platform for promoting advances in industrial information integration, addressing challenges, issues, and solutions in an interdisciplinary forum for researchers, practitioners, and policy makers. The Journal of Industrial Information Integration welcomes papers on foundational, technical, and practical aspects of industrial information integration, emphasizing the complex and cross-disciplinary topics that arise in industrial integration. Techniques from mathematical science, computer science, computer engineering, electrical and electronic engineering, manufacturing engineering, and engineering management are crucial in this context.