基于双特征的统一变压器模型图像异常检测

IF 10.4 1区 计算机科学 Q1 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Yuanbo Wang, Junfeng Jing, Xin Zhang
{"title":"基于双特征的统一变压器模型图像异常检测","authors":"Yuanbo Wang,&nbsp;Junfeng Jing,&nbsp;Xin Zhang","doi":"10.1016/j.jii.2025.100892","DOIUrl":null,"url":null,"abstract":"<div><div>Unsupervised industrial image anomaly detection has been extensively investigated by reconstruction-based frameworks. While demonstrating promising performance benchmarks, existing reconstruction networks frequently degenerate into identity mapping behavior: they are prone to directly copying inputs as outputs, which fails to truly learn the deep structural information and statistical distribution of the data, and often require training distinct models for individual object categories (one-class-one-model). In this paper, we propose a powerful multi-class unified model based on the Dual-Feature Guided Reconstruction Network (DFGR) for multi-class anomaly detection. One of DFGR strengthens the low-level feature to realize the guidance function of the model to reconstruct important normal features and significantly reducing the reliance on prior knowledge, and the other multi-layer fusion feature provides rich semantic features of the image. We utilize the main structural features to guide the reconstruction, realizes the interaction between the global information of the main structural features and the local information of the reconstructed feature map. Our method better balances the contribution of the low-level spatial structure information to the overall reconstruction process, and also effectively reduces the sharp response of the reconstructed network to small background noises. Experimental results on the MVTec dataset demonstrate an image-level area under the receiver operating characteristic (AUROC) of 98.0% and a pixel-level AUROC of 97.1%, and further validations on the DAGM2007, Rollei, and red and blue datasets confirm the feasibility of the dual-feature structure. The code: <span><span>https://github.com/wyanb/DFGR</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":55975,"journal":{"name":"Journal of Industrial Information Integration","volume":"47 ","pages":"Article 100892"},"PeriodicalIF":10.4000,"publicationDate":"2025-06-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Image anomaly detection with a unified transformer model guided by dual-feature\",\"authors\":\"Yuanbo Wang,&nbsp;Junfeng Jing,&nbsp;Xin Zhang\",\"doi\":\"10.1016/j.jii.2025.100892\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Unsupervised industrial image anomaly detection has been extensively investigated by reconstruction-based frameworks. While demonstrating promising performance benchmarks, existing reconstruction networks frequently degenerate into identity mapping behavior: they are prone to directly copying inputs as outputs, which fails to truly learn the deep structural information and statistical distribution of the data, and often require training distinct models for individual object categories (one-class-one-model). In this paper, we propose a powerful multi-class unified model based on the Dual-Feature Guided Reconstruction Network (DFGR) for multi-class anomaly detection. One of DFGR strengthens the low-level feature to realize the guidance function of the model to reconstruct important normal features and significantly reducing the reliance on prior knowledge, and the other multi-layer fusion feature provides rich semantic features of the image. We utilize the main structural features to guide the reconstruction, realizes the interaction between the global information of the main structural features and the local information of the reconstructed feature map. Our method better balances the contribution of the low-level spatial structure information to the overall reconstruction process, and also effectively reduces the sharp response of the reconstructed network to small background noises. Experimental results on the MVTec dataset demonstrate an image-level area under the receiver operating characteristic (AUROC) of 98.0% and a pixel-level AUROC of 97.1%, and further validations on the DAGM2007, Rollei, and red and blue datasets confirm the feasibility of the dual-feature structure. The code: <span><span>https://github.com/wyanb/DFGR</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":55975,\"journal\":{\"name\":\"Journal of Industrial Information Integration\",\"volume\":\"47 \",\"pages\":\"Article 100892\"},\"PeriodicalIF\":10.4000,\"publicationDate\":\"2025-06-23\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Industrial Information Integration\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2452414X25001153\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Industrial Information Integration","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2452414X25001153","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0

摘要

基于重构的框架对无监督工业图像异常检测进行了广泛的研究。虽然展示了有希望的性能基准,但现有的重构网络经常退化为身份映射行为,它们容易直接将输入复制为输出,并且无法真正学习数据的深层结构信息和统计分布,通常需要为单个对象类别(一类模型)训练不同的模型。本文提出了一种基于双特征引导重构网络(DFGR)的多类异常检测统一模型。其中一种是对底层特征的强化,实现了模型对重要正常特征重构的引导功能,显著降低了对先验知识的依赖;另一种是多层融合特征,提供了丰富的图像语义特征。我们利用主结构特征来指导重构,实现了主结构特征的全局信息与重构特征图的局部信息的交互。该方法较好地平衡了低层空间结构信息对整个重建过程的贡献,并有效地降低了重建网络对小背景噪声的剧烈响应。在MVTec数据集上的实验结果表明,接收机工作特征(AUROC)下的图像级面积为98.0%,像素级AUROC为97.1%,在DAGM2007、Rollei和红蓝数据集上的进一步验证证实了双特征结构的可行性。代码:https://github.com/wyanb/DFGR。
本文章由计算机程序翻译,如有差异,请以英文原文为准。

Image anomaly detection with a unified transformer model guided by dual-feature

Image anomaly detection with a unified transformer model guided by dual-feature
Unsupervised industrial image anomaly detection has been extensively investigated by reconstruction-based frameworks. While demonstrating promising performance benchmarks, existing reconstruction networks frequently degenerate into identity mapping behavior: they are prone to directly copying inputs as outputs, which fails to truly learn the deep structural information and statistical distribution of the data, and often require training distinct models for individual object categories (one-class-one-model). In this paper, we propose a powerful multi-class unified model based on the Dual-Feature Guided Reconstruction Network (DFGR) for multi-class anomaly detection. One of DFGR strengthens the low-level feature to realize the guidance function of the model to reconstruct important normal features and significantly reducing the reliance on prior knowledge, and the other multi-layer fusion feature provides rich semantic features of the image. We utilize the main structural features to guide the reconstruction, realizes the interaction between the global information of the main structural features and the local information of the reconstructed feature map. Our method better balances the contribution of the low-level spatial structure information to the overall reconstruction process, and also effectively reduces the sharp response of the reconstructed network to small background noises. Experimental results on the MVTec dataset demonstrate an image-level area under the receiver operating characteristic (AUROC) of 98.0% and a pixel-level AUROC of 97.1%, and further validations on the DAGM2007, Rollei, and red and blue datasets confirm the feasibility of the dual-feature structure. The code: https://github.com/wyanb/DFGR.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Journal of Industrial Information Integration
Journal of Industrial Information Integration Decision Sciences-Information Systems and Management
CiteScore
22.30
自引率
13.40%
发文量
100
期刊介绍: The Journal of Industrial Information Integration focuses on the industry's transition towards industrial integration and informatization, covering not only hardware and software but also information integration. It serves as a platform for promoting advances in industrial information integration, addressing challenges, issues, and solutions in an interdisciplinary forum for researchers, practitioners, and policy makers. The Journal of Industrial Information Integration welcomes papers on foundational, technical, and practical aspects of industrial information integration, emphasizing the complex and cross-disciplinary topics that arise in industrial integration. Techniques from mathematical science, computer science, computer engineering, electrical and electronic engineering, manufacturing engineering, and engineering management are crucial in this context.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信