工业缺陷图像生成的结构化引导扩散模型

IF 7.6 1区 计算机科学 Q1 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE
Yulai Xie , Xiaoning Pi , Yang Zhang , Fang Ren
{"title":"工业缺陷图像生成的结构化引导扩散模型","authors":"Yulai Xie ,&nbsp;Xiaoning Pi ,&nbsp;Yang Zhang ,&nbsp;Fang Ren","doi":"10.1016/j.knosys.2025.114642","DOIUrl":null,"url":null,"abstract":"<div><div>Industrial defect images exhibit distinct characteristics from natural images, including severe class imbalance and structured similarity and diversity. Current defect image generation methods often lack fine-grained control over defect elements and suffer from limited diversity. This paper presents the Structured Guided Diffusion Model (Structured-GDM) for generating high-quality defect images with independent control over three structured elements: normal backgrounds, defect classes, and defect shapes. Controllability enables the generation of high-diversity defect images by preserving normal background outlines with detailed variation, specifying defect classes and shapes, and guiding the generation of reasonable (single or combined) defects using prior or expert knowledge. The structured architecture separates the training and use of elemental diffusion, classification, and segmentation models in a building-block manner, offering improved flexibility and maintainability. Additionally, a multiple-class training scheme is proposed to train overall models for one-for-all multiple-class defect generation, which exploits the inter-class similarity of defects and simplifies implementation. Extensive experiments on multiple MVTec and NEU-DET demonstrate that the method achieves superior performance in both image quality metrics and down-stream tasks, while maintaining high diversity and structured controllability.</div></div>","PeriodicalId":49939,"journal":{"name":"Knowledge-Based Systems","volume":"330 ","pages":"Article 114642"},"PeriodicalIF":7.6000,"publicationDate":"2025-10-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Structured guided diffusion models for industrial defect image generation\",\"authors\":\"Yulai Xie ,&nbsp;Xiaoning Pi ,&nbsp;Yang Zhang ,&nbsp;Fang Ren\",\"doi\":\"10.1016/j.knosys.2025.114642\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Industrial defect images exhibit distinct characteristics from natural images, including severe class imbalance and structured similarity and diversity. Current defect image generation methods often lack fine-grained control over defect elements and suffer from limited diversity. This paper presents the Structured Guided Diffusion Model (Structured-GDM) for generating high-quality defect images with independent control over three structured elements: normal backgrounds, defect classes, and defect shapes. Controllability enables the generation of high-diversity defect images by preserving normal background outlines with detailed variation, specifying defect classes and shapes, and guiding the generation of reasonable (single or combined) defects using prior or expert knowledge. The structured architecture separates the training and use of elemental diffusion, classification, and segmentation models in a building-block manner, offering improved flexibility and maintainability. Additionally, a multiple-class training scheme is proposed to train overall models for one-for-all multiple-class defect generation, which exploits the inter-class similarity of defects and simplifies implementation. Extensive experiments on multiple MVTec and NEU-DET demonstrate that the method achieves superior performance in both image quality metrics and down-stream tasks, while maintaining high diversity and structured controllability.</div></div>\",\"PeriodicalId\":49939,\"journal\":{\"name\":\"Knowledge-Based Systems\",\"volume\":\"330 \",\"pages\":\"Article 114642\"},\"PeriodicalIF\":7.6000,\"publicationDate\":\"2025-10-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Knowledge-Based Systems\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0950705125016818\",\"RegionNum\":1,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Knowledge-Based Systems","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0950705125016818","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0

摘要

工业缺陷图像表现出与自然图像截然不同的特征,包括严重的阶级不平衡和结构的相似性和多样性。目前的缺陷图像生成方法往往缺乏对缺陷元素的细粒度控制,且多样性有限。本文提出了结构化引导扩散模型(Structured- gdm),用于生成高质量的缺陷图像,并对三个结构化元素:正常背景、缺陷类别和缺陷形状进行独立控制。可控性通过保留具有详细变化的正常背景轮廓,指定缺陷类别和形状,并使用先前或专家知识指导合理(单个或组合)缺陷的生成,从而能够生成高度多样化的缺陷图像。结构化的体系结构以构建块的方式分离了基本扩散、分类和分割模型的训练和使用,提供了改进的灵活性和可维护性。此外,提出了一种多类训练方案,利用缺陷的类间相似性,简化了实现过程,对多类缺陷生成的整体模型进行了训练。在多个MVTec和NEU-DET上的大量实验表明,该方法在图像质量指标和下游任务方面都取得了优异的性能,同时保持了高度的多样性和结构可控性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Structured guided diffusion models for industrial defect image generation
Industrial defect images exhibit distinct characteristics from natural images, including severe class imbalance and structured similarity and diversity. Current defect image generation methods often lack fine-grained control over defect elements and suffer from limited diversity. This paper presents the Structured Guided Diffusion Model (Structured-GDM) for generating high-quality defect images with independent control over three structured elements: normal backgrounds, defect classes, and defect shapes. Controllability enables the generation of high-diversity defect images by preserving normal background outlines with detailed variation, specifying defect classes and shapes, and guiding the generation of reasonable (single or combined) defects using prior or expert knowledge. The structured architecture separates the training and use of elemental diffusion, classification, and segmentation models in a building-block manner, offering improved flexibility and maintainability. Additionally, a multiple-class training scheme is proposed to train overall models for one-for-all multiple-class defect generation, which exploits the inter-class similarity of defects and simplifies implementation. Extensive experiments on multiple MVTec and NEU-DET demonstrate that the method achieves superior performance in both image quality metrics and down-stream tasks, while maintaining high diversity and structured controllability.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Knowledge-Based Systems
Knowledge-Based Systems 工程技术-计算机:人工智能
CiteScore
14.80
自引率
12.50%
发文量
1245
审稿时长
7.8 months
期刊介绍: Knowledge-Based Systems, an international and interdisciplinary journal in artificial intelligence, publishes original, innovative, and creative research results in the field. It focuses on knowledge-based and other artificial intelligence techniques-based systems. The journal aims to support human prediction and decision-making through data science and computation techniques, provide a balanced coverage of theory and practical study, and encourage the development and implementation of knowledge-based intelligence models, methods, systems, and software tools. Applications in business, government, education, engineering, and healthcare are emphasized.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信