{"title":"U-Flow:利用无监督阈值进行异常检测的 U 型归一化流程","authors":"Matías Tailanian, Álvaro Pardo, Pablo Musé","doi":"10.1007/s10851-024-01193-y","DOIUrl":null,"url":null,"abstract":"<p>In this work, we propose a one-class self-supervised method for anomaly segmentation in images that benefits from both a modern machine learning approach and a more classic statistical detection theory. The method consists of four phases. First, features are extracted using a multi-scale image transformer architecture. Then, these features are fed into a U-shaped normalizing flow (NF) that lays the theoretical foundations for the subsequent phases. The third phase computes a pixel-level anomaly map from the NF embedding, and the last phase performs a segmentation based on the <i>a contrario</i> framework. This multiple hypothesis testing strategy permits the derivation of robust unsupervised detection thresholds, which are crucial in real-world applications where an operational point is needed. The segmentation results are evaluated using the mean intersection over union metric, and for assessing the generated anomaly maps we report the area under the receiver operating characteristic curve (<i>AUROC</i>), as well as the area under the per-region-overlap curve (<i>AUPRO</i>). Extensive experimentation in various datasets shows that the proposed approach produces state-of-the-art results for all metrics and all datasets, ranking first in most MVTec-AD categories, with a mean pixel-level <i>AUROC</i> of 98.74%. Code and trained models are available at https://github.com/mtailanian/uflow.</p>","PeriodicalId":16196,"journal":{"name":"Journal of Mathematical Imaging and Vision","volume":"55 1","pages":""},"PeriodicalIF":1.3000,"publicationDate":"2024-05-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"U-Flow: A U-Shaped Normalizing Flow for Anomaly Detection with Unsupervised Threshold\",\"authors\":\"Matías Tailanian, Álvaro Pardo, Pablo Musé\",\"doi\":\"10.1007/s10851-024-01193-y\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In this work, we propose a one-class self-supervised method for anomaly segmentation in images that benefits from both a modern machine learning approach and a more classic statistical detection theory. The method consists of four phases. First, features are extracted using a multi-scale image transformer architecture. Then, these features are fed into a U-shaped normalizing flow (NF) that lays the theoretical foundations for the subsequent phases. The third phase computes a pixel-level anomaly map from the NF embedding, and the last phase performs a segmentation based on the <i>a contrario</i> framework. This multiple hypothesis testing strategy permits the derivation of robust unsupervised detection thresholds, which are crucial in real-world applications where an operational point is needed. The segmentation results are evaluated using the mean intersection over union metric, and for assessing the generated anomaly maps we report the area under the receiver operating characteristic curve (<i>AUROC</i>), as well as the area under the per-region-overlap curve (<i>AUPRO</i>). Extensive experimentation in various datasets shows that the proposed approach produces state-of-the-art results for all metrics and all datasets, ranking first in most MVTec-AD categories, with a mean pixel-level <i>AUROC</i> of 98.74%. Code and trained models are available at https://github.com/mtailanian/uflow.</p>\",\"PeriodicalId\":16196,\"journal\":{\"name\":\"Journal of Mathematical Imaging and Vision\",\"volume\":\"55 1\",\"pages\":\"\"},\"PeriodicalIF\":1.3000,\"publicationDate\":\"2024-05-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of Mathematical Imaging and Vision\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1007/s10851-024-01193-y\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Mathematical Imaging and Vision","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1007/s10851-024-01193-y","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
U-Flow: A U-Shaped Normalizing Flow for Anomaly Detection with Unsupervised Threshold
In this work, we propose a one-class self-supervised method for anomaly segmentation in images that benefits from both a modern machine learning approach and a more classic statistical detection theory. The method consists of four phases. First, features are extracted using a multi-scale image transformer architecture. Then, these features are fed into a U-shaped normalizing flow (NF) that lays the theoretical foundations for the subsequent phases. The third phase computes a pixel-level anomaly map from the NF embedding, and the last phase performs a segmentation based on the a contrario framework. This multiple hypothesis testing strategy permits the derivation of robust unsupervised detection thresholds, which are crucial in real-world applications where an operational point is needed. The segmentation results are evaluated using the mean intersection over union metric, and for assessing the generated anomaly maps we report the area under the receiver operating characteristic curve (AUROC), as well as the area under the per-region-overlap curve (AUPRO). Extensive experimentation in various datasets shows that the proposed approach produces state-of-the-art results for all metrics and all datasets, ranking first in most MVTec-AD categories, with a mean pixel-level AUROC of 98.74%. Code and trained models are available at https://github.com/mtailanian/uflow.
期刊介绍:
The Journal of Mathematical Imaging and Vision is a technical journal publishing important new developments in mathematical imaging. The journal publishes research articles, invited papers, and expository articles.
Current developments in new image processing hardware, the advent of multisensor data fusion, and rapid advances in vision research have led to an explosive growth in the interdisciplinary field of imaging science. This growth has resulted in the development of highly sophisticated mathematical models and theories. The journal emphasizes the role of mathematics as a rigorous basis for imaging science. This provides a sound alternative to present journals in this area. Contributions are judged on the basis of mathematical content. Articles may be physically speculative but need to be mathematically sound. Emphasis is placed on innovative or established mathematical techniques applied to vision and imaging problems in a novel way, as well as new developments and problems in mathematics arising from these applications.
The scope of the journal includes:
computational models of vision; imaging algebra and mathematical morphology
mathematical methods in reconstruction, compactification, and coding
filter theory
probabilistic, statistical, geometric, topological, and fractal techniques and models in imaging science
inverse optics
wave theory.
Specific application areas of interest include, but are not limited to:
all aspects of image formation and representation
medical, biological, industrial, geophysical, astronomical and military imaging
image analysis and image understanding
parallel and distributed computing
computer vision architecture design.