{"title":"RAM: Interpreting real-world image super-resolution in the industry environment","authors":"Ze-Yu Mi, Yu-Bin Yang","doi":"10.1016/j.patrec.2025.03.034","DOIUrl":null,"url":null,"abstract":"<div><div>Industrial image super-resolution (SR) plays a crucial role in various industrial applications by generating high-resolution images that enhance image quality, clarity, and texture. The interpretability of industrial SR models is becoming increasingly important, enabling designers and quality inspectors to perform detailed image analysis and make more informed decisions. However, existing interpretability methods struggle to adapt to the complex degradation and diverse image patterns in industrial SR, making it challenging to provide reliable and accurate interpretations. To address this challenge, we propose a novel approach, Real Attribution Maps (RAM), designed for precise interpretation of industrial SR. RAM introduces two key components: the multi-path downsampling (MPD) function and the multi-progressive degradation (MPG) function. The MPD generates multiple attribution paths by applying a range of downsampling strategies, while the MPG incorporates random degradation kernels to better simulate real-world conditions, ensuring more accurate feature attribution. The final attribution map is derived by averaging the results from all paths. Extensive experiments conducted on industrial datasets, including IndSR, Wafer Maps, and Pelvis, validate the effectiveness of RAM. Our results show substantial improvements in several interpretation evaluation metrics and enhanced visual explanations that eliminate irrelevant interference. This work provides a powerful and versatile tool for explaining industrial SR models, offering significant advances in the interpretability of complex industrial images.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"192 ","pages":"Pages 86-92"},"PeriodicalIF":3.9000,"publicationDate":"2025-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167865525001266","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
引用次数: 0
Abstract
Industrial image super-resolution (SR) plays a crucial role in various industrial applications by generating high-resolution images that enhance image quality, clarity, and texture. The interpretability of industrial SR models is becoming increasingly important, enabling designers and quality inspectors to perform detailed image analysis and make more informed decisions. However, existing interpretability methods struggle to adapt to the complex degradation and diverse image patterns in industrial SR, making it challenging to provide reliable and accurate interpretations. To address this challenge, we propose a novel approach, Real Attribution Maps (RAM), designed for precise interpretation of industrial SR. RAM introduces two key components: the multi-path downsampling (MPD) function and the multi-progressive degradation (MPG) function. The MPD generates multiple attribution paths by applying a range of downsampling strategies, while the MPG incorporates random degradation kernels to better simulate real-world conditions, ensuring more accurate feature attribution. The final attribution map is derived by averaging the results from all paths. Extensive experiments conducted on industrial datasets, including IndSR, Wafer Maps, and Pelvis, validate the effectiveness of RAM. Our results show substantial improvements in several interpretation evaluation metrics and enhanced visual explanations that eliminate irrelevant interference. This work provides a powerful and versatile tool for explaining industrial SR models, offering significant advances in the interpretability of complex industrial images.
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.