Mengjie Tang , Jie Tu , Panyu Zhou , Kelvin K.L. Wong
{"title":"基于非对称师生网络的半导体冷却器件异常晶粒检测工业视觉模型","authors":"Mengjie Tang , Jie Tu , Panyu Zhou , Kelvin K.L. Wong","doi":"10.1016/j.patrec.2025.08.021","DOIUrl":null,"url":null,"abstract":"<div><div>Unsupervised detection of micro-defects in thermoelectric cooler (TEC) grains faces significant challenges due to subtle, low-contrast anomalies and the high cost of manual annotation. In this work, we propose ATS-Net, an asymmetric teacher-student network built upon a shared ResNet backbone, designed as a lightweight, deployment-ready model that generalizes effectively across datasets without per-dataset tuning. Our key contributions include exponential moving-average normalization to stabilize feature statistics, a single-layer Real-NVP coupling mechanism to amplify teacher-student discrepancies at anomaly regions, and a dual-scale contextual transformer block facilitating joint local and global attention. ATS-Net is trained exclusively on defect-free samples in a two-stage process, and evaluated using image-level AUROC, pixel-level PRO at 95 % recall, and mean intersection-over-union (mIoU) metrics on the proprietary TEC-Grain dataset and the public MVTec-AD benchmark. Experimental results demonstrate that ATS-Net achieves superior performance, reaching 99.2 % AUROC, 99.1 % PRO, and 0.989 mIoU on TEC-Grain, and 98.8 % AUROC, 98.6 % PRO, and 0.97 mIoU on MVTec-AD. The model operates efficiently at 3.8 GFLOPs with 19.36 MB parameters and a speed of 96 FPS on an RTX 3090 GPU. Ablation studies show the introduced modules collectively enhance AUROC by 9.5 percentage points over the MKD baseline, with only a minor increase in parameters. ATS-Net thus effectively balances detection accuracy, interpretability, and speed, making it suitable for real-time defect inspection in semiconductor cooling device production. Future research will focus on integrating multi-modal fusion and self-supervised pretraining strategies to further minimize annotation requirements.</div></div>","PeriodicalId":54638,"journal":{"name":"Pattern Recognition Letters","volume":"197 ","pages":"Pages 288-296"},"PeriodicalIF":3.3000,"publicationDate":"2025-08-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An asymmetric teacher-student network based industrial vision model for abnormal grain detection of semiconductor cooling devices\",\"authors\":\"Mengjie Tang , Jie Tu , Panyu Zhou , Kelvin K.L. Wong\",\"doi\":\"10.1016/j.patrec.2025.08.021\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>Unsupervised detection of micro-defects in thermoelectric cooler (TEC) grains faces significant challenges due to subtle, low-contrast anomalies and the high cost of manual annotation. In this work, we propose ATS-Net, an asymmetric teacher-student network built upon a shared ResNet backbone, designed as a lightweight, deployment-ready model that generalizes effectively across datasets without per-dataset tuning. Our key contributions include exponential moving-average normalization to stabilize feature statistics, a single-layer Real-NVP coupling mechanism to amplify teacher-student discrepancies at anomaly regions, and a dual-scale contextual transformer block facilitating joint local and global attention. ATS-Net is trained exclusively on defect-free samples in a two-stage process, and evaluated using image-level AUROC, pixel-level PRO at 95 % recall, and mean intersection-over-union (mIoU) metrics on the proprietary TEC-Grain dataset and the public MVTec-AD benchmark. Experimental results demonstrate that ATS-Net achieves superior performance, reaching 99.2 % AUROC, 99.1 % PRO, and 0.989 mIoU on TEC-Grain, and 98.8 % AUROC, 98.6 % PRO, and 0.97 mIoU on MVTec-AD. The model operates efficiently at 3.8 GFLOPs with 19.36 MB parameters and a speed of 96 FPS on an RTX 3090 GPU. Ablation studies show the introduced modules collectively enhance AUROC by 9.5 percentage points over the MKD baseline, with only a minor increase in parameters. ATS-Net thus effectively balances detection accuracy, interpretability, and speed, making it suitable for real-time defect inspection in semiconductor cooling device production. Future research will focus on integrating multi-modal fusion and self-supervised pretraining strategies to further minimize annotation requirements.</div></div>\",\"PeriodicalId\":54638,\"journal\":{\"name\":\"Pattern Recognition Letters\",\"volume\":\"197 \",\"pages\":\"Pages 288-296\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-08-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Pattern Recognition Letters\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0167865525003009\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pattern Recognition Letters","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0167865525003009","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
An asymmetric teacher-student network based industrial vision model for abnormal grain detection of semiconductor cooling devices
Unsupervised detection of micro-defects in thermoelectric cooler (TEC) grains faces significant challenges due to subtle, low-contrast anomalies and the high cost of manual annotation. In this work, we propose ATS-Net, an asymmetric teacher-student network built upon a shared ResNet backbone, designed as a lightweight, deployment-ready model that generalizes effectively across datasets without per-dataset tuning. Our key contributions include exponential moving-average normalization to stabilize feature statistics, a single-layer Real-NVP coupling mechanism to amplify teacher-student discrepancies at anomaly regions, and a dual-scale contextual transformer block facilitating joint local and global attention. ATS-Net is trained exclusively on defect-free samples in a two-stage process, and evaluated using image-level AUROC, pixel-level PRO at 95 % recall, and mean intersection-over-union (mIoU) metrics on the proprietary TEC-Grain dataset and the public MVTec-AD benchmark. Experimental results demonstrate that ATS-Net achieves superior performance, reaching 99.2 % AUROC, 99.1 % PRO, and 0.989 mIoU on TEC-Grain, and 98.8 % AUROC, 98.6 % PRO, and 0.97 mIoU on MVTec-AD. The model operates efficiently at 3.8 GFLOPs with 19.36 MB parameters and a speed of 96 FPS on an RTX 3090 GPU. Ablation studies show the introduced modules collectively enhance AUROC by 9.5 percentage points over the MKD baseline, with only a minor increase in parameters. ATS-Net thus effectively balances detection accuracy, interpretability, and speed, making it suitable for real-time defect inspection in semiconductor cooling device production. Future research will focus on integrating multi-modal fusion and self-supervised pretraining strategies to further minimize annotation requirements.
期刊介绍:
Pattern Recognition Letters aims at rapid publication of concise articles of a broad interest in pattern recognition.
Subject areas include all the current fields of interest represented by the Technical Committees of the International Association of Pattern Recognition, and other developing themes involving learning and recognition.