An Auto-adjusting Weight Model for Imbalanced Wafer Defects Recognition

2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD) Pub Date : 2022-11-30 DOI:10.1109/ICSMD57530.2022.10058248

Yu Chen, Xinjia Zhao, Meng Zhao, Meng Zhao, J. Ji

{"title":"An Auto-adjusting Weight Model for Imbalanced Wafer Defects Recognition","authors":"Yu Chen, Xinjia Zhao, Meng Zhao, Meng Zhao, J. Ji","doi":"10.1109/ICSMD57530.2022.10058248","DOIUrl":null,"url":null,"abstract":"Wafer defect recognition is popular research in the semiconductor industry. Generally, each defect pattern is related to a specific manufacturing problem. By identifying defect patterns correctly, manufacturing problems can be recognized and fixed in time, which improves the quality and production yield of wafers. However, due to the location, light and the increasing number of wafers, traditional recognition methods achieve unsatisfactory performance. Currently, convolutional neural network (CNN) based methods outperform traditional methods in accuracy and speed, but fail when training with imbalanced target classes. To address the imbalanced problem, a CNN-based knowledge distillation (KD) method is proposed. To improve the identification of different types of defects, a multi-head attention layer is applied to the proposed CNN model, which enriches local and global information of features. Besides, when training the CNN model, target features are constrained with Distillation Loss and Focal Loss, reducing the effect of dataset imbalance. Experiments on the public dataset WM-811K are conducted to verify the proposed methods, and experimental results showed that the accuracy, precision, recall, specificity, and F1 score of our method reached 97.7%, 96.9%, 97.2%, 99.7% and 97.0% respectively, and the classification accuracy of each class was above 93.0%, which indicates the proposed method was reasonable and effective on large-scale imbalanced wafer defect datasets.","PeriodicalId":396735,"journal":{"name":"2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD)","volume":"8 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICSMD57530.2022.10058248","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

Wafer defect recognition is popular research in the semiconductor industry. Generally, each defect pattern is related to a specific manufacturing problem. By identifying defect patterns correctly, manufacturing problems can be recognized and fixed in time, which improves the quality and production yield of wafers. However, due to the location, light and the increasing number of wafers, traditional recognition methods achieve unsatisfactory performance. Currently, convolutional neural network (CNN) based methods outperform traditional methods in accuracy and speed, but fail when training with imbalanced target classes. To address the imbalanced problem, a CNN-based knowledge distillation (KD) method is proposed. To improve the identification of different types of defects, a multi-head attention layer is applied to the proposed CNN model, which enriches local and global information of features. Besides, when training the CNN model, target features are constrained with Distillation Loss and Focal Loss, reducing the effect of dataset imbalance. Experiments on the public dataset WM-811K are conducted to verify the proposed methods, and experimental results showed that the accuracy, precision, recall, specificity, and F1 score of our method reached 97.7%, 96.9%, 97.2%, 99.7% and 97.0% respectively, and the classification accuracy of each class was above 93.0%, which indicates the proposed method was reasonable and effective on large-scale imbalanced wafer defect datasets.

查看原文本刊更多论文

一种用于不平衡晶圆缺陷识别的自动调整权重模型

晶圆缺陷识别是半导体行业研究的热点。一般来说，每个缺陷模式都与一个特定的制造问题相关。通过正确识别缺陷模式，可以及时发现和解决制造问题，从而提高晶圆的质量和产量。然而，由于位置、光线和晶圆数量的增加，传统的识别方法的性能并不理想。目前，基于卷积神经网络(CNN)的训练方法在准确率和速度上都优于传统方法，但在目标类不平衡的情况下训练失败。为了解决不平衡问题，提出了一种基于cnn的知识蒸馏(KD)方法。为了提高对不同类型缺陷的识别能力，本文提出的CNN模型采用了多头关注层，丰富了特征的局部和全局信息。此外，在训练CNN模型时，目标特征受到蒸馏损失和焦点损失的约束，减少了数据集不平衡的影响。在公共数据集WM-811K上进行了实验验证，实验结果表明，本文方法的准确率、精密度、召回率、特异性和F1评分分别达到97.7%、96.9%、97.2%、99.7%和97.0%，各类别的分类准确率均在93.0%以上，表明本文方法在大规模不平衡晶圆缺陷数据集上是合理有效的。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 International Conference on Sensing, Measurement & Data Analytics in the era of Artificial Intelligence (ICSMD)

自引率

0.00%

发文量