{"title":"在统计异构数据上进行晶圆缺陷分类的聚类联合学习","authors":"Guang Yang;Zhijia Yang;Shuping Cui;Chunhe Song;Jizhou Wang;Haodong Wei","doi":"10.1109/TIM.2024.3415785","DOIUrl":null,"url":null,"abstract":"Data-driven deep learning techniques for wafer defect image classification provide wafer manufacturers with a tool to rapidly identify surface defects. However, the defect data and computational capabilities of a single wafer manufacturer are often insufficient to support the training of deep learning models. In response, we introduce federated learning (FL), a paradigm that leverages the data and computational capabilities of various wafer manufacturers, all while ensuring that the original data from different manufacturers remain unexposed to each other. Due to variations in manufacturing processes and image acquisition equipment, identical wafer defects can exhibit different features in different manufacturing settings, leading to statistically heterogeneous datasets. This heterogeneity can reduce model convergence speed and accuracy. To counteract this issue, we propose a personalized FL approach with clustering. In the personalization phase, we train distinct network layers for each client’s local model, capitalizing on the feature extraction capability of the global model’s shallow network, while also achieving commendable performance on each client’s unique dataset. During the clustering phase, we provide a theoretical analysis, demonstrating that the divergence of weights between two models is bounded above, laying a theoretical foundation for the clustering operation. We then enhance a density-based clustering method, enabling the clustering of clients with similar data features without the need to specify the number of cluster centers, thus mitigating the problem of global model oscillation. We have conducted experiments under various data heterogeneity scenarios. The experiments show that our method can achieve a 2.8% accuracy improvement average versus the compared state-of-the-art federated methods with a faster convergence rate.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":null,"pages":null},"PeriodicalIF":5.6000,"publicationDate":"2024-09-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Clustering Federated Learning for Wafer Defects Classification on Statistical Heterogeneous Data\",\"authors\":\"Guang Yang;Zhijia Yang;Shuping Cui;Chunhe Song;Jizhou Wang;Haodong Wei\",\"doi\":\"10.1109/TIM.2024.3415785\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Data-driven deep learning techniques for wafer defect image classification provide wafer manufacturers with a tool to rapidly identify surface defects. However, the defect data and computational capabilities of a single wafer manufacturer are often insufficient to support the training of deep learning models. In response, we introduce federated learning (FL), a paradigm that leverages the data and computational capabilities of various wafer manufacturers, all while ensuring that the original data from different manufacturers remain unexposed to each other. Due to variations in manufacturing processes and image acquisition equipment, identical wafer defects can exhibit different features in different manufacturing settings, leading to statistically heterogeneous datasets. This heterogeneity can reduce model convergence speed and accuracy. To counteract this issue, we propose a personalized FL approach with clustering. In the personalization phase, we train distinct network layers for each client’s local model, capitalizing on the feature extraction capability of the global model’s shallow network, while also achieving commendable performance on each client’s unique dataset. During the clustering phase, we provide a theoretical analysis, demonstrating that the divergence of weights between two models is bounded above, laying a theoretical foundation for the clustering operation. We then enhance a density-based clustering method, enabling the clustering of clients with similar data features without the need to specify the number of cluster centers, thus mitigating the problem of global model oscillation. We have conducted experiments under various data heterogeneity scenarios. The experiments show that our method can achieve a 2.8% accuracy improvement average versus the compared state-of-the-art federated methods with a faster convergence rate.\",\"PeriodicalId\":13341,\"journal\":{\"name\":\"IEEE Transactions on Instrumentation and Measurement\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":5.6000,\"publicationDate\":\"2024-09-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Instrumentation and Measurement\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10666741/\",\"RegionNum\":2,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ENGINEERING, ELECTRICAL & ELECTRONIC\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10666741/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Clustering Federated Learning for Wafer Defects Classification on Statistical Heterogeneous Data
Data-driven deep learning techniques for wafer defect image classification provide wafer manufacturers with a tool to rapidly identify surface defects. However, the defect data and computational capabilities of a single wafer manufacturer are often insufficient to support the training of deep learning models. In response, we introduce federated learning (FL), a paradigm that leverages the data and computational capabilities of various wafer manufacturers, all while ensuring that the original data from different manufacturers remain unexposed to each other. Due to variations in manufacturing processes and image acquisition equipment, identical wafer defects can exhibit different features in different manufacturing settings, leading to statistically heterogeneous datasets. This heterogeneity can reduce model convergence speed and accuracy. To counteract this issue, we propose a personalized FL approach with clustering. In the personalization phase, we train distinct network layers for each client’s local model, capitalizing on the feature extraction capability of the global model’s shallow network, while also achieving commendable performance on each client’s unique dataset. During the clustering phase, we provide a theoretical analysis, demonstrating that the divergence of weights between two models is bounded above, laying a theoretical foundation for the clustering operation. We then enhance a density-based clustering method, enabling the clustering of clients with similar data features without the need to specify the number of cluster centers, thus mitigating the problem of global model oscillation. We have conducted experiments under various data heterogeneity scenarios. The experiments show that our method can achieve a 2.8% accuracy improvement average versus the compared state-of-the-art federated methods with a faster convergence rate.
期刊介绍:
Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.