Eternal-MAML: a meta-learning framework for cross-domain defect recognition.

IF 3.5 4区计算机科学 Q2 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

PeerJ Computer Science Pub Date : 2025-05-07 eCollection Date: 2025-01-01 DOI:10.7717/peerj-cs.2757

Jipeng Feng, Haigang Zhang, Zhifeng Wang

{"title":"Eternal-MAML: a meta-learning framework for cross-domain defect recognition.","authors":"Jipeng Feng, Haigang Zhang, Zhifeng Wang","doi":"10.7717/peerj-cs.2757","DOIUrl":null,"url":null,"abstract":"<p><p>Defect recognition tasks for industrial product suffer from a serious lack of samples, greatly limiting the generalizability of deep learning models. Addressing the imbalance of defective samples often involves leveraging pre-trained models for transfer learning. However, when these models, pre-trained on natural image datasets, are transferred to pixel-level defect recognition tasks, they frequently suffer from overfitting due to data scarcity. Furthermore, significant variations in the morphology, texture, and underlying causes of defects across different industrial products often lead to a degradation in performance, or even complete failure, when directly transferring a defect classification model trained on one type of product to another. The Model-Agnostic Meta-Learning (MAML) framework can learn a general representation of defects from multiple industrial defect recognition tasks and build a foundational model. Despite lacking sufficient training data, the MAML framework can still achieve effective knowledge transfer among cross-domain tasks. We noticed there exists serious label arrangement issues in MAML because of the random selection of recognition tasks, which seriously affects the performance of MAML model during both training and testing phase. This article proposes a novel MAML framework, termed as Eternal-MAML, which guides the update of the classifier module by learning a meta-vector that shares commonality across batch tasks in the inner loop, and addresses the overfitting phenomenon caused by label arrangement issues in testing phase for vanilla MAML. Additionally, the feature extractor in this framework combines the advantages of the Squeeze-and-Excitation module and Residual block to enhance training stability and improve the generalization accuracy of model transfer with the learned initialization parameters. In the simulation experiments, several datasets are applied to verified the cross-domain meta-learning performance of the proposed Eternal-MAML framework. The experimental results show that the proposed framework outperforms the state-of-the-art baselines in terms of average normalized accuracy. Finally, the ablation studies are conducted to examine how the primary components of the framework affect its overall performance. Code is available at https://github.com/zhg-SZPT/Eternal-MAML.</p>","PeriodicalId":54224,"journal":{"name":"PeerJ Computer Science","volume":"11 ","pages":"e2757"},"PeriodicalIF":3.5000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12190434/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"PeerJ Computer Science","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.7717/peerj-cs.2757","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 0

Abstract

Defect recognition tasks for industrial product suffer from a serious lack of samples, greatly limiting the generalizability of deep learning models. Addressing the imbalance of defective samples often involves leveraging pre-trained models for transfer learning. However, when these models, pre-trained on natural image datasets, are transferred to pixel-level defect recognition tasks, they frequently suffer from overfitting due to data scarcity. Furthermore, significant variations in the morphology, texture, and underlying causes of defects across different industrial products often lead to a degradation in performance, or even complete failure, when directly transferring a defect classification model trained on one type of product to another. The Model-Agnostic Meta-Learning (MAML) framework can learn a general representation of defects from multiple industrial defect recognition tasks and build a foundational model. Despite lacking sufficient training data, the MAML framework can still achieve effective knowledge transfer among cross-domain tasks. We noticed there exists serious label arrangement issues in MAML because of the random selection of recognition tasks, which seriously affects the performance of MAML model during both training and testing phase. This article proposes a novel MAML framework, termed as Eternal-MAML, which guides the update of the classifier module by learning a meta-vector that shares commonality across batch tasks in the inner loop, and addresses the overfitting phenomenon caused by label arrangement issues in testing phase for vanilla MAML. Additionally, the feature extractor in this framework combines the advantages of the Squeeze-and-Excitation module and Residual block to enhance training stability and improve the generalization accuracy of model transfer with the learned initialization parameters. In the simulation experiments, several datasets are applied to verified the cross-domain meta-learning performance of the proposed Eternal-MAML framework. The experimental results show that the proposed framework outperforms the state-of-the-art baselines in terms of average normalized accuracy. Finally, the ablation studies are conducted to examine how the primary components of the framework affect its overall performance. Code is available at https://github.com/zhg-SZPT/Eternal-MAML.

查看原文本刊更多论文

Eternal-MAML：用于跨领域缺陷识别的元学习框架。

工业产品缺陷识别任务严重缺乏样本，极大地限制了深度学习模型的泛化能力。解决不平衡的缺陷样本往往涉及利用预训练模型的迁移学习。然而，当这些在自然图像数据集上进行预训练的模型被转移到像素级缺陷识别任务时，由于数据稀缺性，它们经常遭受过拟合。此外，当直接将在一种类型的产品上训练的缺陷分类模型转移到另一种类型的产品时，不同工业产品中缺陷的形态学、质地和潜在原因的显著变化经常导致性能下降，甚至完全失败。模型不可知元学习（model - agnostic Meta-Learning， MAML）框架可以从多个工业缺陷识别任务中学习缺陷的一般表示，并构建基础模型。尽管缺乏足够的训练数据，MAML框架仍然可以在跨领域任务之间实现有效的知识转移。我们注意到，由于识别任务的随机选择，在MAML中存在严重的标签排列问题，严重影响了MAML模型在训练和测试阶段的性能。本文提出了一种新的MAML框架，称为Eternal-MAML，它通过学习一个元向量来指导分类器模块的更新，该元向量在内环中跨批处理任务共享共性，并解决了普通MAML测试阶段由标签排列问题引起的过拟合现象。此外，该框架中的特征提取器结合了Squeeze-and-Excitation模块和Residual block模块的优点，增强了训练的稳定性，提高了利用学习到的初始化参数进行模型迁移的泛化精度。在仿真实验中，应用多个数据集验证了所提出的Eternal-MAML框架的跨域元学习性能。实验结果表明，该框架在平均归一化精度方面优于最先进的基线。最后，进行了烧蚀研究，以检查框架的主要组成部分如何影响其整体性能。代码可从https://github.com/zhg-SZPT/Eternal-MAML获得。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

PeerJ Computer Science Computer Science-General Computer Science

CiteScore

6.10

自引率

5.30%

发文量

332

审稿时长

10 weeks

期刊介绍： PeerJ Computer Science is the new open access journal covering all subject areas in computer science, with the backing of a prestigious advisory board and more than 300 academic editors.