Analysis of Trainability of Gradient-based Multi-environment Learning from Gradient Norm Regularization Perspective

S. Takagi, Yoshihiro Nagano, Yuki Yoshida, Masato Okada
{"title":"Analysis of Trainability of Gradient-based Multi -environment Learning from Gradient Norm Regularization Perspective","authors":"S. Takagi, Yoshihiro Nagano, Yuki Yoshida, Masato Okada","doi":"10.1109/IJCNN52387.2021.9533904","DOIUrl":null,"url":null,"abstract":"Adaptation and invariance to multiple environments are both crucial abilities for intelligent systems. Model-agnostic meta-learning (MAML) is a meta-learning algorithm to enable such adaptability, and invariant risk minimization (IRM) is a problem setting to achieve the invariant representation across multiple environments. We can formulate both methods as optimization problems with the environment-dependent constraint and this constraint is known to hamper optimization. Therefore, understanding the effect of the constraint on the optimization is important. In this paper, we provide a conceptual insight on how the constraint affects the optimization of MAML and IRM by analyzing the trainability of the gradient descent on the loss with the gradient norm penalty, which is easier to study but is related to both MAML and IRM. We conduct numerical experiments with practical datasets and architectures for MAML and IRM and validate that the analysis of the gradient norm penalty loss captures well the empirical relationship between the constraint and the trainability of MAML and IRM.","PeriodicalId":396583,"journal":{"name":"2021 International Joint Conference on Neural Networks (IJCNN)","volume":"142 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Joint Conference on Neural Networks (IJCNN)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IJCNN52387.2021.9533904","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Adaptation and invariance to multiple environments are both crucial abilities for intelligent systems. Model-agnostic meta-learning (MAML) is a meta-learning algorithm that enables such adaptability, and invariant risk minimization (IRM) is a problem setting that aims at representations invariant across multiple environments. Both methods can be formulated as optimization problems with an environment-dependent constraint, and this constraint is known to hamper optimization. Understanding the effect of the constraint on optimization is therefore important. In this paper, we provide conceptual insight into how the constraint affects the optimization of MAML and IRM by analyzing the trainability of gradient descent on a loss with a gradient norm penalty, an objective that is easier to study yet is related to both MAML and IRM. We conduct numerical experiments with practical datasets and architectures for MAML and IRM and validate that the analysis of the gradient-norm-penalized loss captures well the empirical relationship between the constraint and the trainability of MAML and IRM.
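To make the analyzed objective concrete, the sketch below (assumptions of this summary, not the authors' code) writes out the gradient-norm-penalized loss, L_GP(θ) = L(θ) + λ‖∇θ L(θ)‖², and one gradient descent step on it; the names loss_fn, params, batch, lam, and lr are illustrative. For intuition on the stated relationship: the IRMv1 penalty is itself a squared gradient norm, and a first-order expansion of the one-step MAML objective also produces a gradient-norm term.

```python
# A minimal sketch, assuming a scalar loss `loss_fn(params, batch)` and a
# penalty weight `lam`; illustrative only, not the paper's implementation.
import jax
import jax.numpy as jnp

def gradient_norm_penalty_loss(loss_fn, params, batch, lam):
    """L_GP(theta) = L(theta) + lam * ||grad L(theta)||^2."""
    loss, grads = jax.value_and_grad(loss_fn)(params, batch)
    # Squared L2 norm of the gradient, summed over all parameter leaves.
    sq_norm = sum(jnp.sum(g ** 2) for g in jax.tree_util.tree_leaves(grads))
    return loss + lam * sq_norm

def gd_step(loss_fn, params, batch, lam, lr):
    """One gradient descent step on the penalized loss with learning rate lr."""
    g = jax.grad(gradient_norm_penalty_loss, argnums=1)(loss_fn, params, batch, lam)
    return jax.tree_util.tree_map(lambda p, gp: p - lr * gp, params, g)
```

Note that differentiating the penalty term requires differentiating through the inner gradient, i.e. Hessian-vector products, which JAX handles via higher-order autodiff; this second-order structure is what makes the penalized loss a tractable stand-in for the constrained MAML and IRM objectives.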