Lifelong Learning of Task-Parameter Relationships for Knowledge Transfer

S. Srivastava, Mohammad Yaqub, K. Nandakumar
{"title":"Lifelong Learning of Task-Parameter Relationships for Knowledge Transfer","authors":"S. Srivastava, Mohammad Yaqub, K. Nandakumar","doi":"10.1109/CVPRW59228.2023.00251","DOIUrl":null,"url":null,"abstract":"The ability to acquire new skills and knowledge continually is one of the defining qualities of the human brain, which is critically missing in most modern machine vision systems. In this work, we focus on knowledge transfer in the lifelong learning setting. We propose a lifelong learner that models the similarities between the optimal weight spaces of tasks and exploits this in order to enable knowledge transfer across tasks in a continual learning setting. To characterize the \"task-parameter relationships\", we propose a metric called adaptation rate integral (ARI), which measures the expected rate of adaptation over a finite number of steps for a (task, parameter) pair. These task-parameter relationships are learned using an auxiliary network trained on guided explorations of parameter space. The learned auxiliary network is then used to heuristically select the best parameter sets on seen tasks, which are consolidated using a hypernetwork. Given a new (unseen) task, knowledge transfer occurs through the selection of the most suitable parameter set from the hypernetwork that can be rapidly finetuned. 
We show that the proposed approach can improve knowledge transfer between tasks across standard benchmarks without any increase in overall model capacity, while naturally mitigating catastrophic forgetting.","PeriodicalId":355438,"journal":{"name":"2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)","volume":"30 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPRW59228.2023.00251","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The ability to acquire new skills and knowledge continually is one of the defining qualities of the human brain, which is critically missing in most modern machine vision systems. In this work, we focus on knowledge transfer in the lifelong learning setting. We propose a lifelong learner that models the similarities between the optimal weight spaces of tasks and exploits this in order to enable knowledge transfer across tasks in a continual learning setting. To characterize the "task-parameter relationships", we propose a metric called adaptation rate integral (ARI), which measures the expected rate of adaptation over a finite number of steps for a (task, parameter) pair. These task-parameter relationships are learned using an auxiliary network trained on guided explorations of parameter space. The learned auxiliary network is then used to heuristically select the best parameter sets on seen tasks, which are consolidated using a hypernetwork. Given a new (unseen) task, knowledge transfer occurs through the selection of the most suitable parameter set from the hypernetwork that can be rapidly finetuned. We show that the proposed approach can improve knowledge transfer between tasks across standard benchmarks without any increase in overall model capacity, while naturally mitigating catastrophic forgetting.
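The abstract defines ARI only informally, as the expected rate of adaptation over a finite number of steps for a (task, parameter) pair. One plausible instantiation, sketched below, accumulates the fractional loss reduction achieved at each of a fixed number of gradient steps; the exact normalization and expectation used in the paper may differ, and the toy quadratic tasks, function names, and hyperparameters here are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def adaptation_rate_integral(loss_fn, grad_fn, theta0, lr=0.1, steps=5, eps=1e-12):
    """Illustrative ARI-style score for a (task, parameter) pair:
    accumulate the fractional loss reduction at each of a fixed
    number of gradient steps. Higher means faster adaptation."""
    theta = np.asarray(theta0, dtype=float).copy()
    score = 0.0
    for _ in range(steps):
        prev = loss_fn(theta)
        theta = theta - lr * grad_fn(theta)  # one step of plain gradient descent
        score += (prev - loss_fn(theta)) / (prev + eps)
    return score

# Two toy tasks sharing an optimum but with different curvature:
# from the same starting parameters, the steeper task adapts faster.
def make_task(curvature, optimum):
    loss = lambda th: curvature * float(np.sum((th - optimum) ** 2))
    grad = lambda th: 2.0 * curvature * (th - optimum)
    return loss, grad

opt = np.array([1.0, -2.0])
loss_steep, grad_steep = make_task(1.0, opt)
loss_flat, grad_flat = make_task(0.25, opt)

start = [4.0, 3.0]
ari_steep = adaptation_rate_integral(loss_steep, grad_steep, start)
ari_flat = adaptation_rate_integral(loss_flat, grad_flat, start)
# A selector comparing these scores would prefer the steeper task,
# i.e. the (task, parameter) pair with the faster expected adaptation.
```

In the paper this kind of score is predicted by an auxiliary network rather than computed by actually running the adaptation steps, which is what makes heuristic selection over many stored parameter sets cheap at transfer time.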