{"title":"认知有效的终身学习模式","authors":"Hanne Say, E. Oztop","doi":"10.1109/ROBIO58561.2023.10355028","DOIUrl":null,"url":null,"abstract":"In continual learning, usually a sequence of tasks are given to a learning agent and the performance of the agent after learning is measured in terms of resistance to catastrophic forgetting, efficacy of knowledge transfer and overall performance on the individual tasks. On the other hand, in multi-task learning, the system is designed to simultaneously acquire knowledge in multiple tasks, often through offline batch learning. A more cognitively valid scenario for lifelong robot learning would be to have a robotic agent to autonomously decide which task to engage and disengage while leveraging many-to-many knowledge transfer ability among tasks during online learning. In this study, we propose a novel lifelong robot learning architecture to fulfill the aforementioned desiderata, and show its validity in an environment where a robot learns the effects of its actions in different task settings. To realize the proposed model, we adopt learning progress measure for task selection, and have the tasks learn by independent neural networks with special structure that allows access to the neural layers of the non-selected tasks. The experiments conducted with a simulated robot arm in an object interaction scenario show that the proposed architecture yields better knowledge transfer and facilitates faster learning compared to baselines of fixed sequence task learning and isolated task learners with no knowledge transfer.","PeriodicalId":505134,"journal":{"name":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","volume":"83 8","pages":"1-7"},"PeriodicalIF":0.0000,"publicationDate":"2023-12-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A Model for Cognitively Valid Lifelong Learning\",\"authors\":\"Hanne Say, E. Oztop\",\"doi\":\"10.1109/ROBIO58561.2023.10355028\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In continual learning, usually a sequence of tasks are given to a learning agent and the performance of the agent after learning is measured in terms of resistance to catastrophic forgetting, efficacy of knowledge transfer and overall performance on the individual tasks. On the other hand, in multi-task learning, the system is designed to simultaneously acquire knowledge in multiple tasks, often through offline batch learning. A more cognitively valid scenario for lifelong robot learning would be to have a robotic agent to autonomously decide which task to engage and disengage while leveraging many-to-many knowledge transfer ability among tasks during online learning. In this study, we propose a novel lifelong robot learning architecture to fulfill the aforementioned desiderata, and show its validity in an environment where a robot learns the effects of its actions in different task settings. To realize the proposed model, we adopt learning progress measure for task selection, and have the tasks learn by independent neural networks with special structure that allows access to the neural layers of the non-selected tasks. 
The experiments conducted with a simulated robot arm in an object interaction scenario show that the proposed architecture yields better knowledge transfer and facilitates faster learning compared to baselines of fixed sequence task learning and isolated task learners with no knowledge transfer.\",\"PeriodicalId\":505134,\"journal\":{\"name\":\"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)\",\"volume\":\"83 8\",\"pages\":\"1-7\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-12-04\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ROBIO58561.2023.10355028\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 IEEE International Conference on Robotics and Biomimetics (ROBIO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ROBIO58561.2023.10355028","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
In continual learning, a learning agent is typically given a sequence of tasks, and its post-learning performance is measured in terms of resistance to catastrophic forgetting, efficacy of knowledge transfer, and overall performance on the individual tasks. In multi-task learning, by contrast, the system is designed to acquire knowledge across multiple tasks simultaneously, often through offline batch learning. A more cognitively valid scenario for lifelong robot learning would be to have a robotic agent autonomously decide which task to engage with and disengage from, while leveraging many-to-many knowledge transfer among tasks during online learning. In this study, we propose a novel lifelong robot learning architecture to fulfill the aforementioned desiderata, and show its validity in an environment where a robot learns the effects of its actions in different task settings. To realize the proposed model, we adopt a learning progress measure for task selection, and have each task learned by an independent neural network whose special structure allows access to the neural layers of the non-selected tasks. Experiments conducted with a simulated robot arm in an object interaction scenario show that the proposed architecture yields better knowledge transfer and facilitates faster learning compared to baselines of fixed-sequence task learning and isolated task learners with no knowledge transfer.
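To make the task-selection idea concrete, the following is a minimal sketch of learning-progress-based task selection, not the authors' implementation: the paper does not specify this exact formulation, and the class name, window size, and epsilon-greedy exploration below are illustrative assumptions. Learning progress is approximated as the drop in mean prediction error between an older and a more recent window of per-task errors, and the agent engages the task whose error is currently decreasing fastest.

    # Illustrative sketch only; names, window size, and exploration scheme are assumptions.
    from collections import deque
    import random


    class LearningProgressSelector:
        def __init__(self, task_ids, window=20, epsilon=0.1):
            self.window = window      # samples per half-window
            self.epsilon = epsilon    # exploration rate (assumed, not from the paper)
            self.errors = {t: deque(maxlen=2 * window) for t in task_ids}

        def record_error(self, task_id, error):
            """Store the latest prediction error observed while practicing task_id."""
            self.errors[task_id].append(error)

        def learning_progress(self, task_id):
            """Older-window mean error minus recent-window mean error (positive = improving)."""
            buf = list(self.errors[task_id])
            if len(buf) < 2 * self.window:
                return float("inf")   # under-sampled tasks are treated as maximally interesting
            old, recent = buf[:self.window], buf[self.window:]
            return sum(old) / len(old) - sum(recent) / len(recent)

        def select_task(self):
            """Pick the task with the highest estimated learning progress (epsilon-greedy)."""
            tasks = list(self.errors)
            if random.random() < self.epsilon:
                return random.choice(tasks)
            return max(tasks, key=self.learning_progress)

A loop built around such a selector would call select_task(), practice the chosen task for a few steps, and feed the resulting prediction errors back via record_error(); prioritizing the steepest error decrease, rather than the lowest error, keeps the agent from dwelling on tasks that are already mastered or currently unlearnable.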