情景记忆在多任务强化学习中的迁移

Q2 Psychology
Artyom Y. Sorokin, Mikhail S. Burtsev
{"title":"情景记忆在多任务强化学习中的迁移","authors":"Artyom Y. Sorokin,&nbsp;Mikhail S. Burtsev","doi":"10.1016/j.bica.2018.09.003","DOIUrl":null,"url":null,"abstract":"<div><p><span><span>Episodic memory plays important role in animal </span>behavior. It allows to reuse general skills for solution of specific tasks in changing environment. This beneficial feature of biological cognitive systems is still not incorporated successfully in an artificial neural architectures. In this paper we propose a neural architecture with shared episodic memory for multi-task </span>reinforcement learning<span> (SEM-PAAC). This architecture extends Parallel Advantage Actor Critic (PAAC) with two recurrent<span> sub-networks for separate tracking of environment and task states. The first subnetwork store episodic memory and the second one allows task specific execution of policy. Experiments in the Taxi domain demonstrated that SEM-PAAC has the same performance as PAAC when subtasks are solved separately. On the other hand when subtasks are solved jointly for completing full Taxi task SEM-PAAC is significantly better due to reuse of episodic memory. Proposed architecture also successfully learned to predict task completion. This is a step towards more autonomous agents for multitask problems.</span></span></p></div>","PeriodicalId":48756,"journal":{"name":"Biologically Inspired Cognitive Architectures","volume":"26 ","pages":"Pages 91-95"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1016/j.bica.2018.09.003","citationCount":"3","resultStr":"{\"title\":\"Episodic memory transfer for multi-task reinforcement learning\",\"authors\":\"Artyom Y. Sorokin,&nbsp;Mikhail S. Burtsev\",\"doi\":\"10.1016/j.bica.2018.09.003\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><p><span><span>Episodic memory plays important role in animal </span>behavior. It allows to reuse general skills for solution of specific tasks in changing environment. This beneficial feature of biological cognitive systems is still not incorporated successfully in an artificial neural architectures. In this paper we propose a neural architecture with shared episodic memory for multi-task </span>reinforcement learning<span> (SEM-PAAC). This architecture extends Parallel Advantage Actor Critic (PAAC) with two recurrent<span> sub-networks for separate tracking of environment and task states. The first subnetwork store episodic memory and the second one allows task specific execution of policy. Experiments in the Taxi domain demonstrated that SEM-PAAC has the same performance as PAAC when subtasks are solved separately. On the other hand when subtasks are solved jointly for completing full Taxi task SEM-PAAC is significantly better due to reuse of episodic memory. Proposed architecture also successfully learned to predict task completion. This is a step towards more autonomous agents for multitask problems.</span></span></p></div>\",\"PeriodicalId\":48756,\"journal\":{\"name\":\"Biologically Inspired Cognitive Architectures\",\"volume\":\"26 \",\"pages\":\"Pages 91-95\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1016/j.bica.2018.09.003\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biologically Inspired Cognitive Architectures\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2212683X18300902\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"Psychology\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biologically Inspired Cognitive Architectures","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2212683X18300902","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Psychology","Score":null,"Total":0}
引用次数: 3

摘要

情景记忆在动物行为中起着重要的作用。它允许在不断变化的环境中重用通用技能来解决特定任务。生物认知系统的这一有益特征仍未成功地纳入人工神经体系结构。本文提出了一种基于共享情景记忆的多任务强化学习(SEM-PAAC)神经结构。该体系结构扩展了并行优势参与者批评(PAAC),使用两个循环子网络分别跟踪环境和任务状态。第一子网存储情景记忆,第二子网允许特定于任务的策略执行。Taxi领域的实验表明,SEM-PAAC在单独求解子任务时具有与PAAC相同的性能。另一方面,由于情景记忆的重用,当子任务联合解决以完成完整的出租车任务时,SEM-PAAC显著更好。提出的架构也成功地学会了预测任务的完成。这是向多任务问题的更自主代理迈出的一步。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Episodic memory transfer for multi-task reinforcement learning

Episodic memory plays important role in animal behavior. It allows to reuse general skills for solution of specific tasks in changing environment. This beneficial feature of biological cognitive systems is still not incorporated successfully in an artificial neural architectures. In this paper we propose a neural architecture with shared episodic memory for multi-task reinforcement learning (SEM-PAAC). This architecture extends Parallel Advantage Actor Critic (PAAC) with two recurrent sub-networks for separate tracking of environment and task states. The first subnetwork store episodic memory and the second one allows task specific execution of policy. Experiments in the Taxi domain demonstrated that SEM-PAAC has the same performance as PAAC when subtasks are solved separately. On the other hand when subtasks are solved jointly for completing full Taxi task SEM-PAAC is significantly better due to reuse of episodic memory. Proposed architecture also successfully learned to predict task completion. This is a step towards more autonomous agents for multitask problems.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Biologically Inspired Cognitive Architectures
Biologically Inspired Cognitive Architectures COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCEN-NEUROSCIENCES
CiteScore
3.60
自引率
0.00%
发文量
0
期刊介绍: Announcing the merge of Biologically Inspired Cognitive Architectures with Cognitive Systems Research. Cognitive Systems Research is dedicated to the study of human-level cognition. As such, it welcomes papers which advance the understanding, design and applications of cognitive and intelligent systems, both natural and artificial. The journal brings together a broad community studying cognition in its many facets in vivo and in silico, across the developmental spectrum, focusing on individual capacities or on entire architectures. It aims to foster debate and integrate ideas, concepts, constructs, theories, models and techniques from across different disciplines and different perspectives on human-level cognition. The scope of interest includes the study of cognitive capacities and architectures - both brain-inspired and non-brain-inspired - and the application of cognitive systems to real-world problems as far as it offers insights relevant for the understanding of cognition. Cognitive Systems Research therefore welcomes mature and cutting-edge research approaching cognition from a systems-oriented perspective, both theoretical and empirically-informed, in the form of original manuscripts, short communications, opinion articles, systematic reviews, and topical survey articles from the fields of Cognitive Science (including Philosophy of Cognitive Science), Artificial Intelligence/Computer Science, Cognitive Robotics, Developmental Science, Psychology, and Neuroscience and Neuromorphic Engineering. Empirical studies will be considered if they are supplemented by theoretical analyses and contributions to theory development and/or computational modelling studies.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信