Getting Priorities Right: Intrinsic Motivation with Multi-Objective Reinforcement Learning

2022 IEEE International Conference on Development and Learning (ICDL) Pub Date : 2022-09-12 DOI:10.1109/ICDL53763.2022.9962187

Yusuf Al-Husaini, Matthias Rolf

{"title":"Getting Priorities Right: Intrinsic Motivation with Multi-Objective Reinforcement Learning","authors":"Yusuf Al-Husaini, Matthias Rolf","doi":"10.1109/ICDL53763.2022.9962187","DOIUrl":null,"url":null,"abstract":"Intrinsic motivation is a common method to facilitate exploration in reinforcement learning agents. Curiosity is thereby supposed to aid the learning of a primary goal. However, indulging in curiosity may also stand in conflict with more urgent or essential objectives such as self-sustenance. This paper addresses the problem of balancing curiosity, and correctly prioritising other needs in a reinforcement learning context. We demonstrate the use of the multi-objective reinforcement learning framework C-MORE to integrate curiosity, and compare results to a standard linear reinforcement learning integration. Results clearly demonstrate that curiosity can be modelled with the priority-objective reinforcement learning paradigm. In particular, C-MORE is found to explore robustly while maintaining self-sustenance objectives, whereas the linear approach is found to over-explore and take unnecessary risks. The findings demonstrate a significant weakness of the common linear integration method for intrinsic motivation, and the need to acknowledge the potential conflicts between curiosity and other objectives in a multi-objective framework.","PeriodicalId":274171,"journal":{"name":"2022 IEEE International Conference on Development and Learning (ICDL)","volume":"86 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE International Conference on Development and Learning (ICDL)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDL53763.2022.9962187","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Intrinsic motivation is a common method to facilitate exploration in reinforcement learning agents. Curiosity is thereby supposed to aid the learning of a primary goal. However, indulging in curiosity may also stand in conflict with more urgent or essential objectives such as self-sustenance. This paper addresses the problem of balancing curiosity, and correctly prioritising other needs in a reinforcement learning context. We demonstrate the use of the multi-objective reinforcement learning framework C-MORE to integrate curiosity, and compare results to a standard linear reinforcement learning integration. Results clearly demonstrate that curiosity can be modelled with the priority-objective reinforcement learning paradigm. In particular, C-MORE is found to explore robustly while maintaining self-sustenance objectives, whereas the linear approach is found to over-explore and take unnecessary risks. The findings demonstrate a significant weakness of the common linear integration method for intrinsic motivation, and the need to acknowledge the potential conflicts between curiosity and other objectives in a multi-objective framework.

查看原文本刊更多论文

正确获得优先级:多目标强化学习的内在动机

在强化学习智能体中，内在动机是促进探索的常用方法。因此，好奇心应该有助于学习一个主要目标。然而，沉迷于好奇心也可能与更紧迫或更重要的目标(如自我维持)相冲突。本文解决了在强化学习环境中平衡好奇心和正确优先考虑其他需求的问题。我们演示了使用多目标强化学习框架C-MORE来整合好奇心，并将结果与标准线性强化学习集成进行比较。结果清楚地表明，好奇心可以用优先目标强化学习范式建模。特别是，C-MORE被发现在保持自我维持目标的同时进行稳健的探索，而线性方法被发现过度探索并承担不必要的风险。研究结果表明，普通的内在动机线性整合方法存在明显的弱点，需要承认在多目标框架中好奇心和其他目标之间的潜在冲突。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE International Conference on Development and Learning (ICDL)

自引率

0.00%

发文量