Partitioning Sensorimotor Space by Predictability Principle in Intrinsic Motivation Systems

2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob) Pub Date : 2018-09-01 DOI:10.1109/DEVLRN.2018.8760504

M. Sener, Emre Ugur

{"title":"Partitioning Sensorimotor Space by Predictability Principle in Intrinsic Motivation Systems","authors":"M. Sener, Emre Ugur","doi":"10.1109/DEVLRN.2018.8760504","DOIUrl":null,"url":null,"abstract":"Inspired by infant development, intrinsic motivation (IM) guides the robot with intelligent exploration strategies, enabling efficient and effective learning in high-dimensional search spaces. A particular method in IM, namely Intelligent Adaptive Curiosity (IAC), adaptively partitions agents sensorimotor space $(\\mathrm{S}\\mathbb{M})$ into regions of exploration, and guides the agent to select the regions that are in the moderate level of difficulty, and learns separate experts for different regions. Therefore, the means of partitioning the $\\mathbb{SM}$ and the mechanisms behind region generation is of utmost importance. In this study, we propose a method for partitioning the space that allows maximizing the performances of the experts that will be responsible for learning skills. In brief, for each potential partitioning, the error of the experts are calculated and the partitioning that would generate the minimal error in the future is selected. Our method is evaluated in a setting with a simulated robot that learns predicting the next state given the current state and the action taken in an environment composed of regions with different properties. We verified the proposed method, SM is partitioned into more semantically meaningful regions adapting environment dynamics, the exploration of the robot in these regions can better exploit IM mechanisms and the system learn more efficiently and effectively i.e. with higher performance in a shorter time, compared to a baseline method.","PeriodicalId":236346,"journal":{"name":"2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/DEVLRN.2018.8760504","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 4

Abstract

Inspired by infant development, intrinsic motivation (IM) guides the robot with intelligent exploration strategies, enabling efficient and effective learning in high-dimensional search spaces. A particular method in IM, namely Intelligent Adaptive Curiosity (IAC), adaptively partitions agents sensorimotor space $(\mathrm{S}\mathbb{M})$ into regions of exploration, and guides the agent to select the regions that are in the moderate level of difficulty, and learns separate experts for different regions. Therefore, the means of partitioning the $\mathbb{SM}$ and the mechanisms behind region generation is of utmost importance. In this study, we propose a method for partitioning the space that allows maximizing the performances of the experts that will be responsible for learning skills. In brief, for each potential partitioning, the error of the experts are calculated and the partitioning that would generate the minimal error in the future is selected. Our method is evaluated in a setting with a simulated robot that learns predicting the next state given the current state and the action taken in an environment composed of regions with different properties. We verified the proposed method, SM is partitioned into more semantically meaningful regions adapting environment dynamics, the exploration of the robot in these regions can better exploit IM mechanisms and the system learn more efficiently and effectively i.e. with higher performance in a shorter time, compared to a baseline method.

查看原文本刊更多论文

用内在动机系统的可预见性原则划分感觉运动空间

受婴儿发育的启发，内在动机(IM)引导机器人采用智能的探索策略，在高维搜索空间中实现高效的学习。IM中有一种特殊的方法，即智能自适应好奇心(IAC)，它自适应地将智能体的感觉运动空间$(\ mathm {S}\mathbb{M})$划分为探索区域，并引导智能体选择难度适中的区域，并为不同的区域学习单独的专家。因此，划分$\mathbb{SM}$的方法和区域生成背后的机制是至关重要的。在这项研究中，我们提出了一种划分空间的方法，使负责学习技能的专家的表现最大化。简而言之，对于每个可能的分区，计算专家的误差，并选择将来产生最小误差的分区。我们的方法是在一个模拟机器人的设置中进行评估的，该机器人在给定当前状态的情况下学习预测下一个状态，并在由不同属性的区域组成的环境中采取行动。我们验证了所提出的方法，SM被划分为适应环境动态的更有语义意义的区域，机器人在这些区域的探索可以更好地利用IM机制，系统学习效率更高，即在更短的时间内具有更高的性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2018 Joint IEEE 8th International Conference on Development and Learning and Epigenetic Robotics (ICDL-EpiRob)

自引率

0.00%

发文量