基于上下文的自适应机器人行为学习模型(CARB-LM)

Joohee Suh, Dean Frederick Hougen
{"title":"基于上下文的自适应机器人行为学习模型(CARB-LM)","authors":"Joohee Suh, Dean Frederick Hougen","doi":"10.1109/CICA.2014.7013253","DOIUrl":null,"url":null,"abstract":"An important, long-term objective of intelligent robotics is to develop robots that can learn about and adapt to new environments. We focus on developing a learning model that can build up new knowledge through direct experience with and feedback from an environment. We designed and constructed Context-based Adaptive Robot Behavior-Learning Model (CARB-LM) which is conceptually inspired by Hebbian and anti-Hebbian learning and by neuromodulation in neural networks. CARB-LM has two types of learning processes: (1) context-based learning and (2) reward-based learning. The former uses past accumulated positive experiences as analogies to current conditions, allowing the robot to infer likely rewarding behaviors, and the latter exploits current reward information so the robot can refine its behaviors based on current experience. The reward is acquired by checking the effect of the robot's behavior in the environment. As a first test of this model, we tasked a simulated TurtleBot robot with moving smoothly around a previously unexplored environment. We simulated this environment using ROS and Gazebo and performed experiments to evaluate the model. The robot showed substantial learning and greatly outperformed both a hand-coded controller and a randomly wandering robot.","PeriodicalId":340740,"journal":{"name":"2014 IEEE Symposium on Computational Intelligence in Control and Automation (CICA)","volume":"49 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Context-based adaptive robot behavior learning model (CARB-LM)\",\"authors\":\"Joohee Suh, Dean Frederick Hougen\",\"doi\":\"10.1109/CICA.2014.7013253\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An important, long-term objective of intelligent robotics is to develop robots that can learn about and adapt to new environments. We focus on developing a learning model that can build up new knowledge through direct experience with and feedback from an environment. We designed and constructed Context-based Adaptive Robot Behavior-Learning Model (CARB-LM) which is conceptually inspired by Hebbian and anti-Hebbian learning and by neuromodulation in neural networks. CARB-LM has two types of learning processes: (1) context-based learning and (2) reward-based learning. The former uses past accumulated positive experiences as analogies to current conditions, allowing the robot to infer likely rewarding behaviors, and the latter exploits current reward information so the robot can refine its behaviors based on current experience. The reward is acquired by checking the effect of the robot's behavior in the environment. As a first test of this model, we tasked a simulated TurtleBot robot with moving smoothly around a previously unexplored environment. We simulated this environment using ROS and Gazebo and performed experiments to evaluate the model. The robot showed substantial learning and greatly outperformed both a hand-coded controller and a randomly wandering robot.\",\"PeriodicalId\":340740,\"journal\":{\"name\":\"2014 IEEE Symposium on Computational Intelligence in Control and Automation (CICA)\",\"volume\":\"49 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 IEEE Symposium on Computational Intelligence in Control and Automation (CICA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CICA.2014.7013253\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 IEEE Symposium on Computational Intelligence in Control and Automation (CICA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CICA.2014.7013253","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

智能机器人的一个重要的长期目标是开发能够学习和适应新环境的机器人。我们专注于开发一种学习模式,可以通过对环境的直接体验和反馈来建立新知识。我们设计并构建了基于上下文的自适应机器人行为学习模型(CARB-LM),该模型在概念上受到Hebbian和anti-Hebbian学习以及神经网络中的神经调节的启发。CARB-LM有两种学习过程:(1)基于情境的学习和(2)基于奖励的学习。前者使用过去积累的积极经验作为当前条件的类比,允许机器人推断可能的奖励行为,后者利用当前奖励信息,使机器人可以根据当前经验改进其行为。通过检查机器人在环境中的行为效果来获得奖励。作为该模型的第一次测试,我们要求模拟的TurtleBot机器人在以前未探索过的环境中平稳移动。我们使用ROS和Gazebo模拟了这种环境,并进行了实验来评估模型。机器人表现出大量的学习能力,并且大大优于手动编码控制器和随机漫游机器人。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Context-based adaptive robot behavior learning model (CARB-LM)
An important, long-term objective of intelligent robotics is to develop robots that can learn about and adapt to new environments. We focus on developing a learning model that can build up new knowledge through direct experience with and feedback from an environment. We designed and constructed Context-based Adaptive Robot Behavior-Learning Model (CARB-LM) which is conceptually inspired by Hebbian and anti-Hebbian learning and by neuromodulation in neural networks. CARB-LM has two types of learning processes: (1) context-based learning and (2) reward-based learning. The former uses past accumulated positive experiences as analogies to current conditions, allowing the robot to infer likely rewarding behaviors, and the latter exploits current reward information so the robot can refine its behaviors based on current experience. The reward is acquired by checking the effect of the robot's behavior in the environment. As a first test of this model, we tasked a simulated TurtleBot robot with moving smoothly around a previously unexplored environment. We simulated this environment using ROS and Gazebo and performed experiments to evaluate the model. The robot showed substantial learning and greatly outperformed both a hand-coded controller and a randomly wandering robot.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信