可以走路的腿:基于实例的模块化强化学习应用

2005 International Symposium on Computational Intelligence in Robotics and Automation Pub Date : 2005-06-27 DOI:10.1109/CIRA.2005.1554304

D. Jacob, D. Polani, Chrystopher L. Nehaniv

{"title":"可以走路的腿:基于实例的模块化强化学习应用","authors":"D. Jacob, D. Polani, Chrystopher L. Nehaniv","doi":"10.1109/CIRA.2005.1554304","DOIUrl":null,"url":null,"abstract":"Experiments to illustrate a novel methodology for reinforcement learning in embodied physical agents are described. A simulated legged robot is decomposed into structure-based modules following the authors' EMBER principles of local sensing, action and learning. The legs are individually trained to 'walk' in isolation, and re-attached to the robot; walking is then sufficiently stable that learning in situ can continue. The experiments demonstrate the benefits of the modular decomposition: state-space factorisation leads to faster learning, in this case to the extent that an otherwise intractable problem becomes learnable.","PeriodicalId":162553,"journal":{"name":"2005 International Symposium on Computational Intelligence in Robotics and Automation","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Legs that can walk: embodiment-based modular reinforcement learning applied\",\"authors\":\"D. Jacob, D. Polani, Chrystopher L. Nehaniv\",\"doi\":\"10.1109/CIRA.2005.1554304\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Experiments to illustrate a novel methodology for reinforcement learning in embodied physical agents are described. A simulated legged robot is decomposed into structure-based modules following the authors' EMBER principles of local sensing, action and learning. The legs are individually trained to 'walk' in isolation, and re-attached to the robot; walking is then sufficiently stable that learning in situ can continue. The experiments demonstrate the benefits of the modular decomposition: state-space factorisation leads to faster learning, in this case to the extent that an otherwise intractable problem becomes learnable.\",\"PeriodicalId\":162553,\"journal\":{\"name\":\"2005 International Symposium on Computational Intelligence in Robotics and Automation\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2005-06-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2005 International Symposium on Computational Intelligence in Robotics and Automation\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIRA.2005.1554304\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2005 International Symposium on Computational Intelligence in Robotics and Automation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIRA.2005.1554304","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

摘要

本文描述了一种新的强化学习方法的实验。根据作者提出的局部感知、动作和学习的余烬原理，将仿真机器人分解为基于结构的模块。这些腿经过单独训练，可以独立“行走”，并重新连接到机器人上;然后走路足够稳定，可以继续原地学习。实验证明了模块化分解的好处:状态空间分解导致更快的学习，在这种情况下，一个原本难以解决的问题变得可学习。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Legs that can walk: embodiment-based modular reinforcement learning applied

Experiments to illustrate a novel methodology for reinforcement learning in embodied physical agents are described. A simulated legged robot is decomposed into structure-based modules following the authors' EMBER principles of local sensing, action and learning. The legs are individually trained to 'walk' in isolation, and re-attached to the robot; walking is then sufficiently stable that learning in situ can continue. The experiments demonstrate the benefits of the modular decomposition: state-space factorisation leads to faster learning, in this case to the extent that an otherwise intractable problem becomes learnable.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2005 International Symposium on Computational Intelligence in Robotics and Automation

自引率

0.00%

发文量