{"title":"基于强化学习算法的多自由度水下机器人运动获取","authors":"Yongfeng Han, H. Kimura","doi":"10.1109/TENCON.2010.5686136","DOIUrl":null,"url":null,"abstract":"This paper deals with motions obtaining of an underwater robot arm which have multi-degree of freedom by using reinforcement learning algorithms. A natural gradient Actor-Critic algorithm which uses Eligibility Traces is applied to the robot arm. In this algorithm, motion planning problems are modeled as finite state Markov decision processes. The robot arm is developed to have 4 joints, each joint consists 1 servo motor. The experiment results show the robot arm successfully learning to swim by feasible learning steps.","PeriodicalId":101683,"journal":{"name":"TENCON 2010 - 2010 IEEE Region 10 Conference","volume":"14 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2010-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Motions obtaining of multi-degree-freedom underwater robot by using reinforcement learning algorithms\",\"authors\":\"Yongfeng Han, H. Kimura\",\"doi\":\"10.1109/TENCON.2010.5686136\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper deals with motions obtaining of an underwater robot arm which have multi-degree of freedom by using reinforcement learning algorithms. A natural gradient Actor-Critic algorithm which uses Eligibility Traces is applied to the robot arm. In this algorithm, motion planning problems are modeled as finite state Markov decision processes. The robot arm is developed to have 4 joints, each joint consists 1 servo motor. The experiment results show the robot arm successfully learning to swim by feasible learning steps.\",\"PeriodicalId\":101683,\"journal\":{\"name\":\"TENCON 2010 - 2010 IEEE Region 10 Conference\",\"volume\":\"14 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2010-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"TENCON 2010 - 2010 IEEE Region 10 Conference\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/TENCON.2010.5686136\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"TENCON 2010 - 2010 IEEE Region 10 Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/TENCON.2010.5686136","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Motions obtaining of multi-degree-freedom underwater robot by using reinforcement learning algorithms
This paper deals with motions obtaining of an underwater robot arm which have multi-degree of freedom by using reinforcement learning algorithms. A natural gradient Actor-Critic algorithm which uses Eligibility Traces is applied to the robot arm. In this algorithm, motion planning problems are modeled as finite state Markov decision processes. The robot arm is developed to have 4 joints, each joint consists 1 servo motor. The experiment results show the robot arm successfully learning to swim by feasible learning steps.