{"title":"基于利润分享的双足步行机器人在pomdp环境下的避跌策略研究","authors":"Toshihiro Suzuki, Y. Osana","doi":"10.1109/NaBIC.2014.6921875","DOIUrl":null,"url":null,"abstract":"In this paper, fall avoidance of bipedal walking robot is realized by the Profit Sharing that can learn deterministic policy for POMDPs environments. In this research, the Profit Sharing that can learn deterministic policy for POMDPs environments which can obtain the deterministic policy by using the history of observations is employed. We carried out a series of experiments using bipedal walking robot, and confirmed that attitude control can be realized by the Profit Sharing that can learn deterministic policy for POMDPs environments.","PeriodicalId":209716,"journal":{"name":"2014 Sixth World Congress on Nature and Biologically Inspired Computing (NaBIC 2014)","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2014-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Fall avoidance of bipedalwalking robot by profit sharing that can learn deterministic policy for POMDPs environments\",\"authors\":\"Toshihiro Suzuki, Y. Osana\",\"doi\":\"10.1109/NaBIC.2014.6921875\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, fall avoidance of bipedal walking robot is realized by the Profit Sharing that can learn deterministic policy for POMDPs environments. In this research, the Profit Sharing that can learn deterministic policy for POMDPs environments which can obtain the deterministic policy by using the history of observations is employed. We carried out a series of experiments using bipedal walking robot, and confirmed that attitude control can be realized by the Profit Sharing that can learn deterministic policy for POMDPs environments.\",\"PeriodicalId\":209716,\"journal\":{\"name\":\"2014 Sixth World Congress on Nature and Biologically Inspired Computing (NaBIC 2014)\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 Sixth World Congress on Nature and Biologically Inspired Computing (NaBIC 2014)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/NaBIC.2014.6921875\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 Sixth World Congress on Nature and Biologically Inspired Computing (NaBIC 2014)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NaBIC.2014.6921875","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Fall avoidance of bipedalwalking robot by profit sharing that can learn deterministic policy for POMDPs environments
In this paper, fall avoidance of bipedal walking robot is realized by the Profit Sharing that can learn deterministic policy for POMDPs environments. In this research, the Profit Sharing that can learn deterministic policy for POMDPs environments which can obtain the deterministic policy by using the history of observations is employed. We carried out a series of experiments using bipedal walking robot, and confirmed that attitude control can be realized by the Profit Sharing that can learn deterministic policy for POMDPs environments.