{"title":"基于结构化学习的伙伴机器人轨迹生成与积累","authors":"Y. Nojima, N. Kubota, F. Kojima","doi":"10.1109/CEC.2004.1331173","DOIUrl":null,"url":null,"abstract":"The aim of This work is to develop partner robots that can obtain and accumulate human-friendly behaviors. To realize it, we use a concept of structured learning which emphasizes the importance of an interactive learning of several modules through interaction with its environment. In a proposed method, a robot obtains hand-to-hand behavior by using an interactive evolutionary computation based on human evaluations estimated by fuzzy state-value functions. Moreover, a self-organizing map is used for clustering human hand positions. A state-value function and a knowledge database are assigned to each clustered positions. Furthermore, the best trajectory is stored in the knowledge database to reuse it in the same situation. Some experimental results show the effectiveness of the proposed method.","PeriodicalId":152088,"journal":{"name":"Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2004-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Trajectory generation and accumulation for partner robots based on structured learning\",\"authors\":\"Y. Nojima, N. Kubota, F. Kojima\",\"doi\":\"10.1109/CEC.2004.1331173\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The aim of This work is to develop partner robots that can obtain and accumulate human-friendly behaviors. To realize it, we use a concept of structured learning which emphasizes the importance of an interactive learning of several modules through interaction with its environment. In a proposed method, a robot obtains hand-to-hand behavior by using an interactive evolutionary computation based on human evaluations estimated by fuzzy state-value functions. Moreover, a self-organizing map is used for clustering human hand positions. A state-value function and a knowledge database are assigned to each clustered positions. Furthermore, the best trajectory is stored in the knowledge database to reuse it in the same situation. Some experimental results show the effectiveness of the proposed method.\",\"PeriodicalId\":152088,\"journal\":{\"name\":\"Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753)\",\"volume\":\"98 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2004-06-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CEC.2004.1331173\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2004 Congress on Evolutionary Computation (IEEE Cat. No.04TH8753)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CEC.2004.1331173","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Trajectory generation and accumulation for partner robots based on structured learning
The aim of This work is to develop partner robots that can obtain and accumulate human-friendly behaviors. To realize it, we use a concept of structured learning which emphasizes the importance of an interactive learning of several modules through interaction with its environment. In a proposed method, a robot obtains hand-to-hand behavior by using an interactive evolutionary computation based on human evaluations estimated by fuzzy state-value functions. Moreover, a self-organizing map is used for clustering human hand positions. A state-value function and a knowledge database are assigned to each clustered positions. Furthermore, the best trajectory is stored in the knowledge database to reuse it in the same situation. Some experimental results show the effectiveness of the proposed method.