{"title":"面向机器人导航任务的自主知识转移框架","authors":"Weiwei Yu, Shen Wang, K. Madani, Huashan Feng","doi":"10.1109/ICAWST.2013.6765408","DOIUrl":null,"url":null,"abstract":"This paper proposes a framework of knowledge transfer used in the robot navigation problem, in order to speedup the learning process when the robot meets new or more complex task. Take the graph of source navigation maps as the template library, Principal Component Analysis method is adopted to match the target graph with source template. Then, the robot recognizes the similar source task. The value-function of the selected source task, which correlates the cumulative rewards with the sensory information corresponding to the generalized distance information to each beacon, is transferred to the target task. With this approach, the robot can choose the most related knowledge from the source tasks and transfer the generalized knowledge to the target navigation task automatically without handed code mapping between tasks. Experimental results demonstrate that the presented transfer method can yield a remarkable speedup for learning process in the robot navigation task.","PeriodicalId":68697,"journal":{"name":"炎黄地理","volume":"43 1","pages":"52-58"},"PeriodicalIF":0.0000,"publicationDate":"2013-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A framework of autonomous knowledge transfer for robot navigation task\",\"authors\":\"Weiwei Yu, Shen Wang, K. Madani, Huashan Feng\",\"doi\":\"10.1109/ICAWST.2013.6765408\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper proposes a framework of knowledge transfer used in the robot navigation problem, in order to speedup the learning process when the robot meets new or more complex task. Take the graph of source navigation maps as the template library, Principal Component Analysis method is adopted to match the target graph with source template. Then, the robot recognizes the similar source task. The value-function of the selected source task, which correlates the cumulative rewards with the sensory information corresponding to the generalized distance information to each beacon, is transferred to the target task. With this approach, the robot can choose the most related knowledge from the source tasks and transfer the generalized knowledge to the target navigation task automatically without handed code mapping between tasks. Experimental results demonstrate that the presented transfer method can yield a remarkable speedup for learning process in the robot navigation task.\",\"PeriodicalId\":68697,\"journal\":{\"name\":\"炎黄地理\",\"volume\":\"43 1\",\"pages\":\"52-58\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2013-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"炎黄地理\",\"FirstCategoryId\":\"1089\",\"ListUrlMain\":\"https://doi.org/10.1109/ICAWST.2013.6765408\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"炎黄地理","FirstCategoryId":"1089","ListUrlMain":"https://doi.org/10.1109/ICAWST.2013.6765408","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A framework of autonomous knowledge transfer for robot navigation task
This paper proposes a framework of knowledge transfer used in the robot navigation problem, in order to speedup the learning process when the robot meets new or more complex task. Take the graph of source navigation maps as the template library, Principal Component Analysis method is adopted to match the target graph with source template. Then, the robot recognizes the similar source task. The value-function of the selected source task, which correlates the cumulative rewards with the sensory information corresponding to the generalized distance information to each beacon, is transferred to the target task. With this approach, the robot can choose the most related knowledge from the source tasks and transfer the generalized knowledge to the target navigation task automatically without handed code mapping between tasks. Experimental results demonstrate that the presented transfer method can yield a remarkable speedup for learning process in the robot navigation task.