{"title":"基于卡尔曼时间差异学习的模型选择","authors":"Takehiro Kitao, Masato Shirai, T. Miura","doi":"10.1109/CIC.2017.00017","DOIUrl":null,"url":null,"abstract":"In this work we discuss how useful Kalman Temporal Difference (KTD) is for the purpose of improvement of multiple model learning. By KTD we mean a learning framework by combining Kalman Filters and Temporal Difference (TD) to enhance multi-agent environment. In this approach, we have to attack dependency issues against initialization parameters: the results (quality and efficiency) heavily depend on the parameters. In this investigation, we propose a new approach by estimating multiple models in parallel and by selecting suitable ones eventually.","PeriodicalId":156843,"journal":{"name":"2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)","volume":"121 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"Model Selection Based on Kalman Temporal Differences Learning\",\"authors\":\"Takehiro Kitao, Masato Shirai, T. Miura\",\"doi\":\"10.1109/CIC.2017.00017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this work we discuss how useful Kalman Temporal Difference (KTD) is for the purpose of improvement of multiple model learning. By KTD we mean a learning framework by combining Kalman Filters and Temporal Difference (TD) to enhance multi-agent environment. In this approach, we have to attack dependency issues against initialization parameters: the results (quality and efficiency) heavily depend on the parameters. In this investigation, we propose a new approach by estimating multiple models in parallel and by selecting suitable ones eventually.\",\"PeriodicalId\":156843,\"journal\":{\"name\":\"2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)\",\"volume\":\"121 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIC.2017.00017\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE 3rd International Conference on Collaboration and Internet Computing (CIC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIC.2017.00017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Model Selection Based on Kalman Temporal Differences Learning
In this work we discuss how useful Kalman Temporal Difference (KTD) is for the purpose of improvement of multiple model learning. By KTD we mean a learning framework by combining Kalman Filters and Temporal Difference (TD) to enhance multi-agent environment. In this approach, we have to attack dependency issues against initialization parameters: the results (quality and efficiency) heavily depend on the parameters. In this investigation, we propose a new approach by estimating multiple models in parallel and by selecting suitable ones eventually.