Toshiaki Takano, H. Takase, H. Kawanaka, S. Tsuruoka
Title: Transfer Method for Reinforcement Learning in Same Transition Model -- Quick Approach and Preferential Exploration
Published in: 2011 10th International Conference on Machine Learning and Applications and Workshops
Publication date: 2011-12-18
DOI: 10.1109/ICMLA.2011.148 (https://doi.org/10.1109/ICMLA.2011.148)
Citations: 7
Abstract
We aim to accelerate reinforcement learning through transfer learning, the idea being that knowledge acquired while solving similar tasks speeds up the learning of a target task. We previously proposed a basic transfer method based on a forbidden rule set, that is, a set of rules that cause immediate failure of the target task. However, the basic method works poorly in the "Same Transition Model," in which the tasks have the same state transition probabilities but different goals. In this article, we propose an effective transfer learning method for the same transition model. It consists of two strategies: (1) quickly approaching the goal of the selected source task, and (2) preferentially exploring the states around the goal.
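The two strategies can be illustrated with a minimal action-selection sketch. This is not the authors' algorithm: the abstract gives no implementation details, so everything here is an assumption made for illustration, including the function name `select_action`, the dictionary Q-tables `q_source` and `q_target`, grid states as `(x, y)` tuples, the Manhattan-distance `radius`, the exploration rates `eps_near` and `eps_far`, and the reading of "the goal" in strategy (2) as the source task's goal.

```python
import random


def select_action(state, q_source, q_target, source_goal, reached_source_goal,
                  actions, radius=2, eps_near=0.5, eps_far=0.1, rng=random):
    """Pick an action for the target task (hypothetical sketch, not the paper's method).

    state, source_goal: (x, y) grid coordinates (assumed representation).
    q_source, q_target: dicts mapping (state, action) to a value estimate.
    reached_source_goal: True once the agent has visited the source goal
    in the current episode.
    """
    # Strategy (1), as assumed here: until the source goal has been reached,
    # follow the source task's greedy policy to get there quickly.
    if not reached_source_goal:
        return max(actions, key=lambda a: q_source.get((state, a), 0.0))

    # Strategy (2), as assumed here: explore more often while the agent is
    # within `radius` (Manhattan distance) of the source goal.
    dist = abs(state[0] - source_goal[0]) + abs(state[1] - source_goal[1])
    eps = eps_near if dist <= radius else eps_far
    if rng.random() < eps:
        return rng.choice(actions)

    # Otherwise act greedily with respect to the target task's own estimates.
    return max(actions, key=lambda a: q_target.get((state, a), 0.0))
```

Reusing the source Q-table unchanged in the first phase relies on the same-transition assumption: an action sequence that reaches the source goal in the source task reaches the same state in the target task. How the target goal relates to the source goal is not stated in the abstract, so the neighbourhood-based exploration rate above is only one possible reading.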