{"title":"Mutual Reinforcement Learning with Heterogenous Agents","authors":"Cameron Reid, S. Mukhopadhyay","doi":"10.1109/SMARTCOMP52413.2021.00081","DOIUrl":null,"url":null,"abstract":"Mutual learning is an emerging technique for allowing intelligent systems to learn from each other, giving rise to improved performance. In this paper, we explore mutual reinforcement learning between systems which use very different learning algorithms. In particular, we present an algorithm which allows two agents, one using Q-learning and another using adaptive dynamic programming, to share learned knowledge. We discuss how these agents negotiate the relative importance of knowledge they receive from other agents, and we present results that show how this affects the learning process.","PeriodicalId":330785,"journal":{"name":"2021 IEEE International Conference on Smart Computing (SMARTCOMP)","volume":"18 ","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 IEEE International Conference on Smart Computing (SMARTCOMP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/SMARTCOMP52413.2021.00081","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Mutual learning is an emerging technique for allowing intelligent systems to learn from each other, giving rise to improved performance. In this paper, we explore mutual reinforcement learning between systems which use very different learning algorithms. In particular, we present an algorithm which allows two agents, one using Q-learning and another using adaptive dynamic programming, to share learned knowledge. We discuss how these agents negotiate the relative importance of knowledge they receive from other agents, and we present results that show how this affects the learning process.