基于频率与利润成比例投票的BRT代理推箱问题的共识构建

J. Robotics Mechatronics Pub Date : 2023-08-20 DOI:10.20965/jrm.2023.p1016

M. Kubo, Hiroshi Sato, A. Yamaguchi

{"title":"基于频率与利润成比例投票的BRT代理推箱问题的共识构建","authors":"M. Kubo, Hiroshi Sato, A. Yamaguchi","doi":"10.20965/jrm.2023.p1016","DOIUrl":null,"url":null,"abstract":"In this study, we added voting behavior in which voting proportionately reflects the value of a view (option, opinion, and so on) to the BRT agent. BRT agent is a consensus-building model of the decision-making process among a group of human, and is a framework that allows the expression of the collective behavior while maintaining dispersiveness, although it has been noted that it is unable to reach consensus by making use of experience. To resolve this issue, we propose the incorporation of a mechanism of voting at frequencies proportional to the value estimated using reinforcement learning. We conducted a series of computer-based experiments using the box-pushing problem and verified that the proposed method reached a consensus to arrive at solutions based on experience.","PeriodicalId":178614,"journal":{"name":"J. Robotics Mechatronics","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-08-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Consensus Building in Box-Pushing Problem by BRT Agent that Votes with Frequency Proportional to Profit\",\"authors\":\"M. Kubo, Hiroshi Sato, A. Yamaguchi\",\"doi\":\"10.20965/jrm.2023.p1016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this study, we added voting behavior in which voting proportionately reflects the value of a view (option, opinion, and so on) to the BRT agent. BRT agent is a consensus-building model of the decision-making process among a group of human, and is a framework that allows the expression of the collective behavior while maintaining dispersiveness, although it has been noted that it is unable to reach consensus by making use of experience. To resolve this issue, we propose the incorporation of a mechanism of voting at frequencies proportional to the value estimated using reinforcement learning. We conducted a series of computer-based experiments using the box-pushing problem and verified that the proposed method reached a consensus to arrive at solutions based on experience.\",\"PeriodicalId\":178614,\"journal\":{\"name\":\"J. Robotics Mechatronics\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-08-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"J. Robotics Mechatronics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.20965/jrm.2023.p1016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"J. Robotics Mechatronics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.20965/jrm.2023.p1016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

在这项研究中，我们增加了投票行为，其中投票按比例反映了对BRT代理的看法(选项，意见等)的价值。BRT agent是一种人类群体决策过程的共识构建模型，是一种允许在保持分散性的同时表达集体行为的框架，尽管已经注意到它无法利用经验达成共识。为了解决这个问题，我们建议结合一种机制，以与使用强化学习估计的值成比例的频率进行投票。我们利用推盒问题进行了一系列基于计算机的实验，并验证了所提出的方法在基于经验的解决方案上达成了共识。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Consensus Building in Box-Pushing Problem by BRT Agent that Votes with Frequency Proportional to Profit

In this study, we added voting behavior in which voting proportionately reflects the value of a view (option, opinion, and so on) to the BRT agent. BRT agent is a consensus-building model of the decision-making process among a group of human, and is a framework that allows the expression of the collective behavior while maintaining dispersiveness, although it has been noted that it is unable to reach consensus by making use of experience. To resolve this issue, we propose the incorporation of a mechanism of voting at frequencies proportional to the value estimated using reinforcement learning. We conducted a series of computer-based experiments using the box-pushing problem and verified that the proposed method reached a consensus to arrive at solutions based on experience.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

J. Robotics Mechatronics

自引率

0.00%

发文量