Collective preference learning in the best-of-n problem

IF 1.9 4区计算机科学 Q3 COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE

Swarm Intelligence Pub Date : 2021-06-02 DOI:10.1007/s11721-021-00191-9

Michael Crosscombe, Jonathan Lawry

{"title":"Collective preference learning in the best-of-n problem","authors":"Michael Crosscombe, Jonathan Lawry","doi":"10.1007/s11721-021-00191-9","DOIUrl":null,"url":null,"abstract":"Decentralised autonomous systems rely on distributed learning to make decisions and to collaborate in pursuit of a shared objective. For example, in swarm robotics the best-of-n problem is a well-known collective decision-making problem in which agents attempt to learn the best option out of n possible alternatives based on local feedback from the environment. This typically involves gathering information about all n alternatives while then systematically discarding information about all but the best option. However, for applications such as search and rescue in which learning the ranking of options is useful or crucial, best-of-n decision-making can be wasteful and costly. Instead, we investigate a more general distributed learning process in which agents learn a preference ordering over all of the n options. More specifically, we introduce a distributed rank learning algorithm based on three-valued logic. We then use agent-based simulation experiments to demonstrate the effectiveness of this model. In this context, we show that a population of agents are able to learn a total ordering over the n options and furthermore the learning process is robust to evidential noise. To demonstrate the practicality of our model, we restrict the communication bandwidth between the agents and show that this model is also robust to limited communications whilst outperforming a comparable probabilistic model under the same communication conditions.","PeriodicalId":51284,"journal":{"name":"Swarm Intelligence","volume":"326 ","pages":""},"PeriodicalIF":1.9000,"publicationDate":"2021-06-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Swarm Intelligence","FirstCategoryId":"94","ListUrlMain":"https://doi.org/10.1007/s11721-021-00191-9","RegionNum":4,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}

引用次数: 5

Abstract

Decentralised autonomous systems rely on distributed learning to make decisions and to collaborate in pursuit of a shared objective. For example, in swarm robotics the best-of-n problem is a well-known collective decision-making problem in which agents attempt to learn the best option out of n possible alternatives based on local feedback from the environment. This typically involves gathering information about all n alternatives while then systematically discarding information about all but the best option. However, for applications such as search and rescue in which learning the ranking of options is useful or crucial, best-of-n decision-making can be wasteful and costly. Instead, we investigate a more general distributed learning process in which agents learn a preference ordering over all of the n options. More specifically, we introduce a distributed rank learning algorithm based on three-valued logic. We then use agent-based simulation experiments to demonstrate the effectiveness of this model. In this context, we show that a population of agents are able to learn a total ordering over the n options and furthermore the learning process is robust to evidential noise. To demonstrate the practicality of our model, we restrict the communication bandwidth between the agents and show that this model is also robust to limited communications whilst outperforming a comparable probabilistic model under the same communication conditions.

查看原文本刊更多论文

最优化问题中的集体偏好学习

分散的自治系统依靠分布式学习来做出决策，并在追求共同目标的过程中进行协作。例如，在群体机器人中，n个最优问题是一个众所周知的集体决策问题，在这个问题中，智能体试图根据环境的本地反馈从n个可能的替代方案中学习最佳选择。这通常包括收集关于所有n个选择的信息，然后系统地丢弃除了最佳选择之外的所有信息。然而，对于搜索和救援等应用来说，学习选项的排序是有用的或至关重要的，最佳决策可能是浪费和昂贵的。相反，我们研究了一个更一般的分布式学习过程，其中智能体学习所有n个选项的偏好顺序。更具体地说，我们介绍了一种基于三值逻辑的分布式排名学习算法。然后，我们使用基于智能体的仿真实验来证明该模型的有效性。在这种情况下，我们证明了智能体群体能够学习n个选项的总排序，并且学习过程对证据噪声具有鲁棒性。为了证明我们模型的实用性，我们限制了代理之间的通信带宽，并表明该模型对有限通信也具有鲁棒性，同时在相同的通信条件下优于可比的概率模型。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Swarm Intelligence COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE-ROBOTICS

CiteScore

5.70

自引率

11.50%

发文量

审稿时长

>12 weeks

期刊介绍： Swarm Intelligence is the principal peer-reviewed publication dedicated to reporting on research and developments in the multidisciplinary field of swarm intelligence. The journal publishes original research articles and occasional review articles on theoretical, experimental and/or practical aspects of swarm intelligence. All articles are published both in print and in electronic form. There are no page charges for publication. Swarm Intelligence is published quarterly. The field of swarm intelligence deals with systems composed of many individuals that coordinate using decentralized control and self-organization. In particular, it focuses on the collective behaviors that result from the local interactions of the individuals with each other and with their environment. It is a fast-growing field that encompasses the efforts of researchers in multiple disciplines, ranging from ethology and social science to operations research and computer engineering. Swarm Intelligence will report on advances in the understanding and utilization of swarm intelligence systems, that is, systems that are based on the principles of swarm intelligence. The following subjects are of particular interest to the journal: • modeling and analysis of collective biological systems such as social insect colonies, flocking vertebrates, and human crowds as well as any other swarm intelligence systems; • application of biological swarm intelligence models to real-world problems such as distributed computing, data clustering, graph partitioning, optimization and decision making; • theoretical and empirical research in ant colony optimization, particle swarm optimization, swarm robotics, and other swarm intelligence algorithms.