Multi-Agent Data Collection in Non-Stationary Environments

2022 IEEE 23rd International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM) Pub Date : 2022-06-01 DOI:10.1109/WoWMoM54355.2022.00023

N. Nguyen, D. Nguyen, Junae Kim, G. Rizzo, H. Nguyen

{"title":"Multi-Agent Data Collection in Non-Stationary Environments","authors":"N. Nguyen, D. Nguyen, Junae Kim, G. Rizzo, H. Nguyen","doi":"10.1109/WoWMoM54355.2022.00023","DOIUrl":null,"url":null,"abstract":"Coordinated multi-robot systems are an effective way to harvest data from sensor networks and to implement active perception strategies. However, achieving efficient coordination in a way which guarantees a target QoS while adapting dynamically to changes (in the environment, due to sensors’ mobility, and/or in the value of harvested data) is to date a key open issue. In this paper, we propose a novel decentralized Monte Carlo Tree Search algorithm (MCTS) which allows agents to optimize their own actions while achieving some form of coordination, in a changing environment. Its key underlying idea is to balance in an adaptive manner the exploration-exploitation trade-off to deal effectively with abrupt changes caused by the environment and random changes caused by other agents’ actions. Critically, outdated and irrelevant samples - an inherent and prevalent feature in all multi-agent MCTS-based algorithms - are filtered out by means of a sliding window mechanism. We show both theoretically and through simulations that our algorithm provides a log-factor (in terms of time steps) smaller regret than state-of-the-art decentralized multi-agent planning methods. We instantiate our approach on the problem of underwater data collection, showing on a set of different models for changes that our approach greatly outperforms the best available algorithms for that setting, both in terms of convergence speed and of global utility.","PeriodicalId":275324,"journal":{"name":"2022 IEEE 23rd International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM)","volume":"12 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 23rd International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WoWMoM54355.2022.00023","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Coordinated multi-robot systems are an effective way to harvest data from sensor networks and to implement active perception strategies. However, achieving efficient coordination in a way which guarantees a target QoS while adapting dynamically to changes (in the environment, due to sensors’ mobility, and/or in the value of harvested data) is to date a key open issue. In this paper, we propose a novel decentralized Monte Carlo Tree Search algorithm (MCTS) which allows agents to optimize their own actions while achieving some form of coordination, in a changing environment. Its key underlying idea is to balance in an adaptive manner the exploration-exploitation trade-off to deal effectively with abrupt changes caused by the environment and random changes caused by other agents’ actions. Critically, outdated and irrelevant samples - an inherent and prevalent feature in all multi-agent MCTS-based algorithms - are filtered out by means of a sliding window mechanism. We show both theoretically and through simulations that our algorithm provides a log-factor (in terms of time steps) smaller regret than state-of-the-art decentralized multi-agent planning methods. We instantiate our approach on the problem of underwater data collection, showing on a set of different models for changes that our approach greatly outperforms the best available algorithms for that setting, both in terms of convergence speed and of global utility.

查看原文本刊更多论文

非平稳环境下的多智能体数据采集

协同多机器人系统是从传感器网络中获取数据和实施主动感知策略的有效途径。然而，在动态适应变化(在环境中，由于传感器的移动性，和/或收集的数据的价值)的同时，以保证目标QoS的方式实现有效的协调是迄今为止一个关键的开放问题。在本文中，我们提出了一种新的分散式蒙特卡罗树搜索算法(MCTS)，该算法允许智能体在不断变化的环境中优化自己的行为，同时实现某种形式的协调。它的核心思想是以一种适应性的方式平衡探索与开发之间的权衡，以有效地处理由环境引起的突变和由其他主体行为引起的随机变化。关键是，过时和不相关的样本-所有基于多智能体mcts的算法中固有和普遍的特征-通过滑动窗口机制过滤掉。我们从理论上和通过模拟表明，我们的算法比最先进的分散多智能体规划方法提供了一个对数因子(就时间步长而言)更小的遗憾。我们在水下数据收集问题上实例化了我们的方法，在一组不同的变化模型上显示，我们的方法在收敛速度和全局效用方面都大大优于该设置的最佳可用算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2022 IEEE 23rd International Symposium on a World of Wireless, Mobile and Multimedia Networks (WoWMoM)

自引率

0.00%

发文量