基于图神经网络的异构机器人群分离学习分散控制器

2022 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS) Pub Date : 2022-07-25 DOI:10.1109/MARSS55884.2022.9870482

Oyindamola Omotuyi, Manish Kumar

{"title":"基于图神经网络的异构机器人群分离学习分散控制器","authors":"Oyindamola Omotuyi, Manish Kumar","doi":"10.1109/MARSS55884.2022.9870482","DOIUrl":null,"url":null,"abstract":"In this paper, we studied the problem of finding decentralized controllers for large-scale heterogeneous robot swarms exhibiting segregative behaviors. As seen in nature, Segregative behaviors involve sorting a group of robots into groups based on their type. Our approach involves learning controllers that utilize local information at test time by imitating the policy of a centralized controller based on a differential potential concept at training time. We parameterized our policy using a time-varying aggregation graph neural network with multi-hop communication. This incorporates information not only from immediate neighbors but distant neighbors. We showed that our controller outperformed a local controller that considers only immediate neighbors and achieved similar performance to the centralized controller through varied experiments. In addition, we demonstrated the scalability of our method by exploring larger swarms and different groups.","PeriodicalId":144730,"journal":{"name":"2022 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning Decentralized Controllers for Segregation of Heterogeneous Robot Swarms with Graph Neural Networks\",\"authors\":\"Oyindamola Omotuyi, Manish Kumar\",\"doi\":\"10.1109/MARSS55884.2022.9870482\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we studied the problem of finding decentralized controllers for large-scale heterogeneous robot swarms exhibiting segregative behaviors. As seen in nature, Segregative behaviors involve sorting a group of robots into groups based on their type. Our approach involves learning controllers that utilize local information at test time by imitating the policy of a centralized controller based on a differential potential concept at training time. We parameterized our policy using a time-varying aggregation graph neural network with multi-hop communication. This incorporates information not only from immediate neighbors but distant neighbors. We showed that our controller outperformed a local controller that considers only immediate neighbors and achieved similar performance to the centralized controller through varied experiments. In addition, we demonstrated the scalability of our method by exploring larger swarms and different groups.\",\"PeriodicalId\":144730,\"journal\":{\"name\":\"2022 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS)\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MARSS55884.2022.9870482\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MARSS55884.2022.9870482","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

本文研究了具有隔离行为的大型异构机器人群体的分散控制器问题。正如在自然界中看到的那样，隔离行为包括根据机器人的类型将一组机器人分成不同的组。我们的方法包括通过模仿训练时基于微分电位概念的集中式控制器的策略，在测试时利用本地信息来学习控制器。我们使用具有多跳通信的时变聚合图神经网络来参数化策略。这不仅包括来自近邻的信息，也包括来自远方邻居的信息。我们通过各种实验表明，我们的控制器优于只考虑近邻的局部控制器，并达到与集中控制器相似的性能。此外，我们通过探索更大的群体和不同的群体来证明我们的方法的可扩展性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Learning Decentralized Controllers for Segregation of Heterogeneous Robot Swarms with Graph Neural Networks

In this paper, we studied the problem of finding decentralized controllers for large-scale heterogeneous robot swarms exhibiting segregative behaviors. As seen in nature, Segregative behaviors involve sorting a group of robots into groups based on their type. Our approach involves learning controllers that utilize local information at test time by imitating the policy of a centralized controller based on a differential potential concept at training time. We parameterized our policy using a time-varying aggregation graph neural network with multi-hop communication. This incorporates information not only from immediate neighbors but distant neighbors. We showed that our controller outperformed a local controller that considers only immediate neighbors and achieved similar performance to the centralized controller through varied experiments. In addition, we demonstrated the scalability of our method by exploring larger swarms and different groups.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS)

自引率

0.00%

发文量