{"title":"基于图神经网络的异构机器人群分离学习分散控制器","authors":"Oyindamola Omotuyi, Manish Kumar","doi":"10.1109/MARSS55884.2022.9870482","DOIUrl":null,"url":null,"abstract":"In this paper, we studied the problem of finding decentralized controllers for large-scale heterogeneous robot swarms exhibiting segregative behaviors. As seen in nature, Segregative behaviors involve sorting a group of robots into groups based on their type. Our approach involves learning controllers that utilize local information at test time by imitating the policy of a centralized controller based on a differential potential concept at training time. We parameterized our policy using a time-varying aggregation graph neural network with multi-hop communication. This incorporates information not only from immediate neighbors but distant neighbors. We showed that our controller outperformed a local controller that considers only immediate neighbors and achieved similar performance to the centralized controller through varied experiments. In addition, we demonstrated the scalability of our method by exploring larger swarms and different groups.","PeriodicalId":144730,"journal":{"name":"2022 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS)","volume":"47 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Learning Decentralized Controllers for Segregation of Heterogeneous Robot Swarms with Graph Neural Networks\",\"authors\":\"Oyindamola Omotuyi, Manish Kumar\",\"doi\":\"10.1109/MARSS55884.2022.9870482\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we studied the problem of finding decentralized controllers for large-scale heterogeneous robot swarms exhibiting segregative behaviors. As seen in nature, Segregative behaviors involve sorting a group of robots into groups based on their type. Our approach involves learning controllers that utilize local information at test time by imitating the policy of a centralized controller based on a differential potential concept at training time. We parameterized our policy using a time-varying aggregation graph neural network with multi-hop communication. This incorporates information not only from immediate neighbors but distant neighbors. We showed that our controller outperformed a local controller that considers only immediate neighbors and achieved similar performance to the centralized controller through varied experiments. In addition, we demonstrated the scalability of our method by exploring larger swarms and different groups.\",\"PeriodicalId\":144730,\"journal\":{\"name\":\"2022 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS)\",\"volume\":\"47 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MARSS55884.2022.9870482\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 International Conference on Manipulation, Automation and Robotics at Small Scales (MARSS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MARSS55884.2022.9870482","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Learning Decentralized Controllers for Segregation of Heterogeneous Robot Swarms with Graph Neural Networks
In this paper, we studied the problem of finding decentralized controllers for large-scale heterogeneous robot swarms exhibiting segregative behaviors. As seen in nature, Segregative behaviors involve sorting a group of robots into groups based on their type. Our approach involves learning controllers that utilize local information at test time by imitating the policy of a centralized controller based on a differential potential concept at training time. We parameterized our policy using a time-varying aggregation graph neural network with multi-hop communication. This incorporates information not only from immediate neighbors but distant neighbors. We showed that our controller outperformed a local controller that considers only immediate neighbors and achieved similar performance to the centralized controller through varied experiments. In addition, we demonstrated the scalability of our method by exploring larger swarms and different groups.