Anna Guerra, Francesco Guidi, D. Dardari, P. Djurić
{"title":"基于多智能体q学习的无人机网络目标检测与室内测绘","authors":"Anna Guerra, Francesco Guidi, D. Dardari, P. Djurić","doi":"10.1109/BalkanCom53780.2021.9593232","DOIUrl":null,"url":null,"abstract":"We consider a network of unmanned aerial vehicles (UAVs) for a search-and-rescue operations involving both detection of multiple targets and mapping of environment, where the learning time is limited. One possibility for accomplishing the goal while guaranteeing short learning time is to employ cooperation among UAVs. With this objective, we adopt a multi-agent Q-learning algorithm that allows the UAVs to learn a suitable navigation policy in real-time in order to complete a mission within a fixed time frame. The obtained results demonstrate that proper combination of the information gathered by the UAVs allows for an accelerated learning process.","PeriodicalId":115090,"journal":{"name":"2021 International Balkan Conference on Communications and Networking (BalkanCom)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-09-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"6","resultStr":"{\"title\":\"Multi-Agent Q-Learning in UAV Networks for Target Detection and Indoor Mapping\",\"authors\":\"Anna Guerra, Francesco Guidi, D. Dardari, P. Djurić\",\"doi\":\"10.1109/BalkanCom53780.2021.9593232\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We consider a network of unmanned aerial vehicles (UAVs) for a search-and-rescue operations involving both detection of multiple targets and mapping of environment, where the learning time is limited. One possibility for accomplishing the goal while guaranteeing short learning time is to employ cooperation among UAVs. With this objective, we adopt a multi-agent Q-learning algorithm that allows the UAVs to learn a suitable navigation policy in real-time in order to complete a mission within a fixed time frame. The obtained results demonstrate that proper combination of the information gathered by the UAVs allows for an accelerated learning process.\",\"PeriodicalId\":115090,\"journal\":{\"name\":\"2021 International Balkan Conference on Communications and Networking (BalkanCom)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-09-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"6\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Balkan Conference on Communications and Networking (BalkanCom)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BalkanCom53780.2021.9593232\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Balkan Conference on Communications and Networking (BalkanCom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BalkanCom53780.2021.9593232","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Multi-Agent Q-Learning in UAV Networks for Target Detection and Indoor Mapping
We consider a network of unmanned aerial vehicles (UAVs) for a search-and-rescue operations involving both detection of multiple targets and mapping of environment, where the learning time is limited. One possibility for accomplishing the goal while guaranteeing short learning time is to employ cooperation among UAVs. With this objective, we adopt a multi-agent Q-learning algorithm that allows the UAVs to learn a suitable navigation policy in real-time in order to complete a mission within a fixed time frame. The obtained results demonstrate that proper combination of the information gathered by the UAVs allows for an accelerated learning process.