{"title":"Swarm AGV Optimization Using Deep Reinforcement Learning","authors":"Pilar Arques-Corrales, F. A. Gregori","doi":"10.1145/3426826.3426839","DOIUrl":null,"url":null,"abstract":"Behavior design for Automated Guided Vehicles (AGV) systems is an active research area, fundamental for robotics, industrial systems automation. The rise of machine learning neural systems and deep learning make promising results in a multitude of areas including warehouse environments.In this paper, several different policies will be obtained by using reinforcement learning on a heterogeneous swarm robotic system, applied for solving logistical tasks in Automated Guided Vehicles. More specifically, two different types of agents will be used: the vehicles that collect, transport and deposit their package and the traffic lights that regulate the number of vehicles that circulate on the tracks. The main objective of our work is to learn simultaneously two different control policies, one for each kind of agent.The obtained policies have shown their ability to correctly learn the package transport behavior in addition to balance traffic flow to facilitate agent mobility and avoid collisions. Furthermore, the scalability of the system and the behavior performance for different number of vehicles has been shown.","PeriodicalId":90643,"journal":{"name":"Machine learning for multimodal interaction : ... international workshop, MLMI ... : revised selected papers. Workshop on Machine Learning for Multimodal Interaction","volume":"21 1","pages":"65-69"},"PeriodicalIF":0.0000,"publicationDate":"2020-09-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Machine learning for multimodal interaction : ... international workshop, MLMI ... : revised selected papers. Workshop on Machine Learning for Multimodal Interaction","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3426826.3426839","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Behavior design for Automated Guided Vehicles (AGV) systems is an active research area, fundamental for robotics, industrial systems automation. The rise of machine learning neural systems and deep learning make promising results in a multitude of areas including warehouse environments.In this paper, several different policies will be obtained by using reinforcement learning on a heterogeneous swarm robotic system, applied for solving logistical tasks in Automated Guided Vehicles. More specifically, two different types of agents will be used: the vehicles that collect, transport and deposit their package and the traffic lights that regulate the number of vehicles that circulate on the tracks. The main objective of our work is to learn simultaneously two different control policies, one for each kind of agent.The obtained policies have shown their ability to correctly learn the package transport behavior in addition to balance traffic flow to facilitate agent mobility and avoid collisions. Furthermore, the scalability of the system and the behavior performance for different number of vehicles has been shown.