Hierarchically Structured Scheduling and Execution of Tasks in a Multi-Agent Environment

Portuguese Conference on Artificial Intelligence Pub Date : 2022-03-06 DOI:10.48550/arXiv.2203.03021

Diogo S. Carvalho, B. Sengupta

{"title":"Hierarchically Structured Scheduling and Execution of Tasks in a Multi-Agent Environment","authors":"Diogo S. Carvalho, B. Sengupta","doi":"10.48550/arXiv.2203.03021","DOIUrl":null,"url":null,"abstract":"In a warehouse environment, tasks appear dynamically. Consequently, a task management system that matches them with the workforce too early (e.g., weeks in advance) is necessarily sub-optimal. Also, the rapidly increasing size of the action space of such a system consists of a significant problem for traditional schedulers. Reinforcement learning, however, is suited to deal with issues requiring making sequential decisions towards a long-term, often remote, goal. In this work, we set ourselves on a problem that presents itself with a hierarchical structure: the task-scheduling, by a centralised agent, in a dynamic warehouse multi-agent environment and the execution of one such schedule, by decentralised agents with only partial observability thereof. We propose to use deep reinforcement learning to solve both the high-level scheduling problem and the low-level multi-agent problem of schedule execution. Finally, we also conceive the case where centralisation is impossible at test time and workers must learn how to cooperate in executing the tasks in an environment with no schedule and only partial observability.","PeriodicalId":382392,"journal":{"name":"Portuguese Conference on Artificial Intelligence","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-03-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Portuguese Conference on Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2203.03021","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 1

Abstract

In a warehouse environment, tasks appear dynamically. Consequently, a task management system that matches them with the workforce too early (e.g., weeks in advance) is necessarily sub-optimal. Also, the rapidly increasing size of the action space of such a system consists of a significant problem for traditional schedulers. Reinforcement learning, however, is suited to deal with issues requiring making sequential decisions towards a long-term, often remote, goal. In this work, we set ourselves on a problem that presents itself with a hierarchical structure: the task-scheduling, by a centralised agent, in a dynamic warehouse multi-agent environment and the execution of one such schedule, by decentralised agents with only partial observability thereof. We propose to use deep reinforcement learning to solve both the high-level scheduling problem and the low-level multi-agent problem of schedule execution. Finally, we also conceive the case where centralisation is impossible at test time and workers must learn how to cooperate in executing the tasks in an environment with no schedule and only partial observability.

查看原文本刊更多论文

多智能体环境中分层结构的任务调度与执行

在仓库环境中，任务是动态出现的。因此，过早地(例如，提前几周)将他们与劳动力相匹配的任务管理系统必然是次优的。此外，这种系统的动作空间的快速增长构成了传统调度器的一个重大问题。然而，强化学习适合于处理需要对长期(通常是远程)目标做出连续决策的问题。在这项工作中，我们将自己置于一个具有分层结构的问题上:在动态仓库多代理环境中，由集中式代理执行任务调度，以及由只有部分可观察性的分散代理执行这样的调度。我们建议使用深度强化学习来解决高级调度问题和低级调度执行的多智能体问题。最后，我们还设想了在测试时不可能集中的情况，工人必须学习如何在没有时间表和只有部分可观察性的环境中合作执行任务。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Portuguese Conference on Artificial Intelligence

自引率

0.00%

发文量