{"title":"AgentMario: A Multitask Agent for Robotic Interaction With Locker Systems","authors":"Haimo Zhang;Ting Lyu;Hong Li;Yishan Liu;Zibo Gao;Yan Yu;Can Wang;Lindsay Wang;Yuejia Zhang;Kunlun He;Kaigui Bian","doi":"10.1109/JIOT.2024.3470835","DOIUrl":null,"url":null,"abstract":"A robotic locker system is needed where automated storage and retrieval of items are required without the need for staff presence. For example, a robot can provide 7/24 available services of medical items pick-up and return, during the COVID-19 pandemic (or under other emergencies). A robotic locker system is usually equipped with a user-friendly intuitive interface (e.g., a touchscreen); meanwhile, the robot desires a multitask agent that can observe, understand, and operate the locker’s interface to complete many tasks of storing/accessing/shipping items. In this article, we study building a multitask agent for interacting with robotic locker systems, called AgentMario. Without human intervention for a specific task, AgentMario decomposes solving a task into learning basic skills (states or user interfaces) and planning over the skills (finding the next state/interface). When the agent is solving a task, our search algorithm walks on the finite state machine graph and generates the proper plans (operation sequence) for the agent. In experiments, our method accomplishes four diverse tasks of picking-up/storing/dropping-off/shipping items. By employing image recognition and mechanical automation technologies, we implement AgentMario with a robot arm to enable contactless operation over the locker’s interface. Experimental results show that our method outperforms baselines in most tasks by a large margin.","PeriodicalId":54347,"journal":{"name":"IEEE Internet of Things Journal","volume":"12 3","pages":"2473-2485"},"PeriodicalIF":8.9000,"publicationDate":"2025-01-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Internet of Things Journal","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10852197/","RegionNum":1,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
A robotic locker system is needed where automated storage and retrieval of items are required without the need for staff presence. For example, a robot can provide 7/24 available services of medical items pick-up and return, during the COVID-19 pandemic (or under other emergencies). A robotic locker system is usually equipped with a user-friendly intuitive interface (e.g., a touchscreen); meanwhile, the robot desires a multitask agent that can observe, understand, and operate the locker’s interface to complete many tasks of storing/accessing/shipping items. In this article, we study building a multitask agent for interacting with robotic locker systems, called AgentMario. Without human intervention for a specific task, AgentMario decomposes solving a task into learning basic skills (states or user interfaces) and planning over the skills (finding the next state/interface). When the agent is solving a task, our search algorithm walks on the finite state machine graph and generates the proper plans (operation sequence) for the agent. In experiments, our method accomplishes four diverse tasks of picking-up/storing/dropping-off/shipping items. By employing image recognition and mechanical automation technologies, we implement AgentMario with a robot arm to enable contactless operation over the locker’s interface. Experimental results show that our method outperforms baselines in most tasks by a large margin.
期刊介绍:
The EEE Internet of Things (IoT) Journal publishes articles and review articles covering various aspects of IoT, including IoT system architecture, IoT enabling technologies, IoT communication and networking protocols such as network coding, and IoT services and applications. Topics encompass IoT's impacts on sensor technologies, big data management, and future internet design for applications like smart cities and smart homes. Fields of interest include IoT architecture such as things-centric, data-centric, service-oriented IoT architecture; IoT enabling technologies and systematic integration such as sensor technologies, big sensor data management, and future Internet design for IoT; IoT services, applications, and test-beds such as IoT service middleware, IoT application programming interface (API), IoT application design, and IoT trials/experiments; IoT standardization activities and technology development in different standard development organizations (SDO) such as IEEE, IETF, ITU, 3GPP, ETSI, etc.