A Cloud-Based Robust Semaphore Mirroring System for Social Robots

Nan Tian, Benjamin Kuo, X. Ren, Michael Yu, Robert Zhang, Bill Huang, Ken Goldberg, S. Sojoudi
{"title":"A Cloud-Based Robust Semaphore Mirroring System for Social Robots","authors":"Nan Tian, Benjamin Kuo, X. Ren, Michael Yu, Robert Zhang, Bill Huang, Ken Goldberg, S. Sojoudi","doi":"10.1109/COASE.2018.8560553","DOIUrl":null,"url":null,"abstract":"We present a cloud-based human-robot interaction system that automatically controls a humanoid robot to mirror a human demonstrator performing flag semaphores. We use a cloud-based framework called Human Augmented Robotic Intelligence (HARI) to perform gesture recognition of the human demonstrator and gesture control of a local humanoid robot, named Pepper. To ensure that the system is real-time, we design a system to maximize cloud computation contribution to the deep-neural-network-based gesture recognition system, OpenPose, and to minimize communication costs between the cloud and the robot. A hybrid control system is used to hide latency caused by either routing or physical distances. We conducted real-time semaphore mirroring experiments in which both the robots and the demonstrator were located in Tokyo, Japan, whereas the cloud server was deployed in the United States. The total latency was 400ms for the video streaming to the cloud and 108ms for the robot commanding from the cloud. Further, we measured the reliability of our gesture-based semaphore recognition system with two human subjects, and were able to achieve 90% and 76.7% recognition accuracy, respectively, for the two subjects with open-loop when the subjects were not allowed to see the recognition results. We could achieve 100% recognition accuracy when both subjects were allowed to adapt to the recognition system under a closed-loop setting. Lastly, we showed that we can support two humanoid robots with a single server at the same time. 
With this real-time cloud-based HRI system, we illustrate that we can deploy gesture-based human-robot globally and at scale.","PeriodicalId":6518,"journal":{"name":"2018 IEEE 14th International Conference on Automation Science and Engineering (CASE)","volume":"5 1","pages":"1351-1358"},"PeriodicalIF":0.0000,"publicationDate":"2018-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE 14th International Conference on Automation Science and Engineering (CASE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/COASE.2018.8560553","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11

Abstract

We present a cloud-based human-robot interaction system that automatically controls a humanoid robot to mirror a human demonstrator performing flag semaphores. We use a cloud-based framework called Human Augmented Robotic Intelligence (HARI) to perform gesture recognition of the human demonstrator and gesture control of a local humanoid robot named Pepper. To keep the system real-time, we design it to maximize the cloud's computational contribution to the deep-neural-network-based gesture recognition system, OpenPose, and to minimize communication costs between the cloud and the robot. A hybrid control system is used to hide latency caused by routing or physical distance. We conducted real-time semaphore mirroring experiments in which both the robots and the demonstrator were located in Tokyo, Japan, while the cloud server was deployed in the United States. The total latency was 400 ms for video streaming to the cloud and 108 ms for robot commands from the cloud. Further, we measured the reliability of our gesture-based semaphore recognition system with two human subjects, achieving 90% and 76.7% recognition accuracy, respectively, in an open-loop setting in which the subjects could not see the recognition results. We achieved 100% recognition accuracy for both subjects in a closed-loop setting, in which they were allowed to adapt to the recognition system. Lastly, we showed that a single server can support two humanoid robots at the same time. With this real-time cloud-based HRI system, we illustrate that gesture-based human-robot interaction can be deployed globally and at scale.
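The recognition step the abstract describes — mapping pose keypoints to semaphore letters — can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: the shoulder/wrist keypoint inputs, the 45° quantization into eight arm positions, and the `LETTERS` table (a made-up placeholder subset, not the real flag-semaphore alphabet) are all assumptions for the sketch.

```python
import math

def arm_angle(shoulder, wrist):
    """Arm direction in degrees from 2D image keypoints (y grows downward).

    0 = arm hanging straight down, 90 = pointing in +x, 180 = straight up.
    """
    dx = wrist[0] - shoulder[0]
    dy = wrist[1] - shoulder[1]
    return math.degrees(math.atan2(dx, dy)) % 360

def quantize_45(angle):
    """Snap a continuous arm angle to the nearest of 8 semaphore positions."""
    return int(round(angle / 45.0)) % 8 * 45

# Placeholder subset for illustration only; a real system would map all
# left/right arm-position pairs to the full A-Z semaphore alphabet.
LETTERS = {
    (45, 0): "A",
    (90, 0): "B",
}

def classify(left_angle, right_angle):
    """Return the semaphore letter for a pair of arm angles, or None."""
    return LETTERS.get((quantize_45(left_angle), quantize_45(right_angle)))
```

In a setup like the one described, the quantization step also gives the closed-loop adaptation a natural interpretation: subjects only need to hold each arm within ±22.5° of a target position for the letter to register.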