基于递归神经网络的人机姿态分层映射

2020 Fourth IEEE International Conference on Robotic Computing (IRC) Pub Date : 2020-11-01 DOI:10.1109/IRC.2020.00016

Zainab Al-Qurashi, Brian D. Ziebart

{"title":"基于递归神经网络的人机姿态分层映射","authors":"Zainab Al-Qurashi, Brian D. Ziebart","doi":"10.1109/IRC.2020.00016","DOIUrl":null,"url":null,"abstract":"To perform many critical manipulation tasks successfully, human-robot mimicking systems should not only accurately copy the position of a human hand, but its orientation as well. Deep learning methods trained from pairs of corresponding human and robot poses offer one promising approach for constructing a human-robot mapping to accomplish this. However, ignoring the spatial and temporal structure of this mapping makes learning it less effective. We propose two different hierarchical architectures that leverage the structural and temporal human-robot mapping. We partially separate the robotic manipulator's end-effector position and orientation while considering the mutual coupling effects between them. This divides the main problem-making the robot match the human's hand position and mimic its orientation accurately along an unknown trajectory-into several sub-problems. We address these using different recurrent neural networks (RNNs) with Long-Short Term Memory (LSTM) that we combine and train hierarchically based on the coupling over the aspects of the robot that each controls. We evaluate our proposed architectures using a virtual reality system to track human table tennis motions and compare with single artificial neural network (ANN) and RNN models. We compare the benefits of using deep learning neural networks with and without our architectures and find smaller errors in position and orientation, along with increased flexibility in wrist movement are obtained by our proposed architectures.","PeriodicalId":232817,"journal":{"name":"2020 Fourth IEEE International Conference on Robotic Computing (IRC)","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Recurrent Neural Networks for Hierarchically Mapping Human-Robot Poses\",\"authors\":\"Zainab Al-Qurashi, Brian D. Ziebart\",\"doi\":\"10.1109/IRC.2020.00016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To perform many critical manipulation tasks successfully, human-robot mimicking systems should not only accurately copy the position of a human hand, but its orientation as well. Deep learning methods trained from pairs of corresponding human and robot poses offer one promising approach for constructing a human-robot mapping to accomplish this. However, ignoring the spatial and temporal structure of this mapping makes learning it less effective. We propose two different hierarchical architectures that leverage the structural and temporal human-robot mapping. We partially separate the robotic manipulator's end-effector position and orientation while considering the mutual coupling effects between them. This divides the main problem-making the robot match the human's hand position and mimic its orientation accurately along an unknown trajectory-into several sub-problems. We address these using different recurrent neural networks (RNNs) with Long-Short Term Memory (LSTM) that we combine and train hierarchically based on the coupling over the aspects of the robot that each controls. We evaluate our proposed architectures using a virtual reality system to track human table tennis motions and compare with single artificial neural network (ANN) and RNN models. We compare the benefits of using deep learning neural networks with and without our architectures and find smaller errors in position and orientation, along with increased flexibility in wrist movement are obtained by our proposed architectures.\",\"PeriodicalId\":232817,\"journal\":{\"name\":\"2020 Fourth IEEE International Conference on Robotic Computing (IRC)\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2020 Fourth IEEE International Conference on Robotic Computing (IRC)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IRC.2020.00016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 Fourth IEEE International Conference on Robotic Computing (IRC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IRC.2020.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

为了成功地完成许多关键的操作任务，人机模拟系统不仅要准确地复制人手的位置，而且要准确地复制人手的方向。从相应的人和机器人姿势对训练的深度学习方法为构建人-机器人映射提供了一种有希望的方法来实现这一目标。然而，忽略这种映射的空间和时间结构会降低学习的效率。我们提出了两种不同的层次结构，利用结构和时间的人-机器人映射。考虑到机器人末端执行器的相互耦合效应，将末端执行器的位置和姿态部分分离。这将主要问题——使机器人匹配人的手的位置并沿着未知的轨迹精确地模仿其方向——划分为几个子问题。我们使用具有长短期记忆(LSTM)的不同循环神经网络(rnn)来解决这些问题，我们基于每个控制的机器人各方面的耦合对其进行分层组合和训练。我们使用虚拟现实系统来评估我们提出的架构，以跟踪人类乒乓球运动，并与单个人工神经网络(ANN)和RNN模型进行比较。我们比较了在使用和不使用我们的架构时使用深度学习神经网络的好处，发现我们提出的架构在位置和方向上的误差更小，手腕运动的灵活性也更大。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Recurrent Neural Networks for Hierarchically Mapping Human-Robot Poses

To perform many critical manipulation tasks successfully, human-robot mimicking systems should not only accurately copy the position of a human hand, but its orientation as well. Deep learning methods trained from pairs of corresponding human and robot poses offer one promising approach for constructing a human-robot mapping to accomplish this. However, ignoring the spatial and temporal structure of this mapping makes learning it less effective. We propose two different hierarchical architectures that leverage the structural and temporal human-robot mapping. We partially separate the robotic manipulator's end-effector position and orientation while considering the mutual coupling effects between them. This divides the main problem-making the robot match the human's hand position and mimic its orientation accurately along an unknown trajectory-into several sub-problems. We address these using different recurrent neural networks (RNNs) with Long-Short Term Memory (LSTM) that we combine and train hierarchically based on the coupling over the aspects of the robot that each controls. We evaluate our proposed architectures using a virtual reality system to track human table tennis motions and compare with single artificial neural network (ANN) and RNN models. We compare the benefits of using deep learning neural networks with and without our architectures and find smaller errors in position and orientation, along with increased flexibility in wrist movement are obtained by our proposed architectures.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2020 Fourth IEEE International Conference on Robotic Computing (IRC)

自引率

0.00%

发文量