{"title":"Skeleton-based Multi-Feature Sharing Real-Time Action Recognition Network for Human-Robot Interaction","authors":"Zhiwen Deng, Qing Gao, Xiang Yu, Zhaojie Ju, Junkang Chen","doi":"10.1109/ICARM58088.2023.10218912","DOIUrl":null,"url":null,"abstract":"Human-Robot Interaction (HRI) is one of the directions that deserves to be studied, and it is used in various fields (e.g., emergency rescue, telemedicine, and astronaut assistance). Action-based HRI has been proven practical in many application scenarios (e.g., industrial teleoperation and telemedicine). Among these many applications, speed and accuracy are two common optimization goals for researchers. A skeleton-based multi-feature sharing real-time action recognition network (MSR-Net) has been proposed to ensure its high accuracy and speed. Three pairs of feature inputs are proposed to make the network input more informative, which contain: joint distance-joint distance motion (JD-JDM), angle-angle motion (A-AM), slow motion coordinates-fast motion coordinates (SMC-FMC). To ensure that features are fully extracted while reducing the number of model parameters, one-dimensional convolutional neural networks (1DCNN) are used to build multi-feature sharing two-stage feature extraction networks. As a result, MSR-Net outperforms state-of-the-art models in accuracy on the JHMDB (86.8%) and SHREC datasets (96.8% on coarse class and 93.8% on fine class). Application experiments were carried out on the HRI platform to demonstrate the effectiveness of MSR-Net in HRI. 
The video demonstration of the experimental results of the HRI application can be accessed on https://youtu.be/NzXrngh7BbQ.","PeriodicalId":220013,"journal":{"name":"2023 International Conference on Advanced Robotics and Mechatronics (ICARM)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 International Conference on Advanced Robotics and Mechatronics (ICARM)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICARM58088.2023.10218912","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Abstract
Human-Robot Interaction (HRI) is a research direction that merits study and is applied in various fields (e.g., emergency rescue, telemedicine, and astronaut assistance). Action-based HRI has proven practical in many application scenarios (e.g., industrial teleoperation and telemedicine). Across these applications, speed and accuracy are two common optimization goals. We propose a skeleton-based multi-feature sharing real-time action recognition network (MSR-Net) that achieves both high accuracy and high speed. To make the network input more informative, three pairs of feature inputs are proposed: joint distance and joint distance motion (JD-JDM), angle and angle motion (A-AM), and slow motion coordinates and fast motion coordinates (SMC-FMC). To extract features fully while reducing the number of model parameters, one-dimensional convolutional neural networks (1DCNN) are used to build a multi-feature sharing two-stage feature extraction network. As a result, MSR-Net outperforms state-of-the-art models in accuracy on the JHMDB dataset (86.8%) and the SHREC dataset (96.8% on the coarse class and 93.8% on the fine class). Application experiments on an HRI platform demonstrate the effectiveness of MSR-Net in HRI. A video demonstration of the HRI application experiments is available at https://youtu.be/NzXrngh7BbQ.
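The paper does not give formulas for its feature pairs, but skeleton-based networks of this kind typically derive them from raw joint coordinates: pairwise joint distances (JD), their frame-to-frame differences (JDM), bone angles at a joint (A) and their differences (AM), and coordinate displacements at small and large temporal strides (SMC/FMC). The sketch below is an assumption of how such inputs are commonly computed, not the authors' exact definitions; all function names are illustrative.

```python
import math
from itertools import combinations

def joint_distances(frame):
    """JD: pairwise Euclidean distances between all joints in one frame.
    `frame` is a list of (x, y[, z]) joint coordinates."""
    return [math.dist(a, b) for a, b in combinations(frame, 2)]

def temporal_diff(seq):
    """Frame-to-frame difference of a per-frame feature vector
    (turns JD into JDM, or angles into AM)."""
    return [[b - a for a, b in zip(prev, curr)]
            for prev, curr in zip(seq, seq[1:])]

def joint_angle(p, center, q):
    """A: angle (radians) at `center` between bones center->p and center->q."""
    v1 = [a - c for a, c in zip(p, center)]
    v2 = [b - c for b, c in zip(q, center)]
    dot = sum(x * y for x, y in zip(v1, v2))
    n1, n2 = math.hypot(*v1), math.hypot(*v2)
    return math.acos(max(-1.0, min(1.0, dot / (n1 * n2))))

def motion_coords(frames, stride):
    """Per-joint coordinate displacement at a temporal stride, flattened
    per frame pair. A small stride gives slow-motion coordinates (SMC),
    a larger stride fast-motion coordinates (FMC) -- an assumed convention."""
    return [[c2 - c1 for j1, j2 in zip(f1, f2) for c1, c2 in zip(j1, j2)]
            for f1, f2 in zip(frames, frames[stride:])]
```

Each feature stream produced this way is a fixed-length vector per frame (or frame pair), which is the shape a 1DCNN feature extractor operating along the time axis expects.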