Simultaneous temporal and spatial deep attention for imaged skeleton-based action recognition

Mohamed Lamine Rouali, Said Yacine Boulahia, Abdenour Amamra
{"title":"Simultaneous temporal and spatial deep attention for imaged skeleton-based action recognition","authors":"Mohamed Lamine Rouali, Said Yacine Boulahia, Abdenour Amamra","doi":"10.1145/3480651.3480668","DOIUrl":null,"url":null,"abstract":"The use of skeletons as a modality to represent and recognize human actions has gained interest thanks to the compactness of the data, their reliable representativeness in addition to their strong robustness. The deep learning based recognition approaches which are based on it often propose to improve the recognition pipeline by integrating the concept of attention in their modeling. The idea is to allow the model to focus on the relevant information of the action instead of attempting some kind of blind modeling. In this article, we propose an action recognition approach integrating simultaneously both spatial and temporal attentions. We first perform a transformation of the input sequence data into a color matrix, called imaged skeleton, comprising Cartesian and rotational information. Then, this new representation is given as input to an architecture composed of a main trunk, that allows features extraction and classification, and several attention branches. Different experimental evaluations on two popular benchmark databases, namely UT-Kinect [1] and SBU Kinect Interaction [2], are conducted to verify the interest of our proposed approach, where better performances are reported. Index: convolutional neural network, spatio-temporal, skeleton-based action recognition, deep attention.","PeriodicalId":305943,"journal":{"name":"Proceedings of the 2021 International Conference on Pattern Recognition and Intelligent Systems","volume":"81 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-07-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2021 International Conference on Pattern Recognition and Intelligent Systems","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3480651.3480668","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

The use of skeletons as a modality to represent and recognize human actions has gained interest thanks to the compactness of the data, their reliable representativeness in addition to their strong robustness. The deep learning based recognition approaches which are based on it often propose to improve the recognition pipeline by integrating the concept of attention in their modeling. The idea is to allow the model to focus on the relevant information of the action instead of attempting some kind of blind modeling. In this article, we propose an action recognition approach integrating simultaneously both spatial and temporal attentions. We first perform a transformation of the input sequence data into a color matrix, called imaged skeleton, comprising Cartesian and rotational information. Then, this new representation is given as input to an architecture composed of a main trunk, that allows features extraction and classification, and several attention branches. Different experimental evaluations on two popular benchmark databases, namely UT-Kinect [1] and SBU Kinect Interaction [2], are conducted to verify the interest of our proposed approach, where better performances are reported. Index: convolutional neural network, spatio-temporal, skeleton-based action recognition, deep attention.
基于骨骼图像动作识别的同时时间和空间深度注意
由于数据的紧凑性、可靠的代表性以及强大的鲁棒性,使用骨架作为一种模态来表示和识别人类行为已经引起了人们的兴趣。基于此的基于深度学习的识别方法通常提出通过在其建模中集成注意力的概念来改进识别管道。其理念是允许模型专注于动作的相关信息,而不是尝试某种盲目建模。在本文中,我们提出了一种同时整合空间和时间注意力的动作识别方法。我们首先将输入序列数据转换成一个颜色矩阵,称为图像骨架,包含笛卡尔和旋转信息。然后,将这个新的表示作为输入输入到一个由主干组成的体系结构中,主干允许特征提取和分类,以及几个注意分支。在两种流行的基准数据库,即UT-Kinect[1]和SBU Kinect Interaction[2]上进行了不同的实验评估,以验证我们提出的方法的兴趣,其中报告了更好的性能。索引:卷积神经网络,时空,基于骨架的动作识别,深度注意。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信