Pose-invariant kinematic features for action recognition

M. Ramanathan, W. Yau, E. Teoh, N. Magnenat-Thalmann
{"title":"Pose-invariant kinematic features for action recognition","authors":"M. Ramanathan, W. Yau, E. Teoh, N. Magnenat-Thalmann","doi":"10.1109/APSIPA.2017.8282038","DOIUrl":null,"url":null,"abstract":"Recognition of actions from videos is a difficult task due to several factors like dynamic backgrounds, occlusion, pose-variations observed. To tackle the pose variation problem, we propose a simple method based on a novel set of pose-invariant kinematic features which are encoded in a human body centric space. The proposed framework begins with detection of neck point, which will serve as a origin of body centric space. We propose a deep learning based classifier to detect neck point based on the output of fully connected network layer. With the help of the detected neck, propagation mechanism is proposed to divide the foreground region into head, torso and leg grids. The motion observed in each of these body part grids are represented using a set of pose-invariant kinematic features. These features represent motion of foreground or body region with respect to the detected neck point's motion and encoded based on view in a human body centric space. Based on these features, poseinvariant action recognition can be achieved. Due to the body centric space is used, non-upright human posture actions can also be handled easily. To test its effectiveness in non-upright human postures in actions, a new dataset is introduced with 8 non-upright actions performed by 35 subjects in 3 different views. Experiments have been conducted on benchmark and newly proposed non-upright action dataset to identify limitations and get insights on the proposed framework.","PeriodicalId":142091,"journal":{"name":"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-12-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/APSIPA.2017.8282038","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

Abstract

Recognizing actions from videos is a difficult task due to several factors such as dynamic backgrounds, occlusion, and pose variations. To tackle the pose-variation problem, we propose a simple method based on a novel set of pose-invariant kinematic features encoded in a human-body-centric space. The proposed framework begins with the detection of the neck point, which serves as the origin of the body-centric space. We propose a deep-learning-based classifier that detects the neck point from the output of a fully connected network layer. With the help of the detected neck point, a propagation mechanism is proposed to divide the foreground region into head, torso, and leg grids. The motion observed in each of these body-part grids is represented using a set of pose-invariant kinematic features. These features describe the motion of the foreground (body) region relative to the motion of the detected neck point, encoded according to the view in the body-centric space. Based on these features, pose-invariant action recognition can be achieved. Because a body-centric space is used, actions performed in non-upright postures can also be handled easily. To test the framework's effectiveness on non-upright postures, a new dataset is introduced containing 8 non-upright actions performed by 35 subjects from 3 different views. Experiments have been conducted on benchmark datasets and the newly proposed non-upright action dataset to identify limitations of the proposed framework and gain insight into its behavior.
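
The abstract only sketches how the kinematic features are built. As a rough illustration of the core idea (motion of each body-part grid expressed relative to the detected neck point), here is a minimal NumPy sketch; the function name, the (T, 2) per-frame track format, and the velocity/speed/direction feature set are assumptions for illustration, and the paper's view-based encoding step is omitted.

import numpy as np

def body_centric_kinematics(part_tracks, neck_track, fps=30.0):
    """Sketch: encode per-part motion relative to the detected neck point.

    part_tracks: dict mapping a part name ('head', 'torso', 'legs') to a
                 (T, 2) array of per-frame centroids in image coordinates.
    neck_track:  (T, 2) array of detected neck-point positions.
    Returns a dict of (T-1, 4) arrays: relative velocity (vx, vy), speed,
    and motion direction, all in the neck-centred (body-centric) frame.
    """
    dt = 1.0 / fps
    feats = {}
    for name, track in part_tracks.items():
        rel_pos = track - neck_track                 # body-centric coordinates
        rel_vel = np.diff(rel_pos, axis=0) / dt      # motion w.r.t. the neck's motion
        speed = np.linalg.norm(rel_vel, axis=1, keepdims=True)
        angle = np.arctan2(rel_vel[:, 1], rel_vel[:, 0])[:, None]
        feats[name] = np.hstack([rel_vel, speed, angle])
    return feats

# Toy usage with synthetic tracks (T = 5 frames):
T = 5
neck = np.cumsum(np.random.randn(T, 2), axis=0)
parts = {"head": neck + [0.0, -40.0], "torso": neck + [0.0, 30.0], "legs": neck + [0.0, 90.0]}
features = body_centric_kinematics(parts, neck)
print(features["head"].shape)  # (4, 4)

Because positions are differenced after subtracting the neck track, any global translation of the subject cancels out, which is what makes such a descriptor tolerant of whole-body motion and of non-upright postures.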