3D Human Action Recognition for Multi-view Camera Systems

2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission Pub Date : 2011-05-16 DOI:10.1109/3DIMPVT.2011.50

M. B. Holte, T. Moeslund, N. Nikolaidis, I. Pitas

{"title":"3D Human Action Recognition for Multi-view Camera Systems","authors":"M. B. Holte, T. Moeslund, N. Nikolaidis, I. Pitas","doi":"10.1109/3DIMPVT.2011.50","DOIUrl":null,"url":null,"abstract":"This paper presents a novel approach for combining optical flow into enhanced 3D motion vector fields for human action recognition. Our approach detects motion of the actors by computing optical flow in video data captured by a multi-view camera setup with an arbitrary number of views. Optical flow is estimated in each view and extended to 3D using 3D reconstructions of the actors and pixel-to-vertex correspondences. The resulting 3D optical flow for each view is combined into a 3D motion vector field by taking the significance of local motion and its reliability into account. 3D Motion Context (3D-MC) and Harmonic Motion Context (HMC) are used to represent the extracted 3D motion vector fields efficiently and in a view-invariant manner, while considering difference in anthropometry of the actors and their movement style variations. The resulting 3D-MC and HMC descriptors are classified into a set of human actions using normalized correlation, taking into account the performing speed variations of different actors. We compare the performance of the 3D-MC and HMC descriptors, and show promising experimental results for the publicly available i3DPost Multi View Human Action Dataset.","PeriodicalId":330003,"journal":{"name":"2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission","volume":"73 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-05-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"97","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/3DIMPVT.2011.50","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 97

Abstract

This paper presents a novel approach for combining optical flow into enhanced 3D motion vector fields for human action recognition. Our approach detects motion of the actors by computing optical flow in video data captured by a multi-view camera setup with an arbitrary number of views. Optical flow is estimated in each view and extended to 3D using 3D reconstructions of the actors and pixel-to-vertex correspondences. The resulting 3D optical flow for each view is combined into a 3D motion vector field by taking the significance of local motion and its reliability into account. 3D Motion Context (3D-MC) and Harmonic Motion Context (HMC) are used to represent the extracted 3D motion vector fields efficiently and in a view-invariant manner, while considering difference in anthropometry of the actors and their movement style variations. The resulting 3D-MC and HMC descriptors are classified into a set of human actions using normalized correlation, taking into account the performing speed variations of different actors. We compare the performance of the 3D-MC and HMC descriptors, and show promising experimental results for the publicly available i3DPost Multi View Human Action Dataset.

查看原文本刊更多论文

多视角相机系统的三维人体动作识别

提出了一种将光流与增强的三维运动矢量场相结合的新方法，用于人体动作识别。我们的方法通过计算视频数据中的光流来检测演员的运动，这些数据是由具有任意数量视图的多视图摄像机设置捕获的。在每个视图中估计光流，并使用角色的3D重建和像素到顶点的对应关系扩展到3D。考虑到局部运动的重要性及其可靠性，将每个视图的三维光流组合成一个三维运动矢量场。三维运动上下文(3D- mc)和谐波运动上下文(HMC)在考虑演员的人体测量差异和运动风格变化的情况下，以视图不变的方式高效地表示提取的三维运动矢量场。由此产生的3D-MC和HMC描述符使用归一化相关性将其分类为一组人类动作，并考虑到不同参与者的执行速度变化。我们比较了3D-MC和HMC描述符的性能，并在公开可用的i3DPost多视图人类动作数据集上展示了有希望的实验结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2011 International Conference on 3D Imaging, Modeling, Processing, Visualization and Transmission

自引率

0.00%

发文量