定位和识别人类行为的统一框架

CVPR 2011 Pub Date : 2011-06-20 DOI:10.1109/CVPR.2011.5995648

Yuelei Xie, Hong Chang, Zhe Li, Luhong Liang, Xilin Chen, Debin Zhao

{"title":"定位和识别人类行为的统一框架","authors":"Yuelei Xie, Hong Chang, Zhe Li, Luhong Liang, Xilin Chen, Debin Zhao","doi":"10.1109/CVPR.2011.5995648","DOIUrl":null,"url":null,"abstract":"In this paper, we present a pose based approach for locating and recognizing human actions in videos. In our method, human poses are detected and represented based on deformable part model. To our knowledge, this is the first work on exploring the effectiveness of deformable part models in combining human detection and pose estimation into action recognition. Comparing with previous methods, ours have three main advantages. First, our method does not rely on any assumption on video preprocessing quality, such as satisfactory foreground segmentation or reliable tracking; Second, we propose a novel compact representation for human pose which works together with human detection and can well represent the spatial and temporal structures inside an action; Third, with human detection taken into consideration in our framework, our method has the ability to locate and recognize multiple actions in the same scene. Experiments on benchmark datasets and recorded cluttered videos verified the efficacy of our method.","PeriodicalId":445398,"journal":{"name":"CVPR 2011","volume":"66 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"38","resultStr":"{\"title\":\"A unified framework for locating and recognizing human actions\",\"authors\":\"Yuelei Xie, Hong Chang, Zhe Li, Luhong Liang, Xilin Chen, Debin Zhao\",\"doi\":\"10.1109/CVPR.2011.5995648\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we present a pose based approach for locating and recognizing human actions in videos. In our method, human poses are detected and represented based on deformable part model. To our knowledge, this is the first work on exploring the effectiveness of deformable part models in combining human detection and pose estimation into action recognition. Comparing with previous methods, ours have three main advantages. First, our method does not rely on any assumption on video preprocessing quality, such as satisfactory foreground segmentation or reliable tracking; Second, we propose a novel compact representation for human pose which works together with human detection and can well represent the spatial and temporal structures inside an action; Third, with human detection taken into consideration in our framework, our method has the ability to locate and recognize multiple actions in the same scene. Experiments on benchmark datasets and recorded cluttered videos verified the efficacy of our method.\",\"PeriodicalId\":445398,\"journal\":{\"name\":\"CVPR 2011\",\"volume\":\"66 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2011-06-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"38\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CVPR 2011\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CVPR.2011.5995648\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CVPR 2011","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPR.2011.5995648","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 38

摘要

在本文中，我们提出了一种基于姿态的方法来定位和识别视频中的人类动作。在我们的方法中，人体姿态检测和表示基于可变形部分模型。据我们所知，这是第一次探索可变形零件模型将人体检测和姿态估计结合到动作识别中的有效性。与以前的方法相比，我们的方法有三个主要优点。首先，我们的方法不依赖于对视频预处理质量的任何假设，例如令人满意的前景分割或可靠的跟踪;其次，我们提出了一种新的紧凑的人体姿态表示，它与人体检测相结合，可以很好地表示动作内部的空间和时间结构;第三，在我们的框架中考虑了人类检测，我们的方法具有在同一场景中定位和识别多个动作的能力。在基准数据集和录制的杂乱视频上的实验验证了该方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

A unified framework for locating and recognizing human actions

In this paper, we present a pose based approach for locating and recognizing human actions in videos. In our method, human poses are detected and represented based on deformable part model. To our knowledge, this is the first work on exploring the effectiveness of deformable part models in combining human detection and pose estimation into action recognition. Comparing with previous methods, ours have three main advantages. First, our method does not rely on any assumption on video preprocessing quality, such as satisfactory foreground segmentation or reliable tracking; Second, we propose a novel compact representation for human pose which works together with human detection and can well represent the spatial and temporal structures inside an action; Third, with human detection taken into consideration in our framework, our method has the ability to locate and recognize multiple actions in the same scene. Experiments on benchmark datasets and recorded cluttered videos verified the efficacy of our method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

CVPR 2011

自引率

0.00%

发文量