Recognizing action events from multiple viewpoints
T. Syeda-Mahmood, M. Alex O. Vasilescu, Saratendu Sethi
Proceedings IEEE Workshop on Detection and Recognition of Events in Video, 2001-07-08
DOI: 10.1109/EVENT.2001.938868 (https://doi.org/10.1109/EVENT.2001.938868)
Citations: 115
Abstract
A first step towards an understanding of the semantic content in a video is the reliable detection and recognition of actions performed by objects. This is a difficult problem due to the enormous variability in an action's appearance when seen from different viewpoints and/or at different times. In this paper we address the recognition of actions by taking a novel approach that models actions as special types of 3D objects. Specifically, we observe that any action can be represented as a generalized cylinder, called the action cylinder. Reliable recognition is achieved by recovering the viewpoint transformation between the reference (model) and given action cylinders. A set of 8 corresponding points from time-wise corresponding cross-sections is shown to be sufficient to align the two cylinders under perspective projection. A surprising conclusion from visualizing actions as objects is that rigid, articulated, and nonrigid actions can all be modeled in a uniform framework.
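The abstract states that 8 time-wise corresponding points are enough to recover the viewpoint transformation aligning the reference and observed action cylinders under perspective projection. The paper's own alignment procedure is not reproduced here; as a loosely analogous illustration only, the sketch below shows the classic normalized 8-point direct linear transform, which estimates a 3x3 matrix relating corresponding points between two perspective views. The helper names (`eight_point`, `normalize_points`) are hypothetical and numpy is assumed; this is not the authors' algorithm.

```python
# Illustrative sketch, not the paper's method: estimate a 3x3 matrix F relating
# corresponding points x1 <-> x2 between two perspective views from >= 8
# correspondences, via the classic normalized 8-point DLT.
import numpy as np

def normalize_points(pts):
    """Translate points to zero mean and scale so the mean distance is sqrt(2)."""
    pts = np.asarray(pts, float)
    centroid = pts.mean(axis=0)
    mean_dist = np.sqrt(((pts - centroid) ** 2).sum(axis=1)).mean()
    s = np.sqrt(2) / mean_dist
    T = np.array([[s, 0, -s * centroid[0]],
                  [0, s, -s * centroid[1]],
                  [0, 0, 1]])
    pts_h = np.column_stack([pts, np.ones(len(pts))])
    return (T @ pts_h.T).T, T

def eight_point(pts1, pts2):
    """Estimate F (with x2^T F x1 = 0) from Nx2 arrays of correspondences, N >= 8."""
    x1, T1 = normalize_points(pts1)
    x2, T2 = normalize_points(pts2)
    # Each correspondence contributes one row of the homogeneous system A f = 0.
    A = np.column_stack([
        x2[:, 0] * x1[:, 0], x2[:, 0] * x1[:, 1], x2[:, 0],
        x2[:, 1] * x1[:, 0], x2[:, 1] * x1[:, 1], x2[:, 1],
        x1[:, 0], x1[:, 1], np.ones(len(x1)),
    ])
    # The solution is the right singular vector with the smallest singular value.
    _, _, Vt = np.linalg.svd(A)
    F = Vt[-1].reshape(3, 3)
    # Enforce rank 2 by zeroing the smallest singular value of F.
    U, S, Vt = np.linalg.svd(F)
    F = U @ np.diag([S[0], S[1], 0.0]) @ Vt
    # Undo the normalization so F applies to the original coordinates.
    return T2.T @ F @ T1
```

The design point the abstract makes carries over: under perspective projection, a small fixed number of corresponding points (here 8, taken from time-wise matching cross-sections of the two action cylinders) determines the viewpoint relation, which is what makes recognition by alignment feasible.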