Functional categorization of objects using real-time markerless motion capture

Juergen Gall, A. Fossati, L. Van Gool
{"title":"Functional categorization of objects using real-time markerless motion capture","authors":"Juergen Gall, A. Fossati, L. Gool","doi":"10.1109/CVPR.2011.5995582","DOIUrl":null,"url":null,"abstract":"Unsupervised categorization of objects is a fundamental problem in computer vision. While appearance-based methods have become popular recently, other important cues like functionality are largely neglected. Motivated by psychological studies giving evidence that human demonstration has a facilitative effect on categorization in infancy, we propose an approach for object categorization from depth video streams. To this end, we have developed a method for capturing human motion in real-time. The captured data is then used to temporally segment the depth streams into actions. The set of segmented actions are then categorized in an un-supervised manner, through a novel descriptor for motion capture data that is robust to subject variations. Furthermore, we automatically localize the object that is manipulated within a video segment, and categorize it using the corresponding action. For evaluation, we have recorded a dataset that comprises depth data with registered video sequences for 6 subjects, 13 action classes, and 174 object manipulations.","PeriodicalId":445398,"journal":{"name":"CVPR 2011","volume":"79 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"70","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"CVPR 2011","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPR.2011.5995582","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 70

Abstract

Unsupervised categorization of objects is a fundamental problem in computer vision. While appearance-based methods have become popular recently, other important cues such as functionality are largely neglected. Motivated by psychological studies providing evidence that human demonstration has a facilitative effect on categorization in infancy, we propose an approach for object categorization from depth video streams. To this end, we have developed a method for capturing human motion in real time. The captured data is then used to temporally segment the depth streams into actions. The set of segmented actions is then categorized in an unsupervised manner, using a novel descriptor for motion capture data that is robust to subject variations. Furthermore, we automatically localize the object that is manipulated within a video segment and categorize it using the corresponding action. For evaluation, we have recorded a dataset that comprises depth data with registered video sequences for 6 subjects, 13 action classes, and 174 object manipulations.
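The pipeline outlined in the abstract (capture 3-D pose in real time, temporally segment the depth stream into actions, compute a subject-invariant descriptor per segment, and cluster the segments without supervision) can be illustrated with a minimal sketch. The descriptor below (root-relative pose and velocity statistics) and the k-means clustering are assumptions chosen for illustration only, not the paper's actual descriptor or categorization method; `segment_descriptor` and `categorize_actions` are hypothetical helpers.

```python
# Illustrative sketch only (NOT the paper's method): cluster action segments by
# a simple pose-trajectory descriptor, then let the manipulated objects inherit
# the cluster labels of their segments.
import numpy as np
from sklearn.cluster import KMeans

def segment_descriptor(joints):
    """joints: (T, J, 3) array of 3-D joint positions over one action segment.

    Returns a fixed-length descriptor that is roughly invariant to the
    subject's absolute position: root-relative pose and motion statistics.
    """
    root = joints[:, :1, :]                 # root/pelvis joint per frame
    rel = joints - root                     # root-relative poses
    vel = np.diff(rel, axis=0)              # frame-to-frame joint motion
    feat = np.concatenate([
        rel.mean(axis=0).ravel(),           # mean pose
        rel.std(axis=0).ravel(),            # pose variation
        np.abs(vel).mean(axis=0).ravel(),   # average joint speed
    ])
    return feat / (np.linalg.norm(feat) + 1e-8)

def categorize_actions(segments, n_categories):
    """segments: list of (T_i, J, 3) arrays, one per temporally segmented action."""
    X = np.stack([segment_descriptor(s) for s in segments])
    labels = KMeans(n_clusters=n_categories, n_init=10).fit_predict(X)
    return labels
```

In this sketch, the object localized within each video segment would simply inherit that segment's action-cluster label, mirroring the action-to-object categorization step described in the abstract.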