A compositional approach for 3D arm-hand action recognition

2013 IEEE Workshop on Robot Vision (WORV) Pub Date : 2013-05-30 DOI:10.1109/WORV.2013.6521926

I. Gori, S. Fanello, F. Odone, G. Metta

引用次数: 10

Abstract

In this paper we propose a fast and reliable vision-based framework for 3D arm-hand action modelling, learning and recognition in human-robot interaction scenarios. The architecture consists of a compositional model that divides the arm-hand action recognition problem into three levels. The bottom level is based on a simple but sufficiently accurate algorithm for the computation of the scene flow. The middle level serves to classify action primitives through descriptors obtained from 3D Histogram of Flow (3D-HOF); we further apply a sparse coding (SC) algorithm to deal with noise. Action Primitives are then modelled and classified by linear Support Vector Machines (SVMs), and we propose an on-line algorithm to cope with the real-time recognition of primitive sequences. The top level system synthesises combinations of primitives by means of a syntactic approach. In summary the main contribution of the paper is an incremental method for 3D arm-hand behaviour modelling and recognition, fully implemented and tested on the iCub robot, allowing it to learn new actions after a single demonstration.

查看原文本刊更多论文

三维手臂动作识别的合成方法

在本文中，我们提出了一个快速可靠的基于视觉的框架，用于人机交互场景下的三维手臂动作建模、学习和识别。该体系结构由一个组合模型组成，该模型将手臂动作识别问题分为三个层次。底层是基于一个简单但足够精确的算法来计算场景流。中间层通过三维流直方图(3D- hof)获得的描述符对动作基元进行分类;我们进一步应用稀疏编码(SC)算法来处理噪声。然后用线性支持向量机(svm)对动作原语进行建模和分类，并提出了一种在线算法来处理原语序列的实时识别。顶层系统通过语法方法合成原语的组合。总之，本文的主要贡献是一种3D手臂行为建模和识别的增量方法，在iCub机器人上完全实现和测试，使其在一次演示后学习新的动作。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2013 IEEE Workshop on Robot Vision (WORV)

自引率

0.00%

发文量