A multi-modal gesture recognition system in a Human-Robot Interaction scenario

2009 IEEE International Workshop on Robotic and Sensors Environments Pub Date : 2009-12-18 DOI:10.1109/ROSE.2009.5355984

Zhi Li, R. Jarvis

引用次数: 15

Abstract

Recognition of non-verbal gestures is essential for robots to understand a user's state and intention in a Human-Robot Interaction (HRI) scenario. In this paper a multi-modal system is proposed to recognize a user's hand gestures and estimate body poses from the robot's viewpoint only. A range camera is employed to derive the depth data at a high frame rate. Depth data is useful for image segmentation, objects detection and localization in 3D spaces. A pair of stereo cameras is used to sense the user's head gestures and eye gaze direction, which provide useful information about the user's attention direction. Both hand shapes and hand trajectories are recognized. Full configurations of body poses are estimated using a model-based algorithm. Poses are tracked by a Particle Filter method, and refined by a gradient-based searching method in the neighborhood of the particles which have top largest weights.

查看原文本刊更多论文

人机交互场景中的多模态手势识别系统

在人机交互(HRI)场景中，非语言手势的识别对于机器人理解用户的状态和意图至关重要。本文提出了一种多模态系统，仅从机器人的视点来识别用户的手势和估计身体姿势。采用距离相机在高帧率下获取深度数据。深度数据对于三维空间中的图像分割、目标检测和定位非常有用。一对立体摄像头用于感知用户的头部手势和眼睛注视方向，从而提供有关用户注意力方向的有用信息。手的形状和轨迹都被识别。使用基于模型的算法估计身体姿势的完整配置。采用粒子滤波方法对姿态进行跟踪，并在权值最大的粒子附近采用梯度搜索方法对姿态进行细化。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 IEEE International Workshop on Robotic and Sensors Environments

自引率

0.00%

发文量