EYEWATCHME—3D Hand and object tracking for inside out activity analysis

2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops Pub Date : 2009-06-20 DOI:10.1109/CVPRW.2009.5204358

Li Sun, Ulrich Klank, M. Beetz

{"title":"EYEWATCHME—3D Hand and object tracking for inside out activity analysis","authors":"Li Sun, Ulrich Klank, M. Beetz","doi":"10.1109/CVPRW.2009.5204358","DOIUrl":null,"url":null,"abstract":"This paper investigates the “inside-out” recognition of everyday manipulation tasks using a gaze-directed camera, which is a camera that actively directs at the visual attention focus of the person wearing the camera. We present EYEWATCHME, an integrated vision and state estimation system that at the same time tracks the positions and the poses of the acting hands, the pose that the manipulated object, and the pose of the observing camera. Taken together, EYEWATCHME provides comprehensive data for learning predictive models of vision-guided manipulation that include the objects people are attending, the interaction of attention and reaching/grasping, and the segmentation of reaching and grasping using visual attention as evidence. Key technical contributions of this paper include an ego view hand tracking system that estimates 27 DOF hand poses. The hand tracking system is capable of detecting hands and estimating their poses despite substantial self-occlusion caused by the hand and occlusions caused by the manipulated object. EYEWATCHME can also cope with blurred images that are caused by rapid eye movements. The second key contribution is the of the integrated activity recognition system that simultaneously tracks the attention of the person, the hand poses, and the poses of the manipulated objects in terms of a global scene coordinates. We demonstrate the operation of EYEWATCHME in the context of kitchen tasks including filling a cup with water.","PeriodicalId":431981,"journal":{"name":"2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops","volume":"994 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-06-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"50","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CVPRW.2009.5204358","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 50

Abstract

This paper investigates the “inside-out” recognition of everyday manipulation tasks using a gaze-directed camera, which is a camera that actively directs at the visual attention focus of the person wearing the camera. We present EYEWATCHME, an integrated vision and state estimation system that at the same time tracks the positions and the poses of the acting hands, the pose that the manipulated object, and the pose of the observing camera. Taken together, EYEWATCHME provides comprehensive data for learning predictive models of vision-guided manipulation that include the objects people are attending, the interaction of attention and reaching/grasping, and the segmentation of reaching and grasping using visual attention as evidence. Key technical contributions of this paper include an ego view hand tracking system that estimates 27 DOF hand poses. The hand tracking system is capable of detecting hands and estimating their poses despite substantial self-occlusion caused by the hand and occlusions caused by the manipulated object. EYEWATCHME can also cope with blurred images that are caused by rapid eye movements. The second key contribution is the of the integrated activity recognition system that simultaneously tracks the attention of the person, the hand poses, and the poses of the manipulated objects in terms of a global scene coordinates. We demonstrate the operation of EYEWATCHME in the context of kitchen tasks including filling a cup with water.

查看原文本刊更多论文

EYEWATCHME-3D手和对象跟踪从内到外的活动分析

本文研究了使用视线导向相机对日常操作任务的“由内到外”识别，这是一种主动指向佩戴相机的人的视觉注意力焦点的相机。我们提出了EYEWATCHME，一个集成的视觉和状态估计系统，同时跟踪动作手的位置和姿势，被操纵物体的姿势，以及观察相机的姿势。综上所述，EYEWATCHME为学习视觉引导操作的预测模型提供了全面的数据，包括人们正在关注的对象，注意和伸手/抓握的相互作用，以及以视觉注意为证据的伸手和抓握的分割。本文的关键技术贡献包括一个自我视图手部跟踪系统，该系统可以估计27个DOF手部姿势。该手部跟踪系统能够检测手部并估计其姿势，尽管手部和被操纵物体造成了严重的自遮挡。EYEWATCHME还可以处理由快速眼球运动引起的图像模糊问题。第二个关键贡献是集成的活动识别系统，该系统可以根据全局场景坐标同时跟踪人的注意力、手的姿势和被操纵物体的姿势。我们在厨房任务的背景下演示EYEWATCHME的操作，包括将杯子装满水。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops

自引率

0.00%

发文量