{"title":"Estimation of 3-D pose and shape from a monocular image sequence and real-time human tracking","authors":"Y. Shirai","doi":"10.1109/IM.1997.603858","DOIUrl":null,"url":null,"abstract":"This paper describes the recognition of the 3-D pose and shape of articulated objects like a human hand and visual tracking of moving persons from a sequence of images. In the first stage of pose and shape recognition, the rough estimation of the pose is obtained by silhouette matching to a rough model of a hand and fingers. In the second stage, the model is refined using restrictions of the shape and pose of the object. Modifying the extended Kalman filter so as to satisfy the restrictions, the depth ambiguity is gradually resolved from observed images. Next, a method is proposed for tracking an object from the optical flow and depth data acquired from a sequence of stereo images. A target region is extracted by Baysian inference in terms of the optical flow, disparity and the predicted target location. Occlusion of the target can also be detected from the abrupt change of the disparity of the target region. Real-time human tracking in a real image sequence is shown.","PeriodicalId":337843,"journal":{"name":"Proceedings. International Conference on Recent Advances in 3-D Digital Imaging and Modeling (Cat. No.97TB100134)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1997-05-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings. International Conference on Recent Advances in 3-D Digital Imaging and Modeling (Cat. No.97TB100134)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IM.1997.603858","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8
Abstract
This paper describes the recognition of the 3-D pose and shape of articulated objects like a human hand and visual tracking of moving persons from a sequence of images. In the first stage of pose and shape recognition, the rough estimation of the pose is obtained by silhouette matching to a rough model of a hand and fingers. In the second stage, the model is refined using restrictions of the shape and pose of the object. Modifying the extended Kalman filter so as to satisfy the restrictions, the depth ambiguity is gradually resolved from observed images. Next, a method is proposed for tracking an object from the optical flow and depth data acquired from a sequence of stereo images. A target region is extracted by Baysian inference in terms of the optical flow, disparity and the predicted target location. Occlusion of the target can also be detected from the abrupt change of the disparity of the target region. Real-time human tracking in a real image sequence is shown.