Utilization of stereo disparity and optical flow information for human interaction
Ikushi Yoda, K. Sakaue
Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)
Publication date: 1998-01-04
DOI: https://doi.org/10.1109/ICCV.1998.710855
Citation count: 8
Abstract
To achieve smooth human interaction, we propose a system that simultaneously uses stereo disparity and optical flow information, computed from real-time grayscale stereo images at multiple resolutions, to recognize objects and gestures. To calculate the disparity and optical flow of a stereo image in real time, the system first creates pyramid images using a Gaussian filter. It then determines the disparity and optical flow of a low-density image and extracts the regions in front of a certain depth. The three foremost regions are recognized using higher-order local autocorrelation features and linear discriminant analysis. In this way, the system recognizes the faces and hand signs that users present foremost, and roughly recognizes movements within those regions in real time. With this framework, the system can discriminate the face of a user, monitor the user's basic movements, smoothly learn objects presented by users, and communicate with users through hand signs learned in advance.
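The first two stages described in the abstract (building a Gaussian pyramid, then keeping only the regions in front of a certain depth) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the 5-tap binomial kernel, the function names `gaussian_blur`, `gaussian_pyramid`, and `foreground_mask`, and the `min_disparity` threshold are all assumptions made for illustration. The disparity map is taken as given, and region labeling, optical flow, and the autocorrelation/discriminant-analysis recognition stage are omitted.

```python
import numpy as np


def gaussian_blur(img):
    """Separable 5-tap binomial blur (an assumed approximation of a Gaussian)."""
    k = np.array([1.0, 4.0, 6.0, 4.0, 1.0]) / 16.0
    h, w = img.shape
    padded = np.pad(img, 2, mode="edge")  # replicate borders to keep the size
    # Horizontal pass: weighted sum of five column-shifted copies.
    rows = sum(k[i] * padded[:, i:i + w] for i in range(5))
    # Vertical pass: weighted sum of five row-shifted copies.
    return sum(k[i] * rows[i:i + h, :] for i in range(5))


def gaussian_pyramid(img, levels=3):
    """Return [full resolution, 1/2, 1/4, ...] by blurring then subsampling."""
    pyramid = [np.asarray(img, dtype=float)]
    for _ in range(levels - 1):
        pyramid.append(gaussian_blur(pyramid[-1])[::2, ::2])
    return pyramid


def foreground_mask(disparity, min_disparity):
    """Keep pixels closer than a depth limit.

    Larger disparity means closer to the cameras, so "in front of a
    certain depth" becomes a lower bound on disparity. The threshold
    value is a hypothetical parameter.
    """
    return np.asarray(disparity) >= min_disparity


if __name__ == "__main__":
    image = np.random.default_rng(0).random((64, 64))
    for level in gaussian_pyramid(image, levels=3):
        print(level.shape)
```

Working on the coarsest pyramid level keeps the per-frame cost low, which is consistent with the abstract's emphasis on real-time processing; the thresholded mask would then be split into connected regions before recognition.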