Utilization of stereo disparity and optical flow information for human interaction
Authors: Ikushi Yoda, K. Sakaue
Published in: Sixth International Conference on Computer Vision (IEEE Cat. No.98CH36271)
Publication date: 1998-01-04
DOI: 10.1109/ICCV.1998.710855
To attain smooth human interaction, we propose a system that simultaneously uses stereo disparity and optical flow information from real-time, multiresolution, grayscale stereo images to recognize objects and gestures. To compute disparity and optical flow in real time, the system first builds image pyramids with a Gaussian filter. It then determines the disparity and optical flow of a low-density image and extracts the regions that lie in front of a certain depth. The three foremost regions are recognized using higher-order local autocorrelation features and linear discriminant analysis. With this process, the system recognizes the faces and hand signs that users present foremost, and roughly recognizes movements within those regions in real time. With this framework, the system can discriminate the face of a user, monitor the user's basic movements, smoothly learn an object presented by a user, and communicate with users through hand signs learned in advance.
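The first two stages described above (building a Gaussian pyramid, then keeping only the pixels in front of a depth cut-off) can be illustrated with a minimal sketch. It is not the authors' implementation: the 5-tap binomial kernel, the function names `gaussian_blur`, `gaussian_pyramid`, and `foreground_mask`, and the `min_disparity` threshold are all assumptions made for illustration. Disparity estimation itself, the selection of the three foremost regions, and the recognition stage are not shown.

```python
import numpy as np

# Assumption: a 5-tap binomial kernel as a discrete approximation of a Gaussian.
KERNEL = np.array([1.0, 4.0, 6.0, 4.0, 1.0]) / 16.0


def gaussian_blur(image):
    """Separable blur: filter along rows, then along columns (edges replicated)."""
    height, width = image.shape
    padded = np.pad(image.astype(float), 2, mode="edge")
    # Horizontal pass: weighted sum of five column-shifted copies.
    rows = sum(w * padded[:, i:i + width] for i, w in enumerate(KERNEL))
    # Vertical pass: weighted sum of five row-shifted copies.
    return sum(w * rows[i:i + height, :] for i, w in enumerate(KERNEL))


def gaussian_pyramid(image, levels):
    """Return a list of images: full resolution first, each next level halved."""
    pyramid = [image.astype(float)]
    for _ in range(levels - 1):
        # Blur before subsampling to limit aliasing, then keep every other pixel.
        pyramid.append(gaussian_blur(pyramid[-1])[::2, ::2])
    return pyramid


def foreground_mask(disparity, min_disparity):
    """Mark pixels in front of a depth cut-off (larger disparity means nearer)."""
    return disparity >= min_disparity


# Hypothetical usage with synthetic data.
pyramid = gaussian_pyramid(np.ones((64, 48)), levels=3)
coarse_disparity = np.array([[0, 5], [9, 2]])
mask = foreground_mask(coarse_disparity, min_disparity=5)
```

Running the expensive per-pixel matching on the coarsest pyramid level is what keeps this kind of pipeline cheap: each level has a quarter of the pixels of the one below it.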