Zucheul Lee, Ramsin Khoshabeh, Jason Juang, Truong Q. Nguyen
{"title":"Local stereo matching using motion cue and modified census in video disparity estimation","authors":"Zucheul Lee, Ramsin Khoshabeh, Jason Juang, Truong Q. Nguyen","doi":"10.5281/ZENODO.42823","DOIUrl":null,"url":null,"abstract":"In the human visual system, proximity, similarity, and motion are fundamental attributes that group visual objects together locally. The objects grouped by these attributes are most likely to have the same depth. In previous works, proximity and similarity have been considered in the computation of image disparity maps. However, they are insufficient for video disparity estimation because motion cues are very important for accurate depth estimation near edges of moving objects. We incorporate motion flow to compute each pixel's support weight, a measure directly affecting the accuracy of disparity maps in local methods. For robustness to image noise in flat areas, we propose a modified census transform with a noise buffer. The experimental results show that the proposed method produces more accurate disparity maps than current state-of-the-art methods, both on edges and in flat areas according to subjective and objective measures.","PeriodicalId":201182,"journal":{"name":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","volume":"31 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"11","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2012 Proceedings of the 20th European Signal Processing Conference (EUSIPCO)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5281/ZENODO.42823","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 11
Abstract
In the human visual system, proximity, similarity, and motion are fundamental attributes that group visual objects together locally. The objects grouped by these attributes are most likely to have the same depth. In previous works, proximity and similarity have been considered in the computation of image disparity maps. However, they are insufficient for video disparity estimation because motion cues are very important for accurate depth estimation near edges of moving objects. We incorporate motion flow to compute each pixel's support weight, a measure directly affecting the accuracy of disparity maps in local methods. For robustness to image noise in flat areas, we propose a modified census transform with a noise buffer. The experimental results show that the proposed method produces more accurate disparity maps than current state-of-the-art methods, both on edges and in flat areas according to subjective and objective measures.