{"title":"基于卷积神经网络的立体匹配视差优化训练","authors":"Guorui Song, Hong Zheng, Qingfeng Wang, Z. Su","doi":"10.1145/3155077.3155083","DOIUrl":null,"url":null,"abstract":"In this paper, we describe an efficient stereo matching algorithm which is inspired by the excellent performances of convolutional neural network (CNN) on vision problems in recent years. Our algorithm applies adaptive smoothness constraints making use of disparity discontinuous information to optimize the overall disparity map. First, we define a CNN architecture called DD-CNN to classify whether disparities of pixels in the image is continuous or not. The training data set is constructed from Middlebury stereo data sets. Once we obtain the disparity discontinuous map, different penalizes are applied to the energy function which takes the whole disparity map as argument. The algorithm imposes large penalizes to disparity differences between pixels and their neighborhoods when disparities of the center pixels are predicted to be discontinuous and small penalizes otherwise. Experiments show that the proposed algorithm performs better than the state-of-art algorithm.","PeriodicalId":237079,"journal":{"name":"Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Training a Convolutional Neural Network for Disparity Optimization in Stereo Matching\",\"authors\":\"Guorui Song, Hong Zheng, Qingfeng Wang, Z. Su\",\"doi\":\"10.1145/3155077.3155083\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we describe an efficient stereo matching algorithm which is inspired by the excellent performances of convolutional neural network (CNN) on vision problems in recent years. Our algorithm applies adaptive smoothness constraints making use of disparity discontinuous information to optimize the overall disparity map. First, we define a CNN architecture called DD-CNN to classify whether disparities of pixels in the image is continuous or not. The training data set is constructed from Middlebury stereo data sets. Once we obtain the disparity discontinuous map, different penalizes are applied to the energy function which takes the whole disparity map as argument. The algorithm imposes large penalizes to disparity differences between pixels and their neighborhoods when disparities of the center pixels are predicted to be discontinuous and small penalizes otherwise. Experiments show that the proposed algorithm performs better than the state-of-art algorithm.\",\"PeriodicalId\":237079,\"journal\":{\"name\":\"Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3155077.3155083\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3155077.3155083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Training a Convolutional Neural Network for Disparity Optimization in Stereo Matching
In this paper, we describe an efficient stereo matching algorithm which is inspired by the excellent performances of convolutional neural network (CNN) on vision problems in recent years. Our algorithm applies adaptive smoothness constraints making use of disparity discontinuous information to optimize the overall disparity map. First, we define a CNN architecture called DD-CNN to classify whether disparities of pixels in the image is continuous or not. The training data set is constructed from Middlebury stereo data sets. Once we obtain the disparity discontinuous map, different penalizes are applied to the energy function which takes the whole disparity map as argument. The algorithm imposes large penalizes to disparity differences between pixels and their neighborhoods when disparities of the center pixels are predicted to be discontinuous and small penalizes otherwise. Experiments show that the proposed algorithm performs better than the state-of-art algorithm.