基于卷积神经网络的立体匹配视差优化训练

Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics Pub Date : 2017-10-18 DOI:10.1145/3155077.3155083

Guorui Song, Hong Zheng, Qingfeng Wang, Z. Su

{"title":"基于卷积神经网络的立体匹配视差优化训练","authors":"Guorui Song, Hong Zheng, Qingfeng Wang, Z. Su","doi":"10.1145/3155077.3155083","DOIUrl":null,"url":null,"abstract":"In this paper, we describe an efficient stereo matching algorithm which is inspired by the excellent performances of convolutional neural network (CNN) on vision problems in recent years. Our algorithm applies adaptive smoothness constraints making use of disparity discontinuous information to optimize the overall disparity map. First, we define a CNN architecture called DD-CNN to classify whether disparities of pixels in the image is continuous or not. The training data set is constructed from Middlebury stereo data sets. Once we obtain the disparity discontinuous map, different penalizes are applied to the energy function which takes the whole disparity map as argument. The algorithm imposes large penalizes to disparity differences between pixels and their neighborhoods when disparities of the center pixels are predicted to be discontinuous and small penalizes otherwise. Experiments show that the proposed algorithm performs better than the state-of-art algorithm.","PeriodicalId":237079,"journal":{"name":"Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics","volume":"5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Training a Convolutional Neural Network for Disparity Optimization in Stereo Matching\",\"authors\":\"Guorui Song, Hong Zheng, Qingfeng Wang, Z. Su\",\"doi\":\"10.1145/3155077.3155083\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this paper, we describe an efficient stereo matching algorithm which is inspired by the excellent performances of convolutional neural network (CNN) on vision problems in recent years. Our algorithm applies adaptive smoothness constraints making use of disparity discontinuous information to optimize the overall disparity map. First, we define a CNN architecture called DD-CNN to classify whether disparities of pixels in the image is continuous or not. The training data set is constructed from Middlebury stereo data sets. Once we obtain the disparity discontinuous map, different penalizes are applied to the energy function which takes the whole disparity map as argument. The algorithm imposes large penalizes to disparity differences between pixels and their neighborhoods when disparities of the center pixels are predicted to be discontinuous and small penalizes otherwise. Experiments show that the proposed algorithm performs better than the state-of-art algorithm.\",\"PeriodicalId\":237079,\"journal\":{\"name\":\"Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics\",\"volume\":\"5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/3155077.3155083\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3155077.3155083","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

本文受卷积神经网络近年来在视觉问题上的优异表现启发，提出了一种高效的立体匹配算法。该算法采用自适应平滑约束，利用视差不连续信息对整体视差图进行优化。首先，我们定义了一个叫做DD-CNN的CNN架构来分类图像中像素的差异是否连续。训练数据集由Middlebury立体数据集构建而成。得到视差不连续映射后，对以整个视差映射为参数的能量函数施加不同的惩罚。当中心像素的差异被预测为不连续时，该算法对像素与其邻域之间的差异施加较大的惩罚，否则施加较小的惩罚。实验结果表明，该算法的性能优于现有算法。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Training a Convolutional Neural Network for Disparity Optimization in Stereo Matching

In this paper, we describe an efficient stereo matching algorithm which is inspired by the excellent performances of convolutional neural network (CNN) on vision problems in recent years. Our algorithm applies adaptive smoothness constraints making use of disparity discontinuous information to optimize the overall disparity map. First, we define a CNN architecture called DD-CNN to classify whether disparities of pixels in the image is continuous or not. The training data set is constructed from Middlebury stereo data sets. Once we obtain the disparity discontinuous map, different penalizes are applied to the energy function which takes the whole disparity map as argument. The algorithm imposes large penalizes to disparity differences between pixels and their neighborhoods when disparities of the center pixels are predicted to be discontinuous and small penalizes otherwise. Experiments show that the proposed algorithm performs better than the state-of-art algorithm.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Proceedings of the 2017 International Conference on Computational Biology and Bioinformatics

自引率

0.00%

发文量