{"title":"Design of Binocular Stereo Vision System Via CNN-based Stereo Matching Algorithm","authors":"Yan Jiao, P. Ho","doi":"10.1109/NaNA53684.2021.00080","DOIUrl":null,"url":null,"abstract":"In this paper, we design a binocular stereo vision system based on an adjustable narrow-baseline stereo camera for extracting depth information from a rectified stereo pair. The camera calibration and rectification are performed to get a rectified stereo pair serving as the input to the stereo matching algorithm. This algorithm searches the corresponding points between the left and right images and produces a disparity map that is used to obtain the depths via the triangulation principle. We focus on the first stage of the algorithm and propose a CNN-based approach to calculating the matching cost. Fast and slow networks are presented and trained on standard stereo datasets. The output of either network is regarded as the initial matching cost, followed by a series of post-processing methods for generating qualified disparity maps. The contrast tests have demonstrated that the CNN-based methods outperform census transformation on the mentioned datasets. Finally, we advance two error criteria to acquire the range of system working distance under diverse baseline lengths.","PeriodicalId":414672,"journal":{"name":"2021 International Conference on Networking and Network Applications (NaNA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Networking and Network Applications (NaNA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/NaNA53684.2021.00080","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
In this paper, we design a binocular stereo vision system based on an adjustable narrow-baseline stereo camera for extracting depth information from a rectified stereo pair. The camera calibration and rectification are performed to get a rectified stereo pair serving as the input to the stereo matching algorithm. This algorithm searches the corresponding points between the left and right images and produces a disparity map that is used to obtain the depths via the triangulation principle. We focus on the first stage of the algorithm and propose a CNN-based approach to calculating the matching cost. Fast and slow networks are presented and trained on standard stereo datasets. The output of either network is regarded as the initial matching cost, followed by a series of post-processing methods for generating qualified disparity maps. The contrast tests have demonstrated that the CNN-based methods outperform census transformation on the mentioned datasets. Finally, we advance two error criteria to acquire the range of system working distance under diverse baseline lengths.