{"title":"DSN: A Fast Stereo Disparity Estimation Network based on Deformable Convolution","authors":"Wenrui Li, Zhiqiang Wang, Qing Zhu","doi":"10.1109/ICMCCE51767.2020.00509","DOIUrl":null,"url":null,"abstract":"In recent years, the stereo matching method based on the convolutional neural network has been greatly developed and achieved accurate disparity estimation results. However, the high precision stereo disparity estimation method is often slow in reasoning, so it cannot meet the requirements of real-time scene. Moreover, this type of method has a large number of parameters and is therefore not friendly to resource-constrained embedded devices. In this paper, a novel 2D cost aggregation module based on deformable convolution is constructed based on AnyNet, and a lightweight stereo matching network named DSN is constructed accordingly. We measured performance on SceneFlow and KTTTI2015 dataset. Experiments show that our method achieves the approximate inference time and parameter quantity of AnyNet, and the accuracy of disparity estimation is much better than AnyNet, achieving a better tradeoff between performance and time.","PeriodicalId":6712,"journal":{"name":"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","volume":"102 1","pages":"2354-2360"},"PeriodicalIF":0.0000,"publicationDate":"2020-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Mechanical, Control and Computer Engineering (ICMCCE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMCCE51767.2020.00509","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In recent years, the stereo matching method based on the convolutional neural network has been greatly developed and achieved accurate disparity estimation results. However, the high precision stereo disparity estimation method is often slow in reasoning, so it cannot meet the requirements of real-time scene. Moreover, this type of method has a large number of parameters and is therefore not friendly to resource-constrained embedded devices. In this paper, a novel 2D cost aggregation module based on deformable convolution is constructed based on AnyNet, and a lightweight stereo matching network named DSN is constructed accordingly. We measured performance on SceneFlow and KTTTI2015 dataset. Experiments show that our method achieves the approximate inference time and parameter quantity of AnyNet, and the accuracy of disparity estimation is much better than AnyNet, achieving a better tradeoff between performance and time.