层次特征融合与多尺度代价聚合立体匹配

2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET) Pub Date : 2022-08-19 DOI:10.1109/CCET55412.2022.9906319

Jiaquan Zhang, Pengfei Li, Xin'an Wang, Yong Zhao

{"title":"层次特征融合与多尺度代价聚合立体匹配","authors":"Jiaquan Zhang, Pengfei Li, Xin'an Wang, Yong Zhao","doi":"10.1109/CCET55412.2022.9906319","DOIUrl":null,"url":null,"abstract":"To further improve the accuracy of disparity estimation in ill-posed regions and weak texture regions, in this paper we propose HFMANet: which is a stereo matching method based on hierarchical feature fusion and multi-scale cost aggregation. Specifically, we first propose a hierarchical feature fusion module, which innovatively fuses low-level features and high-level features to obtain rich semantic information while retaining the edge information of the image. Secondly, we propose a multi-scale cost aggregation module to extract rich global context information. At the same time, the layer-by-layer fusion optimization helps increase the receptive field to capture more structural information, reduce the dependence on local information, and help the disparity estimation of ill-posed regions and weak-textured regions. Comprehensive experiments are conducted on the SceneFlow and KITTI datasets, and achieve competitive results, which proves the effectiveness of the proposed method.","PeriodicalId":329327,"journal":{"name":"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)","volume":"55 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-08-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Hierarchical Feature Fusion and Multi-scale Cost Aggregation for Stereo Matching\",\"authors\":\"Jiaquan Zhang, Pengfei Li, Xin'an Wang, Yong Zhao\",\"doi\":\"10.1109/CCET55412.2022.9906319\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"To further improve the accuracy of disparity estimation in ill-posed regions and weak texture regions, in this paper we propose HFMANet: which is a stereo matching method based on hierarchical feature fusion and multi-scale cost aggregation. Specifically, we first propose a hierarchical feature fusion module, which innovatively fuses low-level features and high-level features to obtain rich semantic information while retaining the edge information of the image. Secondly, we propose a multi-scale cost aggregation module to extract rich global context information. At the same time, the layer-by-layer fusion optimization helps increase the receptive field to capture more structural information, reduce the dependence on local information, and help the disparity estimation of ill-posed regions and weak-textured regions. Comprehensive experiments are conducted on the SceneFlow and KITTI datasets, and achieve competitive results, which proves the effectiveness of the proposed method.\",\"PeriodicalId\":329327,\"journal\":{\"name\":\"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)\",\"volume\":\"55 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-08-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCET55412.2022.9906319\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCET55412.2022.9906319","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

为了进一步提高病态区域和弱纹理区域视差估计的精度，本文提出了一种基于层次特征融合和多尺度代价聚合的立体匹配方法HFMANet。具体而言，我们首先提出了一种分层特征融合模块，该模块创新性地融合了低级特征和高级特征，在保留图像边缘信息的同时获得丰富的语义信息。其次，我们提出了一个多尺度成本聚合模块来提取丰富的全局上下文信息。同时，通过逐层融合优化，增加接收野以捕获更多的结构信息，减少对局部信息的依赖，有助于病态区域和弱纹理区域的视差估计。在SceneFlow和KITTI数据集上进行了综合实验，取得了比较好的结果，证明了所提方法的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Hierarchical Feature Fusion and Multi-scale Cost Aggregation for Stereo Matching

To further improve the accuracy of disparity estimation in ill-posed regions and weak texture regions, in this paper we propose HFMANet: which is a stereo matching method based on hierarchical feature fusion and multi-scale cost aggregation. Specifically, we first propose a hierarchical feature fusion module, which innovatively fuses low-level features and high-level features to obtain rich semantic information while retaining the edge information of the image. Secondly, we propose a multi-scale cost aggregation module to extract rich global context information. At the same time, the layer-by-layer fusion optimization helps increase the receptive field to capture more structural information, reduce the dependence on local information, and help the disparity estimation of ill-posed regions and weak-textured regions. Comprehensive experiments are conducted on the SceneFlow and KITTI datasets, and achieve competitive results, which proves the effectiveness of the proposed method.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2022 IEEE 5th International Conference on Computer and Communication Engineering Technology (CCET)

自引率

0.00%

发文量