Digital surface model generation from high-resolution satellite stereos based on hybrid feature fusion network

Zhi Zheng, Yi Wan, Yongjun Zhang, Zhonghua Hu, Dong Wei, Yongxiang Yao, Chenming Zhu, Kun Yang, Rang Xiao
{"title":"Digital surface model generation from high-resolution satellite stereos based on hybrid feature fusion network","authors":"Zhi Zheng, Yi Wan, Yongjun Zhang, Zhonghua Hu, Dong Wei, Yongxiang Yao, Chenming Zhu, Kun Yang, Rang Xiao","doi":"10.1111/phor.12471","DOIUrl":null,"url":null,"abstract":"Recent studies have demonstrated that deep learning-based stereo matching methods (DLSMs) can far exceed conventional ones on most benchmark datasets by both improving visual performance and decreasing the mismatching rate. However, applying DLSMs on high-resolution satellite stereos with broad image coverage and wide terrain variety is still challenging. First, the broad coverage of satellite stereos brings a wide disparity range, while DLSMs are limited to a narrow disparity range in most cases, resulting in incorrect disparity estimation in areas with contradictory disparity ranges. Second, high-resolution satellite stereos always comprise various terrain types, which is more complicated than carefully prepared datasets. Thus, the performance of DLSMs on satellite stereos is unstable, especially for intractable regions such as texture-less and occluded regions. Third, generating DSMs requires occlusion-aware disparity maps, while traditional occlusion detection methods are not always applicable for DLSMs with continuous disparity. To tackle these problems, this paper proposes a novel DLSM-based DSM generation workflow. The workflow comprises three steps: pre-processing, disparity estimation and post-processing. The pre-processing step introduces low-resolution terrain to shift unmatched disparity ranges into a fixed scope and crops satellite stereos to regular patches. The disparity estimation step proposes a hybrid feature fusion network (HF<sup>2</sup>Net) to improve the matching performance. In detail, HF<sup>2</sup>Net designs a cross-scale feature extractor (CSF) and a multi-scale cost filter. The feature extractor differentiates structural-context features in complex scenes and thus enhances HF<sup>2</sup>Net's robustness to satellite stereos, especially on intractable regions. The cost filter filters out most matching errors to ensure accurate disparity estimation. The post-processing step generates initial DSM patches with estimated disparity maps and then refines them for the final large-scale DSMs. Primary experiments on the public US3D dataset showed better accuracy than state-of-the-art methods, indicating HF<sup>2</sup>Net's superiority. We then created a self-made Gaofen-7 dataset to train HF<sup>2</sup>Net and conducted DSM generation experiments on two Gaofen-7 stereos to further demonstrate the effectiveness and practical capability of the proposed workflow.","PeriodicalId":22881,"journal":{"name":"The Photogrammetric Record","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"The Photogrammetric Record","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1111/phor.12471","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Recent studies have demonstrated that deep learning-based stereo matching methods (DLSMs) can far exceed conventional ones on most benchmark datasets by both improving visual performance and decreasing the mismatching rate. However, applying DLSMs on high-resolution satellite stereos with broad image coverage and wide terrain variety is still challenging. First, the broad coverage of satellite stereos brings a wide disparity range, while DLSMs are limited to a narrow disparity range in most cases, resulting in incorrect disparity estimation in areas with contradictory disparity ranges. Second, high-resolution satellite stereos always comprise various terrain types, which is more complicated than carefully prepared datasets. Thus, the performance of DLSMs on satellite stereos is unstable, especially for intractable regions such as texture-less and occluded regions. Third, generating DSMs requires occlusion-aware disparity maps, while traditional occlusion detection methods are not always applicable for DLSMs with continuous disparity. To tackle these problems, this paper proposes a novel DLSM-based DSM generation workflow. The workflow comprises three steps: pre-processing, disparity estimation and post-processing. The pre-processing step introduces low-resolution terrain to shift unmatched disparity ranges into a fixed scope and crops satellite stereos to regular patches. The disparity estimation step proposes a hybrid feature fusion network (HF2Net) to improve the matching performance. In detail, HF2Net designs a cross-scale feature extractor (CSF) and a multi-scale cost filter. The feature extractor differentiates structural-context features in complex scenes and thus enhances HF2Net's robustness to satellite stereos, especially on intractable regions. The cost filter filters out most matching errors to ensure accurate disparity estimation. The post-processing step generates initial DSM patches with estimated disparity maps and then refines them for the final large-scale DSMs. Primary experiments on the public US3D dataset showed better accuracy than state-of-the-art methods, indicating HF2Net's superiority. We then created a self-made Gaofen-7 dataset to train HF2Net and conducted DSM generation experiments on two Gaofen-7 stereos to further demonstrate the effectiveness and practical capability of the proposed workflow.

Abstract Image

基于混合特征融合网络从高分辨率卫星立体图生成数字地表模型
最近的研究表明,基于深度学习的立体匹配方法(DLSMs)在大多数基准数据集上都能远远超过传统方法,既提高了视觉性能,又降低了不匹配率。然而,在图像覆盖面广、地形种类繁多的高分辨率卫星立体图像上应用 DLSM 仍然充满挑战。首先,卫星立体图像的广阔覆盖面带来了宽广的差异范围,而 DLSM 在大多数情况下仅限于较窄的差异范围,从而导致在差异范围相互矛盾的区域出现不正确的差异估计。其次,高分辨率卫星立体图像总是包含各种地形类型,比精心准备的数据集更加复杂。因此,DLSM 在卫星立体图像上的表现并不稳定,尤其是在无纹理和遮挡区域等难以处理的区域。第三,生成 DSM 需要闭塞感知的差异图,而传统的闭塞检测方法并不总是适用于连续差异的 DLSM。为了解决这些问题,本文提出了一种新颖的基于 DLSM 的 DSM 生成工作流程。该工作流程包括三个步骤:预处理、差异估计和后处理。预处理步骤引入低分辨率地形,将不匹配的差异范围转移到一个固定的范围内,并将卫星立体裁剪为规则的补丁。差异估计步骤提出了一种混合特征融合网络(HF2Net)来提高匹配性能。具体来说,HF2Net 设计了一个跨尺度特征提取器(CSF)和一个多尺度成本过滤器。特征提取器可区分复杂场景中的结构-上下文特征,从而增强 HF2Net 对卫星立体的鲁棒性,尤其是在难以处理的区域。成本滤波器能过滤掉大部分匹配误差,以确保准确的差异估计。后处理步骤生成带有估计差异图的初始 DSM 补丁,然后将其细化为最终的大尺度 DSM。在公开的 US3D 数据集上进行的初步实验表明,HF2Net 比最先进的方法具有更高的准确性,这表明了 HF2Net 的优越性。然后,我们创建了一个自制的高分七号数据集来训练 HF2Net,并在两个高分七号立体上进行了 DSM 生成实验,进一步证明了所提出的工作流程的有效性和实用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信