{"title":"A Multiscale Framework With Unsupervised Learning for Remote Sensing Image Registration","authors":"Yuanxin Ye;Tengfeng Tang;Bai Zhu;Chao Yang;Bo Li;Siyuan Hao","doi":"10.1109/TGRS.2022.3167644","DOIUrl":null,"url":null,"abstract":"Registration for multisensor or multimodal image pairs with a large degree of distortions is a fundamental task for many remote sensing applications. To achieve accurate and low-cost remote sensing image registration, we propose a multiscale framework with unsupervised learning, named MU-Net. Without costly ground truth labels, MU-Net directly learns the end-to-end mapping from the image pairs to their transformation parameters. MU-Net stacks several deep neural network (DNN) models on multiple scales to generate a coarse-to-fine registration pipeline, which prevents the backpropagation from falling into a local extremum and resists significant image distortions. We design a novel loss function paradigm based on structural similarity, which makes MU-Net suitable for various types of multimodal images. MU-Net is compared with traditional feature-based and area-based methods, as well as supervised and other unsupervised learning methods on the optical-optical, optical-infrared, optical-synthetic aperture radar (SAR), and optical-map datasets. Experimental results show that MU-Net achieves more comprehensive and accurate registration performance between these image pairs with geometric and radiometric distortions. We share the code implemented by Pytorch at \n<uri>https://github.com/yeyuanxin110/MU-Net</uri>\n.","PeriodicalId":13213,"journal":{"name":"IEEE Transactions on Geoscience and Remote Sensing","volume":"60 ","pages":"1-15"},"PeriodicalIF":8.6000,"publicationDate":"2022-04-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Geoscience and Remote Sensing","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/9758703/","RegionNum":1,"RegionCategory":"地球科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}
Citations: 14
Abstract
Registration of multisensor or multimodal image pairs with large distortions is a fundamental task for many remote sensing applications. To achieve accurate and low-cost remote sensing image registration, we propose a multiscale framework with unsupervised learning, named MU-Net. Without costly ground-truth labels, MU-Net directly learns the end-to-end mapping from image pairs to their transformation parameters. MU-Net stacks several deep neural network (DNN) models at multiple scales to form a coarse-to-fine registration pipeline, which prevents backpropagation from falling into a local extremum and resists significant image distortions. We design a novel loss function paradigm based on structural similarity, which makes MU-Net suitable for various types of multimodal images. MU-Net is compared with traditional feature-based and area-based methods, as well as supervised and other unsupervised learning methods, on optical-optical, optical-infrared, optical-synthetic aperture radar (SAR), and optical-map datasets. Experimental results show that MU-Net achieves more comprehensive and accurate registration on image pairs with geometric and radiometric distortions. We share the code, implemented in PyTorch, at https://github.com/yeyuanxin110/MU-Net.
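
To make the abstract's pipeline concrete, below is a minimal PyTorch sketch of a coarse-to-fine registration scheme trained with a structural-similarity loss. This is not the authors' MU-Net (the official code is in the repository linked above): the regression network, the three-scale pyramid, the affine transformation model, and the 3x3-window SSIM loss are all illustrative assumptions.

```python
# A minimal sketch, assuming single-channel (grayscale) image pairs and an
# affine transformation model. Not the authors' implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class AffineRegressor(nn.Module):
    """Predicts a 2x3 affine matrix from a concatenated fixed/moving pair."""
    def __init__(self):
        super().__init__()
        self.features = nn.Sequential(
            nn.Conv2d(2, 32, 3, stride=2, padding=1), nn.ReLU(),
            nn.Conv2d(32, 64, 3, stride=2, padding=1), nn.ReLU(),
            nn.AdaptiveAvgPool2d(4),
        )
        self.head = nn.Linear(64 * 4 * 4, 6)
        # Initialize to the identity transform so training starts
        # from "no deformation".
        nn.init.zeros_(self.head.weight)
        self.head.bias.data = torch.tensor([1., 0., 0., 0., 1., 0.])

    def forward(self, fixed, moving):
        x = self.features(torch.cat([fixed, moving], dim=1))
        return self.head(x.flatten(1)).view(-1, 2, 3)

def warp(img, theta):
    """Apply a predicted affine transform via differentiable resampling."""
    grid = F.affine_grid(theta, img.shape, align_corners=False)
    return F.grid_sample(img, grid, align_corners=False)

def ssim_loss(x, y, c1=0.01**2, c2=0.03**2):
    """1 - mean local SSIM, with 3x3 average pooling as the local window."""
    mu_x, mu_y = F.avg_pool2d(x, 3, 1, 1), F.avg_pool2d(y, 3, 1, 1)
    var_x = F.avg_pool2d(x * x, 3, 1, 1) - mu_x ** 2
    var_y = F.avg_pool2d(y * y, 3, 1, 1) - mu_y ** 2
    cov = F.avg_pool2d(x * y, 3, 1, 1) - mu_x * mu_y
    ssim = ((2 * mu_x * mu_y + c1) * (2 * cov + c2)) / \
           ((mu_x ** 2 + mu_y ** 2 + c1) * (var_x + var_y + c2))
    return 1 - ssim.clamp(0, 1).mean()

def coarse_to_fine(fixed, moving, nets, scales=(0.25, 0.5, 1.0)):
    """One regressor per scale; each stage refines the previous warp."""
    warped, losses = moving, []
    for net, s in zip(nets, scales):
        f = F.interpolate(fixed, scale_factor=s, mode='bilinear',
                          align_corners=False)
        m = F.interpolate(warped, scale_factor=s, mode='bilinear',
                          align_corners=False)
        theta = net(f, m)
        warped = warp(warped, theta)  # compound refinement at full resolution
        w = F.interpolate(warped, scale_factor=s, mode='bilinear',
                          align_corners=False)
        losses.append(ssim_loss(w, f))
    return warped, sum(losses)

# Example usage (hypothetical shapes):
# nets = nn.ModuleList(AffineRegressor() for _ in range(3))
# fixed, moving = torch.rand(1, 1, 256, 256), torch.rand(1, 1, 256, 256)
# registered, loss = coarse_to_fine(fixed, moving, nets)
# loss.backward()
```

Two design points carry the coarse-to-fine idea: each regressor is initialized to the identity transform so early stages cannot destabilize later ones, and each stage warps the output of the previous stage, so coarse scales absorb large misalignments and finer scales only correct the residual.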
About the Journal
IEEE Transactions on Geoscience and Remote Sensing (TGRS) is a monthly publication that focuses on the theory, concepts, and techniques of science and engineering as applied to sensing the land, oceans, atmosphere, and space; and the processing, interpretation, and dissemination of this information.