Two-Phase Multimodal Image Fusion Using Convolutional Neural Networks

2021 IEEE International Conference on Image Processing (ICIP) Pub Date : 2021-09-19 DOI:10.1109/ICIP42928.2021.9506703

Kushal Kusram, S. Transue, Min-Hyung Choi

引用次数: 0

Abstract

The fusion of multiple imaging modalities presents an important contribution to machine vision, but remains an ongoing challenge due to the limitations in traditional calibration methods that perform a single, global alignment. For depth and thermal imaging devices, sensor and lens intrinsics (FOV, resolution, etc.) may vary considerably, making per-pixel fusion accuracy difficult. In this paper, we present AccuFusion, a two-phase non-linear registration method to fuse multimodal images at a per-pixel level to obtain an efficient and accurate image registration. The two phases: the Coarse Fusion Network (CFN) and Refining Fusion Network (RFN), are designed to learn a robust image-space fusion that provides a non-linear mapping for accurate alignment. By employing the refinement process, we obtain per-pixel displacements to minimize local alignment errors and observe an increase of 18% in average accuracy over global registration.

查看原文本刊更多论文

基于卷积神经网络的两相多模态图像融合

多种成像模式的融合对机器视觉做出了重要贡献，但由于传统校准方法执行单一全局校准的局限性，仍然是一个持续的挑战。对于深度和热成像设备，传感器和透镜的特性(视场、分辨率等)可能会有很大的差异，这使得每像素的融合精度变得困难。在本文中，我们提出了AccuFusion，一种两阶段非线性配准方法，用于在每像素级别融合多模态图像，以获得高效准确的图像配准。这两个阶段:粗融合网络(CFN)和精炼融合网络(RFN)，旨在学习一种鲁棒的图像空间融合，为精确对准提供非线性映射。通过采用细化过程，我们获得了每像素的位移，以最小化局部对准误差，并观察到平均精度比全局配准提高了18%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2021 IEEE International Conference on Image Processing (ICIP)

自引率

0.00%

发文量