Cross-Modal Collaborative Evolution Reinforced by Semantic Coupling for Image Registration and Fusion

IF 5.6 2区工程技术 Q1 ENGINEERING, ELECTRICAL & ELECTRONIC

IEEE Transactions on Instrumentation and Measurement Pub Date : 2024-11-13 DOI:10.1109/TIM.2024.3497157

Yan Xiong;Jun Kong;Yunde Zhang;Ming Lu;Min Jiang

{"title":"Cross-Modal Collaborative Evolution Reinforced by Semantic Coupling for Image Registration and Fusion","authors":"Yan Xiong;Jun Kong;Yunde Zhang;Ming Lu;Min Jiang","doi":"10.1109/TIM.2024.3497157","DOIUrl":null,"url":null,"abstract":"Joint image registration and fusion aim to align and integrate source images to generate an image with salient targets and rich texture details. Current methods pursue spatially optimal deformation fields. However, these methods often overlook local semantic alignment, leading to exacerbated heterogeneity in cascaded fusion and vision tasks. To address these issues, we propose a collaborative evolution network reinforced by semantic coupling for image registration and fusion, named CE-SCNet. First, to correct spatial misalignments, we design a multiscale deformation estimator (MSDE). This module is to estimate spatial deformation fields by modeling global relationships across multiple scales. Second, to further enhance semantic alignment and mitigate heterogeneity, we design the semantic interaction module (SIM). This module is to integrate contextual information within the semantic domain for feature coupling. Third, to reconstruct images with high visual perception, we design the feature discrimination module (FDM) and the detail awareness module (DAM). Both modules are to capture texture information from multiple perspectives. Finally, to optimize the joint paradigm, we construct a multilabel semantic loss. Extensive experimental validations have shown that CE-SCNet significantly outperforms state-of-the-art methods in alleviating semantic misalignments. The semantic segmentation experiments demonstrate that CE-SCNet can adapt to the semantic demands of high-level vision tasks.","PeriodicalId":13341,"journal":{"name":"IEEE Transactions on Instrumentation and Measurement","volume":"74 ","pages":"1-12"},"PeriodicalIF":5.6000,"publicationDate":"2024-11-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Instrumentation and Measurement","FirstCategoryId":"5","ListUrlMain":"https://ieeexplore.ieee.org/document/10752656/","RegionNum":2,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, ELECTRICAL & ELECTRONIC","Score":null,"Total":0}

引用次数: 0

Abstract

Joint image registration and fusion aim to align and integrate source images to generate an image with salient targets and rich texture details. Current methods pursue spatially optimal deformation fields. However, these methods often overlook local semantic alignment, leading to exacerbated heterogeneity in cascaded fusion and vision tasks. To address these issues, we propose a collaborative evolution network reinforced by semantic coupling for image registration and fusion, named CE-SCNet. First, to correct spatial misalignments, we design a multiscale deformation estimator (MSDE). This module is to estimate spatial deformation fields by modeling global relationships across multiple scales. Second, to further enhance semantic alignment and mitigate heterogeneity, we design the semantic interaction module (SIM). This module is to integrate contextual information within the semantic domain for feature coupling. Third, to reconstruct images with high visual perception, we design the feature discrimination module (FDM) and the detail awareness module (DAM). Both modules are to capture texture information from multiple perspectives. Finally, to optimize the joint paradigm, we construct a multilabel semantic loss. Extensive experimental validations have shown that CE-SCNet significantly outperforms state-of-the-art methods in alleviating semantic misalignments. The semantic segmentation experiments demonstrate that CE-SCNet can adapt to the semantic demands of high-level vision tasks.

查看原文本刊更多论文

通过语义耦合强化跨模态协作进化，实现图像注册与融合

联合图像配准和融合旨在对齐和整合源图像，生成具有突出目标和丰富纹理细节的图像。目前的方法追求空间最佳变形场。然而，这些方法往往忽略了局部语义对齐，导致级联融合和视觉任务中的异质性加剧。为了解决这些问题，我们提出了一种通过语义耦合强化图像配准和融合的协作进化网络，命名为 CE-SCNet。首先，为了纠正空间错位，我们设计了一个多尺度形变估算器（MSDE）。该模块通过模拟多个尺度的全局关系来估计空间形变场。其次，为了进一步加强语义对齐并减少异质性，我们设计了语义交互模块（SIM）。该模块用于整合语义域内的上下文信息，以实现特征耦合。第三，为了重建视觉感知度高的图像，我们设计了特征识别模块（FDM）和细节感知模块（DAM）。这两个模块从多个角度捕捉纹理信息。最后，为了优化联合范式，我们构建了多标签语义损失。广泛的实验验证表明，CE-SCNet 在缓解语义错位方面明显优于最先进的方法。语义分割实验证明，CE-SCNet 能够适应高级视觉任务的语义需求。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

IEEE Transactions on Instrumentation and Measurement 工程技术-工程：电子与电气

CiteScore

9.00

自引率

23.20%

发文量

1294

审稿时长

3.9 months

期刊介绍： Papers are sought that address innovative solutions to the development and use of electrical and electronic instruments and equipment to measure, monitor and/or record physical phenomena for the purpose of advancing measurement science, methods, functionality and applications. The scope of these papers may encompass: (1) theory, methodology, and practice of measurement; (2) design, development and evaluation of instrumentation and measurement systems and components used in generating, acquiring, conditioning and processing signals; (3) analysis, representation, display, and preservation of the information obtained from a set of measurements; and (4) scientific and technical support to establishment and maintenance of technical standards in the field of Instrumentation and Measurement.