Cycle Translation-Based Collaborative Training for Hyperspectral-RGB Multimodal Change Detection
Wenqian Dong; Junying Ren; Song Xiao; Leyuan Fang; Jiahui Qu; Yunsong Li
IEEE Transactions on Image Processing, vol. 34, pp. 6347-6360, published 2025-09-16
DOI: 10.1109/TIP.2025.3607609 (https://ieeexplore.ieee.org/document/11164958/)
Citations: 0
Abstract
Hyperspectral image change detection (HSI-CD) benefits from HSIs with continuous spectral bands, which uniquely enable the analysis of more subtle changes. Existing methods have achieved desirable performance by relying on multi-temporal homogeneous HSIs over the same region, which are generally difficult to obtain in real scenes. HSI-RGB multimodal CD overcomes the constraint of limited HSI availability by incorporating RGB data acquired at another temporal phase, and combining the advantages of the different modalities enhances the robustness of detection results. Nevertheless, due to the different imaging mechanisms of the two modalities, existing HSI-CD methods cannot be directly applied. In this paper, we propose a cycle translation-based collaborative training (co-training) method for HSI-RGB multimodal CD, which achieves cross-modal mutual guidance to collaboratively learn complementary difference information from diverse modalities for identifying changes. Specifically, a cross-modal-guided, CycleGAN-based image translation module is designed to implement bi-directional image translation, which mitigates the modality difference and enables the extraction of information related to land cover changes. Then, a spatial-spectral interactive co-training CD module is proposed to enable iterative interaction of cross-modal information, jointly extracting the multimodal difference features to generate the final results. The proposed method outperforms several leading CD methods in extensive experiments carried out on both real and synthetic datasets. In addition, a new public HSI-RGB multimodal dataset and our code are available at https://github.com/Jiahuiqu/CT2Net
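To make the high-level pipeline in the abstract concrete, the following is a minimal PyTorch sketch of the two ideas it describes: CycleGAN-style bidirectional HSI-RGB translation to bridge the modality gap, and difference features computed in each modality's space for change detection. All class names, layer sizes, the band count, and the loss composition are illustrative assumptions and not the authors' implementation (which is available in the linked repository); the adversarial losses and the iterative co-training exchange are omitted for brevity.

```python
# Illustrative sketch only: bidirectional HSI<->RGB translation plus a simple
# difference-based change head. Names and dimensions are assumptions.
import torch
import torch.nn as nn

class Translator(nn.Module):
    """Small encoder-decoder mapping one modality to the other (e.g., HSI -> RGB)."""
    def __init__(self, in_bands: int, out_bands: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(in_bands, 64, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, 64, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(64, out_bands, 3, padding=1),
        )

    def forward(self, x):
        return self.net(x)

class ChangeHead(nn.Module):
    """Maps per-pixel difference features to a binary change probability map."""
    def __init__(self, bands: int):
        super().__init__()
        self.net = nn.Sequential(
            nn.Conv2d(bands, 32, 3, padding=1), nn.ReLU(inplace=True),
            nn.Conv2d(32, 1, 1),
        )

    def forward(self, diff):
        return torch.sigmoid(self.net(diff))

# Illustrative inputs: a T1 HSI with 100 bands and a T2 RGB image, 64x64 pixels.
hsi_bands, rgb_bands = 100, 3
hsi_t1 = torch.randn(1, hsi_bands, 64, 64)
rgb_t2 = torch.randn(1, rgb_bands, 64, 64)

g_hsi2rgb = Translator(hsi_bands, rgb_bands)   # G: HSI -> RGB
g_rgb2hsi = Translator(rgb_bands, hsi_bands)   # F: RGB -> HSI

# Cycle-consistency terms (the CycleGAN adversarial losses are omitted here).
cycle_loss = (
    nn.functional.l1_loss(g_rgb2hsi(g_hsi2rgb(hsi_t1)), hsi_t1)
    + nn.functional.l1_loss(g_hsi2rgb(g_rgb2hsi(rgb_t2)), rgb_t2)
)

# Change cues computed in each modality's space; a real co-training scheme would
# iteratively exchange information between the two branches rather than average.
diff_rgb = torch.abs(g_hsi2rgb(hsi_t1) - rgb_t2)   # difference in RGB space
diff_hsi = torch.abs(hsi_t1 - g_rgb2hsi(rgb_t2))   # difference in HSI space
change_map = 0.5 * (ChangeHead(rgb_bands)(diff_rgb) + ChangeHead(hsi_bands)(diff_hsi))

print(cycle_loss.item(), change_map.shape)  # -> scalar loss, torch.Size([1, 1, 64, 64])
```

The averaging of the two per-modality change maps stands in for the paper's spatial-spectral interactive co-training module, which instead iterates between the branches so that each modality guides the other.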