{"title":"用简单的注意力取代复杂的变压器,实现高光谱和多光谱图像融合","authors":"Kunpeng Mu , Wenqing Wang , Mingze Gao , Han Liu","doi":"10.1016/j.engappai.2025.111959","DOIUrl":null,"url":null,"abstract":"<div><div>The fusion of hyperspectral (HSI) and multispectral (MSI) images to obtain high-resolution hyperspectral images is crucial for hyperspectral image processing and interpretation. In recent years, the Transformer architecture is extensively utilized in the domain of HSI-MSI fusion, yielding promising results. However, the introduction of various modified Transformer architectures leads to increasingly complex network structures, which impose significant computational resource demands. To address this issue, this paper proposes a Mask and Cross-Attention Network (MCANet) for HSI and MSI fusion. The network comprises three components: masked feature extraction, channel and spatial attention cross-feature fusion, and multi-scale step-by-step reconstruction. Our network abandons the existing patchwork of various advanced attention mechanisms and instead employs only the most straightforward channel spectrum and spatial attention mechanisms. This approach allows us to thoroughly extract spectral and spatial features while minimizing the computational resources required by the model. We conduct experiments on 8 HSI datasets and compare them with state-of-the-art fusion methods. The Pavia Center, Pavia University, Washington DC, Salinas, Houston and Botswana datasets all achieve the best fusion images and metrics. In addition, efficiency experiments confirm that the proposed method saves a significant amount of computational resources while fusing high-quality images. 
The source code and pre-trained models are available at <span><span>https://github.com/xiaomudsg/MCANet</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":50523,"journal":{"name":"Engineering Applications of Artificial Intelligence","volume":"160 ","pages":"Article 111959"},"PeriodicalIF":8.0000,"publicationDate":"2025-08-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Replacing complex transformer with simple attention to achieve hyperspectral and multispectral image fusion\",\"authors\":\"Kunpeng Mu , Wenqing Wang , Mingze Gao , Han Liu\",\"doi\":\"10.1016/j.engappai.2025.111959\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><div>The fusion of hyperspectral (HSI) and multispectral (MSI) images to obtain high-resolution hyperspectral images is crucial for hyperspectral image processing and interpretation. In recent years, the Transformer architecture is extensively utilized in the domain of HSI-MSI fusion, yielding promising results. However, the introduction of various modified Transformer architectures leads to increasingly complex network structures, which impose significant computational resource demands. To address this issue, this paper proposes a Mask and Cross-Attention Network (MCANet) for HSI and MSI fusion. The network comprises three components: masked feature extraction, channel and spatial attention cross-feature fusion, and multi-scale step-by-step reconstruction. Our network abandons the existing patchwork of various advanced attention mechanisms and instead employs only the most straightforward channel spectrum and spatial attention mechanisms. This approach allows us to thoroughly extract spectral and spatial features while minimizing the computational resources required by the model. We conduct experiments on 8 HSI datasets and compare them with state-of-the-art fusion methods. 
The Pavia Center, Pavia University, Washington DC, Salinas, Houston and Botswana datasets all achieve the best fusion images and metrics. In addition, efficiency experiments confirm that the proposed method saves a significant amount of computational resources while fusing high-quality images. The source code and pre-trained models are available at <span><span>https://github.com/xiaomudsg/MCANet</span><svg><path></path></svg></span>.</div></div>\",\"PeriodicalId\":50523,\"journal\":{\"name\":\"Engineering Applications of Artificial Intelligence\",\"volume\":\"160 \",\"pages\":\"Article 111959\"},\"PeriodicalIF\":8.0000,\"publicationDate\":\"2025-08-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Engineering Applications of Artificial Intelligence\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S0952197625019670\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"AUTOMATION & CONTROL SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Engineering Applications of Artificial Intelligence","FirstCategoryId":"94","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0952197625019670","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AUTOMATION & CONTROL SYSTEMS","Score":null,"Total":0}
Replacing complex transformer with simple attention to achieve hyperspectral and multispectral image fusion
The fusion of hyperspectral (HSI) and multispectral (MSI) images to obtain high-resolution hyperspectral images is crucial for hyperspectral image processing and interpretation. In recent years, the Transformer architecture has been extensively utilized in the domain of HSI-MSI fusion, yielding promising results. However, the introduction of various modified Transformer architectures has led to increasingly complex network structures that impose significant computational resource demands. To address this issue, this paper proposes a Mask and Cross-Attention Network (MCANet) for HSI and MSI fusion. The network comprises three components: masked feature extraction, channel- and spatial-attention cross-feature fusion, and multi-scale step-by-step reconstruction. Our network abandons the prevailing patchwork of advanced attention mechanisms and instead employs only the simplest channel (spectral) and spatial attention mechanisms. This approach allows us to thoroughly extract spectral and spatial features while minimizing the computational resources required by the model. We conduct experiments on eight HSI datasets and compare the results with state-of-the-art fusion methods. On the Pavia Center, Pavia University, Washington DC, Salinas, Houston, and Botswana datasets, our method achieves the best fused images and metrics. In addition, efficiency experiments confirm that the proposed method saves a significant amount of computational resources while producing high-quality fused images. The source code and pre-trained models are available at https://github.com/xiaomudsg/MCANet.
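To make the "simple attention" idea concrete, the NumPy sketch below shows a textbook minimal form of channel and spatial attention applied to a feature cube: each channel is reweighted by a gate from its global average, and each spatial position by a gate from the channel-wise mean and max. This is a generic illustration of the mechanism family the abstract names, not the paper's exact MCANet implementation; the function names and gating choices here are assumptions for illustration.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def channel_attention(feat):
    """Reweight each spectral channel of a (C, H, W) cube by a gate
    derived from its global average response."""
    desc = feat.mean(axis=(1, 2))          # (C,) per-channel descriptor
    gate = sigmoid(desc)                   # (C,) gate values in (0, 1)
    return feat * gate[:, None, None]      # broadcast over spatial dims

def spatial_attention(feat):
    """Reweight each spatial position by a gate derived from the
    channel-wise mean and max maps."""
    avg = feat.mean(axis=0)                # (H, W) mean over channels
    mx = feat.max(axis=0)                  # (H, W) max over channels
    gate = sigmoid(avg + mx)               # (H, W) spatial gate
    return feat * gate[None, :, :]         # broadcast over channels

rng = np.random.default_rng(0)
hsi_feat = rng.standard_normal((8, 4, 4))  # toy 8-band, 4x4 feature cube
out = spatial_attention(channel_attention(hsi_feat))
print(out.shape)  # (8, 4, 4): attention preserves the feature shape
```

Because both gates are computed from simple pooled statistics, the cost is linear in the feature size, with no quadratic token-by-token similarity matrix of the kind a full Transformer self-attention layer would build.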
Journal Introduction:
Artificial Intelligence (AI) is pivotal in driving the fourth industrial revolution, witnessing remarkable advancements across various machine learning methodologies. AI techniques have become indispensable tools for practicing engineers, enabling them to tackle previously insurmountable challenges. Engineering Applications of Artificial Intelligence serves as a global platform for the swift dissemination of research elucidating the practical application of AI methods across all engineering disciplines. Submitted papers are expected to present novel aspects of AI utilized in real-world engineering applications, validated using publicly available datasets to ensure the replicability of research outcomes. Join us in exploring the transformative potential of AI in engineering.