基于多关注交互和多尺度融合网络的RGB图像光谱重建

IF 1.2 3区 工程技术 Q4 CHEMISTRY, APPLIED
Suyu Wang, Lihao Xu
{"title":"基于多关注交互和多尺度融合网络的RGB图像光谱重建","authors":"Suyu Wang,&nbsp;Lihao Xu","doi":"10.1002/col.22979","DOIUrl":null,"url":null,"abstract":"<div>\n \n <p>In the present era, hyperspectral images have become a pervasive tool in a multitude of fields. In order to provide a feasible alternative for scenarios where hyperspectral imaging equipment is not accessible, numerous researchers have endeavored to reconstruct hyperspectral information from limited spectral measurements, leading to the development of spectral reconstruction (SR) algorithms that primarily focus on the visible spectrum. In light of the remarkable advancements achieved in many computer vision tasks through the application of deep learning, an increasing number of SR works aim to leverage deeper and wider convolutional neural networks (CNNs) to learn intricate mappings of SR. However, the majority of deep learning methods tend to neglect the design of initial up-sampling when constructing networks. While some methods introduce innovative attention mechanisms, their transferability is limited, impeding further improvement in SR accuracy. To address these issues, we propose a multi-attention interaction and multi-scale fusion network (MAMSN) for SR. It employs a shunt-confluence multi-branch architecture to learn multi-scale information in images. Furthermore, we have devised a separable enhanced up-sampling (SEU) module, situated at the network head, which processes spatial and channel information separately to produce more refined initial up-sampling results. To fully extract features at different scales for visible-spectrum spectral reconstruction, we introduce an adaptive enhanced channel attention (AECA) mechanism and a joint complementary multi-head self-attention (JCMS) mechanism, which are combined into a more powerful feature extraction module, the dual residual double attention block (DRDAB), through a dual residual structure. The experimental results show that the proposed MAMSN network outperforms other SR methods in overall performance, particularly in quantitative metrics and perceptual quality.</p>\n </div>","PeriodicalId":10459,"journal":{"name":"Color Research and Application","volume":"50 4","pages":"388-402"},"PeriodicalIF":1.2000,"publicationDate":"2025-02-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MAMSN: Multi-Attention Interaction and Multi-Scale Fusion Network for Spectral Reconstruction From RGB Images\",\"authors\":\"Suyu Wang,&nbsp;Lihao Xu\",\"doi\":\"10.1002/col.22979\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n <p>In the present era, hyperspectral images have become a pervasive tool in a multitude of fields. In order to provide a feasible alternative for scenarios where hyperspectral imaging equipment is not accessible, numerous researchers have endeavored to reconstruct hyperspectral information from limited spectral measurements, leading to the development of spectral reconstruction (SR) algorithms that primarily focus on the visible spectrum. In light of the remarkable advancements achieved in many computer vision tasks through the application of deep learning, an increasing number of SR works aim to leverage deeper and wider convolutional neural networks (CNNs) to learn intricate mappings of SR. However, the majority of deep learning methods tend to neglect the design of initial up-sampling when constructing networks. While some methods introduce innovative attention mechanisms, their transferability is limited, impeding further improvement in SR accuracy. To address these issues, we propose a multi-attention interaction and multi-scale fusion network (MAMSN) for SR. It employs a shunt-confluence multi-branch architecture to learn multi-scale information in images. Furthermore, we have devised a separable enhanced up-sampling (SEU) module, situated at the network head, which processes spatial and channel information separately to produce more refined initial up-sampling results. To fully extract features at different scales for visible-spectrum spectral reconstruction, we introduce an adaptive enhanced channel attention (AECA) mechanism and a joint complementary multi-head self-attention (JCMS) mechanism, which are combined into a more powerful feature extraction module, the dual residual double attention block (DRDAB), through a dual residual structure. The experimental results show that the proposed MAMSN network outperforms other SR methods in overall performance, particularly in quantitative metrics and perceptual quality.</p>\\n </div>\",\"PeriodicalId\":10459,\"journal\":{\"name\":\"Color Research and Application\",\"volume\":\"50 4\",\"pages\":\"388-402\"},\"PeriodicalIF\":1.2000,\"publicationDate\":\"2025-02-28\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Color Research and Application\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/col.22979\",\"RegionNum\":3,\"RegionCategory\":\"工程技术\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"CHEMISTRY, APPLIED\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Color Research and Application","FirstCategoryId":"5","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/col.22979","RegionNum":3,"RegionCategory":"工程技术","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"CHEMISTRY, APPLIED","Score":null,"Total":0}
引用次数: 0

摘要

在当今时代,高光谱图像已成为众多领域的普遍工具。为了在无法获得高光谱成像设备的情况下提供可行的替代方案,许多研究人员努力从有限的光谱测量中重建高光谱信息,从而导致了主要关注可见光谱的光谱重建(SR)算法的发展。鉴于深度学习在许多计算机视觉任务中取得的显著进步,越来越多的SR工作旨在利用更深更广的卷积神经网络(cnn)来学习复杂的SR映射。然而,大多数深度学习方法在构建网络时往往忽略了初始上采样的设计。虽然一些方法引入了创新的注意力机制,但它们的可转移性有限,阻碍了SR精度的进一步提高。为了解决这些问题,我们提出了一种多注意力交互和多尺度融合网络(MAMSN),它采用分流融合的多分支架构来学习图像中的多尺度信息。此外,我们设计了一个可分离的增强上采样(SEU)模块,位于网络头部,它分别处理空间和信道信息,以产生更精细的初始上采样结果。为了充分提取不同尺度的特征进行可见光谱重建,我们引入了自适应增强通道注意(AECA)机制和联合互补多头自注意(JCMS)机制,并通过双残差结构将其组合成一个更强大的特征提取模块——双残差双注意块(DRDAB)。实验结果表明,所提出的MAMSN网络在整体性能上优于其他SR方法,特别是在定量指标和感知质量方面。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
MAMSN: Multi-Attention Interaction and Multi-Scale Fusion Network for Spectral Reconstruction From RGB Images

In the present era, hyperspectral images have become a pervasive tool in a multitude of fields. In order to provide a feasible alternative for scenarios where hyperspectral imaging equipment is not accessible, numerous researchers have endeavored to reconstruct hyperspectral information from limited spectral measurements, leading to the development of spectral reconstruction (SR) algorithms that primarily focus on the visible spectrum. In light of the remarkable advancements achieved in many computer vision tasks through the application of deep learning, an increasing number of SR works aim to leverage deeper and wider convolutional neural networks (CNNs) to learn intricate mappings of SR. However, the majority of deep learning methods tend to neglect the design of initial up-sampling when constructing networks. While some methods introduce innovative attention mechanisms, their transferability is limited, impeding further improvement in SR accuracy. To address these issues, we propose a multi-attention interaction and multi-scale fusion network (MAMSN) for SR. It employs a shunt-confluence multi-branch architecture to learn multi-scale information in images. Furthermore, we have devised a separable enhanced up-sampling (SEU) module, situated at the network head, which processes spatial and channel information separately to produce more refined initial up-sampling results. To fully extract features at different scales for visible-spectrum spectral reconstruction, we introduce an adaptive enhanced channel attention (AECA) mechanism and a joint complementary multi-head self-attention (JCMS) mechanism, which are combined into a more powerful feature extraction module, the dual residual double attention block (DRDAB), through a dual residual structure. The experimental results show that the proposed MAMSN network outperforms other SR methods in overall performance, particularly in quantitative metrics and perceptual quality.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Color Research and Application
Color Research and Application 工程技术-工程:化工
CiteScore
3.70
自引率
7.10%
发文量
62
审稿时长
>12 weeks
期刊介绍: Color Research and Application provides a forum for the publication of peer-reviewed research reviews, original research articles, and editorials of the highest quality on the science, technology, and application of color in multiple disciplines. Due to the highly interdisciplinary influence of color, the readership of the journal is similarly widespread and includes those in business, art, design, education, as well as various industries.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信