Multispectral image reconstruction from RGB image for maize growth status monitoring based on window-adaptive spatial-spectral attention transformer

IF 8.9 1区 农林科学 Q1 AGRICULTURE, MULTIDISCIPLINARY
Di Song , Hong Sun , Esther Ngumbi , Mohammed Kamruzzaman
{"title":"Multispectral image reconstruction from RGB image for maize growth status monitoring based on window-adaptive spatial-spectral attention transformer","authors":"Di Song ,&nbsp;Hong Sun ,&nbsp;Esther Ngumbi ,&nbsp;Mohammed Kamruzzaman","doi":"10.1016/j.compag.2025.111062","DOIUrl":null,"url":null,"abstract":"<div><div>Multispectral image analysis is an effective way to detect crop growth status. However, the complexity of manufacturing process and technology of multispectral image acquisition equipment make data acquisition expensive. Therefore, a method based on a window-adaptive spatial-spectral attention transformer is proposed to reconstruct multispectral images using RGB images of maize. First, RGB and hyperspectral images of the maize are obtained, and the reflectance data from classic and preferred band combinations are extracted from the hyperspectral image. Then, a transformer model is constructed to evaluate and compare the reconstruction efficacy of the 5-band and 10-band combinations across four attention modes: spatial, spectral, spatial-spectral, and window-adaptive spatial-spectral attention. The best-performing reconstruction results are selected and compared with the original data from three perspectives: image, spectrum, and model effect. The 10-band multispectral image reconstructed by the window-adaptive spatial-spectral attention mechanism is highly similar to the original image, with a reflectance correlation exceeding 0.99. Furthermore, its application in monitoring crop growth status (i.e., maize chlorophyll) yields results closely aligned with actual reflectance data: R<sub>C</sub><sup>2</sup> is 0.76, R<sub>V</sub><sup>2</sup> is 0.64, while RMSE<sub>C</sub> and RMSE<sub>V</sub> are 3.63 mg/L and 2.94 mg/L, respectively. To further explore the model performance, the new sensitive bands are selected to be reconstructed in the maize V7 stage. The results from the chlorophyll content prediction model are as: R<sub>C</sub><sup>2</sup> is 0.64, R<sub>V</sub><sup>2</sup> is 0.60, with RMSE<sub>C</sub> and RMSE<sub>V</sub> are 5.61 mg/L and 5.62 mg/L, respectively. Therefore, the window-adaptive spatial-spectral attention transformer can accurately reconstruct multispectral images and establish precise growth status monitoring models, providing technical support for low-cost field maize growth detection.</div></div>","PeriodicalId":50627,"journal":{"name":"Computers and Electronics in Agriculture","volume":"239 ","pages":"Article 111062"},"PeriodicalIF":8.9000,"publicationDate":"2025-10-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers and Electronics in Agriculture","FirstCategoryId":"97","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0168169925011688","RegionNum":1,"RegionCategory":"农林科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"AGRICULTURE, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Multispectral image analysis is an effective way to detect crop growth status. However, the complexity of manufacturing process and technology of multispectral image acquisition equipment make data acquisition expensive. Therefore, a method based on a window-adaptive spatial-spectral attention transformer is proposed to reconstruct multispectral images using RGB images of maize. First, RGB and hyperspectral images of the maize are obtained, and the reflectance data from classic and preferred band combinations are extracted from the hyperspectral image. Then, a transformer model is constructed to evaluate and compare the reconstruction efficacy of the 5-band and 10-band combinations across four attention modes: spatial, spectral, spatial-spectral, and window-adaptive spatial-spectral attention. The best-performing reconstruction results are selected and compared with the original data from three perspectives: image, spectrum, and model effect. The 10-band multispectral image reconstructed by the window-adaptive spatial-spectral attention mechanism is highly similar to the original image, with a reflectance correlation exceeding 0.99. Furthermore, its application in monitoring crop growth status (i.e., maize chlorophyll) yields results closely aligned with actual reflectance data: RC2 is 0.76, RV2 is 0.64, while RMSEC and RMSEV are 3.63 mg/L and 2.94 mg/L, respectively. To further explore the model performance, the new sensitive bands are selected to be reconstructed in the maize V7 stage. The results from the chlorophyll content prediction model are as: RC2 is 0.64, RV2 is 0.60, with RMSEC and RMSEV are 5.61 mg/L and 5.62 mg/L, respectively. Therefore, the window-adaptive spatial-spectral attention transformer can accurately reconstruct multispectral images and establish precise growth status monitoring models, providing technical support for low-cost field maize growth detection.
基于窗口自适应空间-光谱注意力转换器的RGB图像多光谱重建玉米生长状态监测
多光谱图像分析是检测作物生长状况的有效手段。然而,多光谱图像采集设备的制造工艺和技术的复杂性使得数据采集成本高昂。为此,提出了一种基于窗口自适应空间光谱注意力转换器的玉米RGB多光谱图像重构方法。首先,获取玉米的RGB和高光谱图像,并从高光谱图像中提取经典波段和优选波段组合的反射率数据;在此基础上,构建了一个变压器模型,对5波段和10波段组合在空间、光谱、空间-光谱和窗口自适应空间-光谱4种注意模式下的重建效果进行了评价和比较。从图像、光谱和模型效应三个角度选择表现最好的重建结果与原始数据进行比较。利用窗自适应空间-光谱注意机制重建的10波段多光谱图像与原始图像高度相似,反射率相关系数超过0.99。此外,将其应用于作物生长状态(即玉米叶绿素)监测,其结果与实际反射率数据非常接近:RC2为0.76,RV2为0.64,RMSEC和RMSEV分别为3.63 mg/L和2.94 mg/L。为了进一步探索模型的性能,选择新的敏感波段在玉米V7期进行重建。叶绿素含量预测模型结果为:RC2为0.64,RV2为0.60,RMSEC和RMSEV分别为5.61 mg/L和5.62 mg/L。因此,窗口自适应空间光谱注意力转换器可以精确地重建多光谱图像,建立精确的生长状态监测模型,为低成本的田间玉米生长检测提供技术支持。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers and Electronics in Agriculture
Computers and Electronics in Agriculture 工程技术-计算机:跨学科应用
CiteScore
15.30
自引率
14.50%
发文量
800
审稿时长
62 days
期刊介绍: Computers and Electronics in Agriculture provides international coverage of advancements in computer hardware, software, electronic instrumentation, and control systems applied to agricultural challenges. Encompassing agronomy, horticulture, forestry, aquaculture, and animal farming, the journal publishes original papers, reviews, and applications notes. It explores the use of computers and electronics in plant or animal agricultural production, covering topics like agricultural soils, water, pests, controlled environments, and waste. The scope extends to on-farm post-harvest operations and relevant technologies, including artificial intelligence, sensors, machine vision, robotics, networking, and simulation modeling. Its companion journal, Smart Agricultural Technology, continues the focus on smart applications in production agriculture.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信