Semantic segmentation of camouflage objects via fusing reconstructed multispectral and RGB images

IF 5.9 Q1 ENGINEERING, MULTIDISCIPLINARY
Feng Huang , Gonghan Yang , Jing Chen , Yixuan Xu , Jingze Su , Guimin Huang , Shu Wang , Wenxi Liu
{"title":"Semantic segmentation of camouflage objects via fusing reconstructed multispectral and RGB images","authors":"Feng Huang ,&nbsp;Gonghan Yang ,&nbsp;Jing Chen ,&nbsp;Yixuan Xu ,&nbsp;Jingze Su ,&nbsp;Guimin Huang ,&nbsp;Shu Wang ,&nbsp;Wenxi Liu","doi":"10.1016/j.dt.2025.04.006","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate segmentation of camouflage objects in aerial imagery is vital for improving the efficiency of UAV-based reconnaissance and rescue missions. However, camouflage object segmentation is increasingly challenging due to advances in both camouflage materials and biological mimicry. Although multispectral-RGB based technology shows promise, conventional dual-aperture multispectral-RGB imaging systems are constrained by imprecise and time-consuming registration and fusion across different modalities, limiting their performance. Here, we propose the Reconstructed Multispectral-RGB Fusion Network (RMRF-Net), which reconstructs RGB images into multispectral ones, enabling efficient multimodal segmentation using only an RGB camera. Specifically, RMRF-Net employs a divergent-similarity feature correction strategy to minimize reconstruction errors and includes an efficient boundary-aware decoder to enhance object contours. Notably, we establish the first real-world aerial multispectral-RGB semantic segmentation of camouflage objects dataset, including 11 object categories. Experimental results demonstrate that RMRF-Net outperforms existing methods, achieving 17.38 FPS on the NVIDIA Jetson AGX Orin, with only a 0.96% drop in mIoU compared to the RTX 3090, showing its practical applicability in multimodal remote sensing.</div></div>","PeriodicalId":58209,"journal":{"name":"Defence Technology(防务技术)","volume":"50 ","pages":"Pages 324-337"},"PeriodicalIF":5.9000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Defence Technology(防务技术)","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214914725001229","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0

Abstract

Accurate segmentation of camouflage objects in aerial imagery is vital for improving the efficiency of UAV-based reconnaissance and rescue missions. However, camouflage object segmentation is increasingly challenging due to advances in both camouflage materials and biological mimicry. Although multispectral-RGB based technology shows promise, conventional dual-aperture multispectral-RGB imaging systems are constrained by imprecise and time-consuming registration and fusion across different modalities, limiting their performance. Here, we propose the Reconstructed Multispectral-RGB Fusion Network (RMRF-Net), which reconstructs RGB images into multispectral ones, enabling efficient multimodal segmentation using only an RGB camera. Specifically, RMRF-Net employs a divergent-similarity feature correction strategy to minimize reconstruction errors and includes an efficient boundary-aware decoder to enhance object contours. Notably, we establish the first real-world aerial multispectral-RGB semantic segmentation of camouflage objects dataset, including 11 object categories. Experimental results demonstrate that RMRF-Net outperforms existing methods, achieving 17.38 FPS on the NVIDIA Jetson AGX Orin, with only a 0.96% drop in mIoU compared to the RTX 3090, showing its practical applicability in multimodal remote sensing.
基于重构多光谱和RGB图像融合的伪装目标语义分割
航拍图像中伪装目标的准确分割对于提高无人机侦察救援任务的效率至关重要。然而,由于伪装材料和生物模拟技术的进步,伪装目标分割越来越具有挑战性。尽管基于多光谱rgb的技术显示出前景,但传统的双孔径多光谱rgb成像系统受到不精确和耗时的限制,限制了它们的性能。在此,我们提出了重构多光谱-RGB融合网络(RMRF-Net),它将RGB图像重建为多光谱图像,仅使用RGB相机即可实现高效的多模态分割。具体来说,RMRF-Net采用发散相似特征校正策略来最大限度地减少重建误差,并包括一个有效的边界感知解码器来增强目标轮廓。值得注意的是,我们建立了第一个真实世界的空中多光谱rgb伪装目标数据集,包括11个目标类别。实验结果表明,RMRF-Net优于现有方法,在NVIDIA Jetson AGX Orin上实现了17.38 FPS,与RTX 3090相比mIoU仅下降了0.96%,显示了其在多模态遥感中的实际适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Defence Technology(防务技术)
Defence Technology(防务技术) Mechanical Engineering, Control and Systems Engineering, Industrial and Manufacturing Engineering
CiteScore
8.70
自引率
0.00%
发文量
728
审稿时长
25 days
期刊介绍: Defence Technology, a peer reviewed journal, is published monthly and aims to become the best international academic exchange platform for the research related to defence technology. It publishes original research papers having direct bearing on defence, with a balanced coverage on analytical, experimental, numerical simulation and applied investigations. It covers various disciplines of science, technology and engineering.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信