Semantic segmentation of camouflage objects via fusing reconstructed multispectral and RGB images

IF 5.9 Q1 ENGINEERING, MULTIDISCIPLINARY

Defence Technology(防务技术) Pub Date : 2025-08-01 DOI:10.1016/j.dt.2025.04.006

Feng Huang , Gonghan Yang , Jing Chen , Yixuan Xu , Jingze Su , Guimin Huang , Shu Wang , Wenxi Liu

{"title":"Semantic segmentation of camouflage objects via fusing reconstructed multispectral and RGB images","authors":"Feng Huang , Gonghan Yang , Jing Chen , Yixuan Xu , Jingze Su , Guimin Huang , Shu Wang , Wenxi Liu","doi":"10.1016/j.dt.2025.04.006","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate segmentation of camouflage objects in aerial imagery is vital for improving the efficiency of UAV-based reconnaissance and rescue missions. However, camouflage object segmentation is increasingly challenging due to advances in both camouflage materials and biological mimicry. Although multispectral-RGB based technology shows promise, conventional dual-aperture multispectral-RGB imaging systems are constrained by imprecise and time-consuming registration and fusion across different modalities, limiting their performance. Here, we propose the Reconstructed Multispectral-RGB Fusion Network (RMRF-Net), which reconstructs RGB images into multispectral ones, enabling efficient multimodal segmentation using only an RGB camera. Specifically, RMRF-Net employs a divergent-similarity feature correction strategy to minimize reconstruction errors and includes an efficient boundary-aware decoder to enhance object contours. Notably, we establish the first real-world aerial multispectral-RGB semantic segmentation of camouflage objects dataset, including 11 object categories. Experimental results demonstrate that RMRF-Net outperforms existing methods, achieving 17.38 FPS on the NVIDIA Jetson AGX Orin, with only a 0.96% drop in mIoU compared to the RTX 3090, showing its practical applicability in multimodal remote sensing.</div></div>","PeriodicalId":58209,"journal":{"name":"Defence Technology(防务技术)","volume":"50 ","pages":"Pages 324-337"},"PeriodicalIF":5.9000,"publicationDate":"2025-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Defence Technology(防务技术)","FirstCategoryId":"1087","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214914725001229","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, MULTIDISCIPLINARY","Score":null,"Total":0}

引用次数: 0

Abstract

Accurate segmentation of camouflage objects in aerial imagery is vital for improving the efficiency of UAV-based reconnaissance and rescue missions. However, camouflage object segmentation is increasingly challenging due to advances in both camouflage materials and biological mimicry. Although multispectral-RGB based technology shows promise, conventional dual-aperture multispectral-RGB imaging systems are constrained by imprecise and time-consuming registration and fusion across different modalities, limiting their performance. Here, we propose the Reconstructed Multispectral-RGB Fusion Network (RMRF-Net), which reconstructs RGB images into multispectral ones, enabling efficient multimodal segmentation using only an RGB camera. Specifically, RMRF-Net employs a divergent-similarity feature correction strategy to minimize reconstruction errors and includes an efficient boundary-aware decoder to enhance object contours. Notably, we establish the first real-world aerial multispectral-RGB semantic segmentation of camouflage objects dataset, including 11 object categories. Experimental results demonstrate that RMRF-Net outperforms existing methods, achieving 17.38 FPS on the NVIDIA Jetson AGX Orin, with only a 0.96% drop in mIoU compared to the RTX 3090, showing its practical applicability in multimodal remote sensing.

查看原文本刊更多论文

基于重构多光谱和RGB图像融合的伪装目标语义分割

航拍图像中伪装目标的准确分割对于提高无人机侦察救援任务的效率至关重要。然而，由于伪装材料和生物模拟技术的进步，伪装目标分割越来越具有挑战性。尽管基于多光谱rgb的技术显示出前景，但传统的双孔径多光谱rgb成像系统受到不精确和耗时的限制，限制了它们的性能。在此，我们提出了重构多光谱-RGB融合网络（RMRF-Net），它将RGB图像重建为多光谱图像，仅使用RGB相机即可实现高效的多模态分割。具体来说，RMRF-Net采用发散相似特征校正策略来最大限度地减少重建误差，并包括一个有效的边界感知解码器来增强目标轮廓。值得注意的是，我们建立了第一个真实世界的空中多光谱rgb伪装目标数据集，包括11个目标类别。实验结果表明，RMRF-Net优于现有方法，在NVIDIA Jetson AGX Orin上实现了17.38 FPS，与RTX 3090相比mIoU仅下降了0.96%，显示了其在多模态遥感中的实际适用性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Defence Technology(防务技术) Mechanical Engineering, Control and Systems Engineering, Industrial and Manufacturing Engineering

CiteScore

8.70

自引率

0.00%

发文量

728

审稿时长

25 days

期刊介绍： Defence Technology, a peer reviewed journal, is published monthly and aims to become the best international academic exchange platform for the research related to defence technology. It publishes original research papers having direct bearing on defence, with a balanced coverage on analytical, experimental, numerical simulation and applied investigations. It covers various disciplines of science, technology and engineering.