{"title":"基于混合卷积框架的陆基高光谱图像伪装目标分类研究","authors":"Jiale Zhao, Dan Fang, Jianghu Deng, Jiaju Ying, Yudan Chen, Guanglong Wang, Bing Zhou","doi":"10.1049/cit2.70051","DOIUrl":null,"url":null,"abstract":"<p>In recent years, camouflage technology has evolved from single-spectral-band applications to multifunctional and multispectral implementations. Hyperspectral imaging has emerged as a powerful technique for target identification due to its capacity to capture both spectral and spatial information. The advancement of imaging spectroscopy technology has significantly enhanced reconnaissance capabilities, offering substantial advantages in camouflaged target classification and detection. However, the increasing spectral similarity between camouflaged targets and their backgrounds has significantly compromised detection performance in specific scenarios. Conventional feature extraction methods are often limited to single, shallow spectral or spatial features, failing to extract deep features and consequently yielding suboptimal classification accuracy. To address these limitations, this study proposes an innovative 3D-2D convolutional neural networks architecture incorporating depthwise separable convolution (DSC) and attention mechanisms (AM). The framework first applies dimensionality reduction to hyperspectral images and extracts preliminary spectral-spatial features. It then employs an alternating combination of 3D and 2D convolutions for deep feature extraction. For target classification, the LogSoftmax function is implemented. The integration of depthwise separable convolution not only enhances classification accuracy but also substantially reduces model parameters. Furthermore, the attention mechanisms significantly improve the network's ability to represent multidimensional features. Extensive experiments were conducted on a custom land-based hyperspectral image dataset. The results demonstrate remarkable classification accuracy: 98.74% for grassland camouflage, 99.13% for dead leaf camouflage and 98.94% for wild grass camouflage. Comparative analysis shows that the proposed framework is outstanding in terms of classification accuracy and robustness for camouflage target classification.</p>","PeriodicalId":46211,"journal":{"name":"CAAI Transactions on Intelligence Technology","volume":"10 5","pages":"1559-1572"},"PeriodicalIF":7.3000,"publicationDate":"2025-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.70051","citationCount":"0","resultStr":"{\"title\":\"Exploring a Hybrid Convolutional Framework for Camouflage Target Classification in Land-Based Hyperspectral Images\",\"authors\":\"Jiale Zhao, Dan Fang, Jianghu Deng, Jiaju Ying, Yudan Chen, Guanglong Wang, Bing Zhou\",\"doi\":\"10.1049/cit2.70051\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In recent years, camouflage technology has evolved from single-spectral-band applications to multifunctional and multispectral implementations. Hyperspectral imaging has emerged as a powerful technique for target identification due to its capacity to capture both spectral and spatial information. The advancement of imaging spectroscopy technology has significantly enhanced reconnaissance capabilities, offering substantial advantages in camouflaged target classification and detection. However, the increasing spectral similarity between camouflaged targets and their backgrounds has significantly compromised detection performance in specific scenarios. Conventional feature extraction methods are often limited to single, shallow spectral or spatial features, failing to extract deep features and consequently yielding suboptimal classification accuracy. To address these limitations, this study proposes an innovative 3D-2D convolutional neural networks architecture incorporating depthwise separable convolution (DSC) and attention mechanisms (AM). The framework first applies dimensionality reduction to hyperspectral images and extracts preliminary spectral-spatial features. It then employs an alternating combination of 3D and 2D convolutions for deep feature extraction. For target classification, the LogSoftmax function is implemented. The integration of depthwise separable convolution not only enhances classification accuracy but also substantially reduces model parameters. Furthermore, the attention mechanisms significantly improve the network's ability to represent multidimensional features. Extensive experiments were conducted on a custom land-based hyperspectral image dataset. The results demonstrate remarkable classification accuracy: 98.74% for grassland camouflage, 99.13% for dead leaf camouflage and 98.94% for wild grass camouflage. Comparative analysis shows that the proposed framework is outstanding in terms of classification accuracy and robustness for camouflage target classification.</p>\",\"PeriodicalId\":46211,\"journal\":{\"name\":\"CAAI Transactions on Intelligence Technology\",\"volume\":\"10 5\",\"pages\":\"1559-1572\"},\"PeriodicalIF\":7.3000,\"publicationDate\":\"2025-07-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://ietresearch.onlinelibrary.wiley.com/doi/epdf/10.1049/cit2.70051\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"CAAI Transactions on Intelligence Technology\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/cit2.70051\",\"RegionNum\":2,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"CAAI Transactions on Intelligence Technology","FirstCategoryId":"94","ListUrlMain":"https://ietresearch.onlinelibrary.wiley.com/doi/10.1049/cit2.70051","RegionNum":2,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, ARTIFICIAL INTELLIGENCE","Score":null,"Total":0}
Exploring a Hybrid Convolutional Framework for Camouflage Target Classification in Land-Based Hyperspectral Images
In recent years, camouflage technology has evolved from single-spectral-band applications to multifunctional and multispectral implementations. Hyperspectral imaging has emerged as a powerful technique for target identification due to its capacity to capture both spectral and spatial information. The advancement of imaging spectroscopy technology has significantly enhanced reconnaissance capabilities, offering substantial advantages in camouflaged target classification and detection. However, the increasing spectral similarity between camouflaged targets and their backgrounds has significantly compromised detection performance in specific scenarios. Conventional feature extraction methods are often limited to single, shallow spectral or spatial features, failing to extract deep features and consequently yielding suboptimal classification accuracy. To address these limitations, this study proposes an innovative 3D-2D convolutional neural networks architecture incorporating depthwise separable convolution (DSC) and attention mechanisms (AM). The framework first applies dimensionality reduction to hyperspectral images and extracts preliminary spectral-spatial features. It then employs an alternating combination of 3D and 2D convolutions for deep feature extraction. For target classification, the LogSoftmax function is implemented. The integration of depthwise separable convolution not only enhances classification accuracy but also substantially reduces model parameters. Furthermore, the attention mechanisms significantly improve the network's ability to represent multidimensional features. Extensive experiments were conducted on a custom land-based hyperspectral image dataset. The results demonstrate remarkable classification accuracy: 98.74% for grassland camouflage, 99.13% for dead leaf camouflage and 98.94% for wild grass camouflage. Comparative analysis shows that the proposed framework is outstanding in terms of classification accuracy and robustness for camouflage target classification.
期刊介绍:
CAAI Transactions on Intelligence Technology is a leading venue for original research on the theoretical and experimental aspects of artificial intelligence technology. We are a fully open access journal co-published by the Institution of Engineering and Technology (IET) and the Chinese Association for Artificial Intelligence (CAAI) providing research which is openly accessible to read and share worldwide.