Sensitivity of hyperspectral classification algorithms to training sample size

2009 First Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing Pub Date : 2009-10-16 DOI:10.1109/WHISPERS.2009.5288983

Matthew A. Lee, S. Prasad, L. Bruce, Terrance R. West, Daniel Reynolds, T. Irby, H. Kalluri

{"title":"Sensitivity of hyperspectral classification algorithms to training sample size","authors":"Matthew A. Lee, S. Prasad, L. Bruce, Terrance R. West, Daniel Reynolds, T. Irby, H. Kalluri","doi":"10.1109/WHISPERS.2009.5288983","DOIUrl":null,"url":null,"abstract":"Algorithms that exploit hyperspectral imagery often encounter problems related to the high dimensionality of the data, particularly when the amount of training data is limited. Recently, two algorithms were proposed to alleviate the small sample size problem - one is based on employing a Multi-Classifier Decision Fusion (MCDF) in the raw reflectance domain, and the other employed the MCDF framework in the Discrete Wavelet Transform domain (DWT-MCDF). This paper investigates the sensitivity of conventional single classifier based classification approaches, as well as MCDF and DWT-MCDF to variations in the amount of data employed for training the classification system. The hyperspectral data in this experiment was obtained using an airborne hyperspectral imager used by SpecTIR™. The results of the experimental analysis show that for the given application, the MCDF and DWT-MCDF algorithms are significantly less sensitive than the conventional algorithms to limited training data. PCA consistently results in overall accuracies of about 35%. LDA accuracies are very high, about 75%, when there is an abundance of training data - about 10X (i.e. number of training samples is 10 times the number of spectral bands); remains above 60% for training data abundances of 2X and higher; but dramatically decreases to ∼20% for abundances of 1X. MCDF results in accuracies ranging between 65% and 75% for training data abundance of 3X and higher, but the accuracies drop to ∼60% for 2X and ∼55% for 1X. DWT-MCDF results in high accuracies with the least sensitivity to training data abundance. Its accuracies range between ∼60–65% for abundances of 1X to 10X.","PeriodicalId":242447,"journal":{"name":"2009 First Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 First Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WHISPERS.2009.5288983","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 18

Abstract

Algorithms that exploit hyperspectral imagery often encounter problems related to the high dimensionality of the data, particularly when the amount of training data is limited. Recently, two algorithms were proposed to alleviate the small sample size problem - one is based on employing a Multi-Classifier Decision Fusion (MCDF) in the raw reflectance domain, and the other employed the MCDF framework in the Discrete Wavelet Transform domain (DWT-MCDF). This paper investigates the sensitivity of conventional single classifier based classification approaches, as well as MCDF and DWT-MCDF to variations in the amount of data employed for training the classification system. The hyperspectral data in this experiment was obtained using an airborne hyperspectral imager used by SpecTIR™. The results of the experimental analysis show that for the given application, the MCDF and DWT-MCDF algorithms are significantly less sensitive than the conventional algorithms to limited training data. PCA consistently results in overall accuracies of about 35%. LDA accuracies are very high, about 75%, when there is an abundance of training data - about 10X (i.e. number of training samples is 10 times the number of spectral bands); remains above 60% for training data abundances of 2X and higher; but dramatically decreases to ∼20% for abundances of 1X. MCDF results in accuracies ranging between 65% and 75% for training data abundance of 3X and higher, but the accuracies drop to ∼60% for 2X and ∼55% for 1X. DWT-MCDF results in high accuracies with the least sensitivity to training data abundance. Its accuracies range between ∼60–65% for abundances of 1X to 10X.

查看原文本刊更多论文

高光谱分类算法对训练样本大小的敏感性

利用高光谱图像的算法经常遇到与数据高维相关的问题，特别是在训练数据量有限的情况下。近年来，针对小样本问题提出了两种算法，一种是基于原始反射域的多分类器决策融合(MCDF)，另一种是基于离散小波变换域的MCDF框架(DWT-MCDF)。本文研究了传统的基于单分类器的分类方法，以及MCDF和DWT-MCDF对用于训练分类系统的数据量变化的敏感性。本实验中的高光谱数据使用SpecTIR™机载高光谱成像仪获得。实验分析结果表明，对于给定的应用，MCDF和DWT-MCDF算法对有限训练数据的敏感性明显低于传统算法。PCA的总体准确率始终在35%左右。当训练数据丰富时，LDA的准确率非常高，约为75%，约为10倍(即训练样本数量是光谱频带数量的10倍);训练数据丰度为2X及以上时保持在60%以上;但对于1X丰度，则急剧下降到20%。对于训练数据丰度为3X或更高的情况，MCDF的准确率在65%到75%之间，但对于2X和1X，准确率分别下降到60%和55%。DWT-MCDF的准确度高，对训练数据丰度的敏感度最低。在丰度为1X至10X的情况下，其精度范围在~ 60-65%之间。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2009 First Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing

自引率

0.00%

发文量