Sensitivity of hyperspectral classification algorithms to training sample size

Matthew A. Lee, S. Prasad, L. Bruce, Terrance R. West, Daniel Reynolds, T. Irby, H. Kalluri
{"title":"Sensitivity of hyperspectral classification algorithms to training sample size","authors":"Matthew A. Lee, S. Prasad, L. Bruce, Terrance R. West, Daniel Reynolds, T. Irby, H. Kalluri","doi":"10.1109/WHISPERS.2009.5288983","DOIUrl":null,"url":null,"abstract":"Algorithms that exploit hyperspectral imagery often encounter problems related to the high dimensionality of the data, particularly when the amount of training data is limited. Recently, two algorithms were proposed to alleviate the small sample size problem - one is based on employing a Multi-Classifier Decision Fusion (MCDF) in the raw reflectance domain, and the other employed the MCDF framework in the Discrete Wavelet Transform domain (DWT-MCDF). This paper investigates the sensitivity of conventional single classifier based classification approaches, as well as MCDF and DWT-MCDF to variations in the amount of data employed for training the classification system. The hyperspectral data in this experiment was obtained using an airborne hyperspectral imager used by SpecTIR™. The results of the experimental analysis show that for the given application, the MCDF and DWT-MCDF algorithms are significantly less sensitive than the conventional algorithms to limited training data. PCA consistently results in overall accuracies of about 35%. LDA accuracies are very high, about 75%, when there is an abundance of training data - about 10X (i.e. number of training samples is 10 times the number of spectral bands); remains above 60% for training data abundances of 2X and higher; but dramatically decreases to ∼20% for abundances of 1X. MCDF results in accuracies ranging between 65% and 75% for training data abundance of 3X and higher, but the accuracies drop to ∼60% for 2X and ∼55% for 1X. DWT-MCDF results in high accuracies with the least sensitivity to training data abundance. Its accuracies range between ∼60–65% for abundances of 1X to 10X.","PeriodicalId":242447,"journal":{"name":"2009 First Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-10-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"18","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 First Workshop on Hyperspectral Image and Signal Processing: Evolution in Remote Sensing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WHISPERS.2009.5288983","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 18

Abstract

Algorithms that exploit hyperspectral imagery often encounter problems related to the high dimensionality of the data, particularly when the amount of training data is limited. Recently, two algorithms were proposed to alleviate the small sample size problem - one is based on employing a Multi-Classifier Decision Fusion (MCDF) in the raw reflectance domain, and the other employed the MCDF framework in the Discrete Wavelet Transform domain (DWT-MCDF). This paper investigates the sensitivity of conventional single classifier based classification approaches, as well as MCDF and DWT-MCDF to variations in the amount of data employed for training the classification system. The hyperspectral data in this experiment was obtained using an airborne hyperspectral imager used by SpecTIR™. The results of the experimental analysis show that for the given application, the MCDF and DWT-MCDF algorithms are significantly less sensitive than the conventional algorithms to limited training data. PCA consistently results in overall accuracies of about 35%. LDA accuracies are very high, about 75%, when there is an abundance of training data - about 10X (i.e. number of training samples is 10 times the number of spectral bands); remains above 60% for training data abundances of 2X and higher; but dramatically decreases to ∼20% for abundances of 1X. MCDF results in accuracies ranging between 65% and 75% for training data abundance of 3X and higher, but the accuracies drop to ∼60% for 2X and ∼55% for 1X. DWT-MCDF results in high accuracies with the least sensitivity to training data abundance. Its accuracies range between ∼60–65% for abundances of 1X to 10X.
高光谱分类算法对训练样本大小的敏感性
利用高光谱图像的算法经常遇到与数据高维相关的问题,特别是在训练数据量有限的情况下。近年来,针对小样本问题提出了两种算法,一种是基于原始反射域的多分类器决策融合(MCDF),另一种是基于离散小波变换域的MCDF框架(DWT-MCDF)。本文研究了传统的基于单分类器的分类方法,以及MCDF和DWT-MCDF对用于训练分类系统的数据量变化的敏感性。本实验中的高光谱数据使用SpecTIR™机载高光谱成像仪获得。实验分析结果表明,对于给定的应用,MCDF和DWT-MCDF算法对有限训练数据的敏感性明显低于传统算法。PCA的总体准确率始终在35%左右。当训练数据丰富时,LDA的准确率非常高,约为75%,约为10倍(即训练样本数量是光谱频带数量的10倍);训练数据丰度为2X及以上时保持在60%以上;但对于1X丰度,则急剧下降到20%。对于训练数据丰度为3X或更高的情况,MCDF的准确率在65%到75%之间,但对于2X和1X,准确率分别下降到60%和55%。DWT-MCDF的准确度高,对训练数据丰度的敏感度最低。在丰度为1X至10X的情况下,其精度范围在~ 60-65%之间。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信