Impact of Training Set Size on Object-Based Land Cover Classification: A Comparison of Three Classifiers

Int. J. Appl. Geospat. Res. Pub Date : 2014-07-01 DOI:10.4018/IJAGR.2014070104

G. Myburgh, A. Niekerk

{"title":"Impact of Training Set Size on Object-Based Land Cover Classification: A Comparison of Three Classifiers","authors":"G. Myburgh, A. Niekerk","doi":"10.4018/IJAGR.2014070104","DOIUrl":null,"url":null,"abstract":"Supervised classifiers are commonly employed in remote sensing to extract land cover information, but various factors affect their accuracy. The number of available training samples, in particular, is known to have a significant impact on classification accuracies. Obtaining a sufficient number of samples is, however, not always practical. The support vector machine (SVM) is a supervised classifier known to perform well with limited training samples and has been compared favourably to other classifiers for various problems in pixel-based land cover classification. Very little research on training-sample size and classifier performance has been done in a geographical object-based image analysis (GEOBIA) environment. This paper compares the performance of SVM, nearest neighbour (NN) and maximum likelihood (ML) classifiers in a GEOBIA environment, with a focus on the influence of training-set size. Training-set sizes ranging from 4-20 per land cover class were tested. Classification tree analysis (CTA) was used for feature selection. The results indicate that the performance of all the classifiers improved significantly as the size of the training set increased. The ML classifier performed poorly when few (<10 per class) training samples were used and the NN classifier performed poorly compared to SVM throughout the experiment. SVM was the superior classifier for all training-set sizes although ML achieved competitive results for sets of 12 or more training areas per class.","PeriodicalId":368300,"journal":{"name":"Int. J. Appl. Geospat. Res.","volume":"39 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Int. J. Appl. Geospat. Res.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.4018/IJAGR.2014070104","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 22

Abstract

Supervised classifiers are commonly employed in remote sensing to extract land cover information, but various factors affect their accuracy. The number of available training samples, in particular, is known to have a significant impact on classification accuracies. Obtaining a sufficient number of samples is, however, not always practical. The support vector machine (SVM) is a supervised classifier known to perform well with limited training samples and has been compared favourably to other classifiers for various problems in pixel-based land cover classification. Very little research on training-sample size and classifier performance has been done in a geographical object-based image analysis (GEOBIA) environment. This paper compares the performance of SVM, nearest neighbour (NN) and maximum likelihood (ML) classifiers in a GEOBIA environment, with a focus on the influence of training-set size. Training-set sizes ranging from 4-20 per land cover class were tested. Classification tree analysis (CTA) was used for feature selection. The results indicate that the performance of all the classifiers improved significantly as the size of the training set increased. The ML classifier performed poorly when few (<10 per class) training samples were used and the NN classifier performed poorly compared to SVM throughout the experiment. SVM was the superior classifier for all training-set sizes although ML achieved competitive results for sets of 12 or more training areas per class.

查看原文本刊更多论文

训练集大小对基于目标的土地覆盖分类的影响:三种分类器的比较

监督分类器是遥感中常用的土地覆盖信息提取方法，但影响其准确性的因素很多。特别是，已知可用训练样本的数量对分类准确性有重大影响。然而，获得足够数量的样本并不总是可行的。支持向量机(SVM)是一种监督分类器，已知在有限的训练样本下表现良好，并且在基于像素的土地覆盖分类中的各种问题上与其他分类器进行了比较。在基于地理对象的图像分析(GEOBIA)环境中，关于训练样本大小和分类器性能的研究很少。本文比较了支持向量机(SVM)、最近邻(NN)和最大似然(ML)分类器在GEOBIA环境下的性能，重点研究了训练集大小的影响。每个土地覆盖类别测试了4-20个训练集的大小。使用分类树分析(CTA)进行特征选择。结果表明，随着训练集的增加，所有分类器的性能都有显著提高。当使用少量(每类<10个)训练样本时，ML分类器表现不佳，而在整个实验过程中，与SVM相比，NN分类器表现不佳。SVM是所有训练集大小的优秀分类器，尽管ML在每个类有12个或更多训练区域的集上取得了竞争结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Int. J. Appl. Geospat. Res.

自引率

0.00%

发文量