A semi-supervised learning method for remote sensing data mining

17th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'05) Pub Date : 2005-11-14 DOI:10.1109/ICTAI.2005.17

Ranga Raju Vatsavai, S. Shekhar, T. Burk

{"title":"A semi-supervised learning method for remote sensing data mining","authors":"Ranga Raju Vatsavai, S. Shekhar, T. Burk","doi":"10.1109/ICTAI.2005.17","DOIUrl":null,"url":null,"abstract":"New approaches are needed to extract useful patterns from increasingly large multi-spectral remote sensing image databases in order to understand global climatic changes, vegetation dynamics, ocean processes, etc. Supervised learning, which is often used in land cover (thematic) classification of remote sensing imagery, requires large amounts of accurate training data. However, in many situations it is very difficult to collect labels for all training samples. In this paper we explore methods that utilize unlabeled samples in supervised learning for thematic information extraction from remote sensing imagery. Our objectives are to understand the impact of parameter estimation with small learning samples on classification accuracy, and to augment the parameter estimation with unlabeled training samples to improve land cover predictions. We have developed a semi-supervised learning method based on the expectation-maximization (EM) algorithm, and maximum likelihood and maximum a posteriori classifiers. This scheme utilizes a small set of labeled and a large number of unlabeled training samples. We have conducted several experiments on multi-spectral images to understand the impact of unlabeled samples on the classification performance. Our study shows that though in general classification accuracy improves with the addition of unlabeled training samples, it is not guaranteed to get consistently higher accuracies unless sufficient care is exercised when designing a semi-supervised classifier","PeriodicalId":294694,"journal":{"name":"17th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'05)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2005-11-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"29","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"17th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'05)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICTAI.2005.17","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 29

Abstract

New approaches are needed to extract useful patterns from increasingly large multi-spectral remote sensing image databases in order to understand global climatic changes, vegetation dynamics, ocean processes, etc. Supervised learning, which is often used in land cover (thematic) classification of remote sensing imagery, requires large amounts of accurate training data. However, in many situations it is very difficult to collect labels for all training samples. In this paper we explore methods that utilize unlabeled samples in supervised learning for thematic information extraction from remote sensing imagery. Our objectives are to understand the impact of parameter estimation with small learning samples on classification accuracy, and to augment the parameter estimation with unlabeled training samples to improve land cover predictions. We have developed a semi-supervised learning method based on the expectation-maximization (EM) algorithm, and maximum likelihood and maximum a posteriori classifiers. This scheme utilizes a small set of labeled and a large number of unlabeled training samples. We have conducted several experiments on multi-spectral images to understand the impact of unlabeled samples on the classification performance. Our study shows that though in general classification accuracy improves with the addition of unlabeled training samples, it is not guaranteed to get consistently higher accuracies unless sufficient care is exercised when designing a semi-supervised classifier

查看原文本刊更多论文

一种用于遥感数据挖掘的半监督学习方法

为了了解全球气候变化、植被动态、海洋过程等，需要从日益庞大的多光谱遥感影像数据库中提取有用模式的新方法。监督学习通常用于遥感影像的土地覆盖(专题)分类，它需要大量准确的训练数据。然而，在许多情况下，收集所有训练样本的标签是非常困难的。在本文中，我们探索了在监督学习中利用未标记样本从遥感图像中提取主题信息的方法。我们的目标是了解小学习样本参数估计对分类精度的影响，并用未标记的训练样本增强参数估计，以改进土地覆盖预测。我们开发了一种基于期望最大化(EM)算法、最大似然和最大后验分类器的半监督学习方法。该方案利用了少量有标记和大量未标记的训练样本。我们对多光谱图像进行了多次实验，以了解未标记样本对分类性能的影响。我们的研究表明，虽然在一般情况下，分类精度随着未标记训练样本的增加而提高，但除非在设计半监督分类器时足够小心，否则不能保证获得始终如一的更高精度

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

17th IEEE International Conference on Tools with Artificial Intelligence (ICTAI'05)

自引率

0.00%

发文量