An Approach for Treatment of the Incomplete Data Based on WaveCluster and Weighted 1-Nearest Neighbor

Xing-yi Li, Junyun Lu, Huaji Shi, Suqin Ma
{"title":"An Approach for Treatment of the Incomplete Data Based on WaveCluster and Weighted 1-Nearest Neighbor","authors":"Xing-yi Li, Junyun Lu, Huaji Shi, Suqin Ma","doi":"10.1109/IACSIT-SC.2009.38","DOIUrl":null,"url":null,"abstract":"For the incomplete data that usually exists in the process of pretreatment, this article presents an approach for treatment of the incomplete data based on WaveCluster and weighted 1-Nearest Neighbor (1-NN).The proposed method firstly carries out the WaveCluster in the complete record set of the whole set, which can reduce the volume of comparative data and rule out outliers, improve computational efficiency of the algorithm and the clustering accuracy. Then, the weighted 1-NN method is used, according to the contribution attributes made to the classification in the algorithm, the information gain of attribute is calculated and each attribute is endowed with certain weight using in the nearest neighbor measure, thus it can enhance the filling precision of the missing value. Experimental results show the proposed method is an appropriate and effective method in treatment of the incomplete data.","PeriodicalId":286158,"journal":{"name":"2009 International Association of Computer Science and Information Technology - Spring Conference","volume":"358 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-04-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 International Association of Computer Science and Information Technology - Spring Conference","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IACSIT-SC.2009.38","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

Abstract

For the incomplete data that usually exists in the process of pretreatment, this article presents an approach for treatment of the incomplete data based on WaveCluster and weighted 1-Nearest Neighbor (1-NN).The proposed method firstly carries out the WaveCluster in the complete record set of the whole set, which can reduce the volume of comparative data and rule out outliers, improve computational efficiency of the algorithm and the clustering accuracy. Then, the weighted 1-NN method is used, according to the contribution attributes made to the classification in the algorithm, the information gain of attribute is calculated and each attribute is endowed with certain weight using in the nearest neighbor measure, thus it can enhance the filling precision of the missing value. Experimental results show the proposed method is an appropriate and effective method in treatment of the incomplete data.
基于波聚类和加权1近邻的不完全数据处理方法
针对预处理过程中通常存在的不完整数据,本文提出了一种基于WaveCluster和加权1-近邻(weighted 1-Nearest Neighbor, 1-NN)的不完整数据处理方法。本文提出的方法首先在整个记录集的完整记录集上进行WaveCluster,可以减少比较数据量并排除异常值,提高算法的计算效率和聚类精度。然后,采用加权1-NN方法,根据算法中对分类做出贡献的属性,计算属性的信息增益,并利用最近邻度量赋予每个属性一定的权重,从而提高缺失值的填充精度。实验结果表明,该方法是一种处理不完全数据的有效方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信