{"title":"Improve regression accuracy by using an attribute weighted KNN approach","authors":"Ziqi Chen, Bing Li, Bo Han","doi":"10.1109/FSKD.2017.8393046","DOIUrl":null,"url":null,"abstract":"KNN (K nearest neighbor) algorithm is a widely used regression method, with a very simple principle about neighborhood. Though it achieves success in many application areas, the method has a shortcoming of weighting equal contributions to each attribute when computing distance between instances. In this paper, we applied a weighted KNN approach by using weights obtained from optimization and feature selection methods and compared the performance and efficiency of these two types of algorithms in regression problems. Experiments on two UCI datasets show that optimization algorithms like particle swarm optimization can obtain more valuable weights than feature selection algorithms, such as information gain and RelefF, with the tradeoff of running time cost. Both of them canimprove the performance of traditional KNN with equal feature weights.","PeriodicalId":236093,"journal":{"name":"2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 13th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery (ICNC-FSKD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/FSKD.2017.8393046","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
KNN (K nearest neighbor) algorithm is a widely used regression method, with a very simple principle about neighborhood. Though it achieves success in many application areas, the method has a shortcoming of weighting equal contributions to each attribute when computing distance between instances. In this paper, we applied a weighted KNN approach by using weights obtained from optimization and feature selection methods and compared the performance and efficiency of these two types of algorithms in regression problems. Experiments on two UCI datasets show that optimization algorithms like particle swarm optimization can obtain more valuable weights than feature selection algorithms, such as information gain and RelefF, with the tradeoff of running time cost. Both of them canimprove the performance of traditional KNN with equal feature weights.