{"title":"Feature selection with a supervised similarity-based k-medoids clustering","authors":"Chen-Sen Ouyang","doi":"10.1109/ICMLC.2014.7009669","DOIUrl":null,"url":null,"abstract":"A supervised similarity-based k-medoids (SSKM) clustering algorithm is proposed for feature selection in classification problems. The set of original features is iteratively partitioned into k clusters, each of which is composed of similar features and represented by a feature yielding the maximum total of similarities with the other features in the duster. A supervised similarity measure is introduced to evaluate the similarity between two features for incorporating information of class labels of training patterns during clustering and representative selection. Experimental results show that our proposed method can select a more effective set of features for classification problems.","PeriodicalId":335296,"journal":{"name":"2014 International Conference on Machine Learning and Cybernetics","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Machine Learning and Cybernetics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLC.2014.7009669","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
A supervised similarity-based k-medoids (SSKM) clustering algorithm is proposed for feature selection in classification problems. The set of original features is iteratively partitioned into k clusters, each of which is composed of similar features and represented by a feature yielding the maximum total of similarities with the other features in the duster. A supervised similarity measure is introduced to evaluate the similarity between two features for incorporating information of class labels of training patterns during clustering and representative selection. Experimental results show that our proposed method can select a more effective set of features for classification problems.