G. Georgiev, N. Gueorguieva, Matthew Chiappa, Austin Krauza
{"title":"基于Gustafson-Kessel模糊算法的高维数据聚类特征选择","authors":"G. Georgiev, N. Gueorguieva, Matthew Chiappa, Austin Krauza","doi":"10.1109/ICMLA.2015.57","DOIUrl":null,"url":null,"abstract":"The performance of objective function-based fuzzy clustering algorithms depends on the shape and the volume of clusters, the initialization of clustering algorithm, the distribution of the data objects, and the number of clusters in the data. Feature selection is also one of the most important issues in high dimension data clustering specifically in bioinformatics, data mining, signal processing etc., where the feature space dimension tends to be very large, making both clustering and classification tasks very difficult. It is evident that the feature subset needed to successfully perform a given clustering and recognition task depends on the discriminatory qualities of the chosen features. We propose a new hybrid approach addressing feature selection, based on informative weights, which takes into account the membership degrees of the features performed by Gustafson-Kessel fuzzy algorithm. The purpose is to efficiently achieve high degree of dimensionality reduction and enhance or maintain predictive accuracy with selected features. The candidate feature subsets are generated by using iterative feature elimination procedure which results in estimation of feature informative weights. We use both supervised and unsupervised methods in order to evaluate the clustering abilities of feature subsets.","PeriodicalId":288427,"journal":{"name":"2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2015-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Feature Selection Using Gustafson-Kessel Fuzzy Algorithm in High Dimension Data Clustering\",\"authors\":\"G. Georgiev, N. Gueorguieva, Matthew Chiappa, Austin Krauza\",\"doi\":\"10.1109/ICMLA.2015.57\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The performance of objective function-based fuzzy clustering algorithms depends on the shape and the volume of clusters, the initialization of clustering algorithm, the distribution of the data objects, and the number of clusters in the data. Feature selection is also one of the most important issues in high dimension data clustering specifically in bioinformatics, data mining, signal processing etc., where the feature space dimension tends to be very large, making both clustering and classification tasks very difficult. It is evident that the feature subset needed to successfully perform a given clustering and recognition task depends on the discriminatory qualities of the chosen features. We propose a new hybrid approach addressing feature selection, based on informative weights, which takes into account the membership degrees of the features performed by Gustafson-Kessel fuzzy algorithm. The purpose is to efficiently achieve high degree of dimensionality reduction and enhance or maintain predictive accuracy with selected features. The candidate feature subsets are generated by using iterative feature elimination procedure which results in estimation of feature informative weights. We use both supervised and unsupervised methods in order to evaluate the clustering abilities of feature subsets.\",\"PeriodicalId\":288427,\"journal\":{\"name\":\"2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)\",\"volume\":\"40 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2015-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICMLA.2015.57\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2015 IEEE 14th International Conference on Machine Learning and Applications (ICMLA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICMLA.2015.57","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Feature Selection Using Gustafson-Kessel Fuzzy Algorithm in High Dimension Data Clustering
The performance of objective function-based fuzzy clustering algorithms depends on the shape and the volume of clusters, the initialization of clustering algorithm, the distribution of the data objects, and the number of clusters in the data. Feature selection is also one of the most important issues in high dimension data clustering specifically in bioinformatics, data mining, signal processing etc., where the feature space dimension tends to be very large, making both clustering and classification tasks very difficult. It is evident that the feature subset needed to successfully perform a given clustering and recognition task depends on the discriminatory qualities of the chosen features. We propose a new hybrid approach addressing feature selection, based on informative weights, which takes into account the membership degrees of the features performed by Gustafson-Kessel fuzzy algorithm. The purpose is to efficiently achieve high degree of dimensionality reduction and enhance or maintain predictive accuracy with selected features. The candidate feature subsets are generated by using iterative feature elimination procedure which results in estimation of feature informative weights. We use both supervised and unsupervised methods in order to evaluate the clustering abilities of feature subsets.