{"title":"基于多变量聚类的高维数据粒度最优特征选择","authors":"SRINIVAS KOLLI, M. Sreedevi","doi":"10.17762/TURCOMAT.V12I3.2031","DOIUrl":null,"url":null,"abstract":"Clustering is the most complex in multi/high dimensional data because of sub feature selection fromoverall features present in categorical data sources. Sub set feature be the aggressive approach to decreasefeature dimensionality in mining of data, identification of patterns. Main aim behind selection of feature withrespect to selection of optimal feature and decrease the redundancy. In-order to compute withredundant/irrelevant features in high dimensional sample data exploration based on feature selection calculationwith data granular described in this document. Propose aNovel Granular Feature Multi-variant Clustering basedGenetic Algorithm (NGFMCGA) model to evaluate the performance results in this implementation. This modelmain consists two phases, in first phase, based on theoretic graph grouping procedure divide features intodifferent clusters, in second phase, select strongly representative related feature from each cluster with respectto matching of subset of features. Features present in this concept are independent because of features selectfrom different clusters, proposed approach clustering have high probability in processing and increasing thequality of independent and useful features.Optimal subset feature selection improves accuracy of clustering andfeature classification, performance of proposed approach describes better accuracy with respect to optimalsubset selection is applied on publicly related data sets and it is compared with traditional supervisedevolutionary approaches.","PeriodicalId":21779,"journal":{"name":"Solid State Technology","volume":"63 1","pages":"2881-2896"},"PeriodicalIF":0.0000,"publicationDate":"2020-04-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":"{\"title\":\"A Novel Granularity Optimal Feature Selection based on Multi-Variant Clustering for High Dimensional Data\",\"authors\":\"SRINIVAS KOLLI, M. Sreedevi\",\"doi\":\"10.17762/TURCOMAT.V12I3.2031\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Clustering is the most complex in multi/high dimensional data because of sub feature selection fromoverall features present in categorical data sources. Sub set feature be the aggressive approach to decreasefeature dimensionality in mining of data, identification of patterns. Main aim behind selection of feature withrespect to selection of optimal feature and decrease the redundancy. In-order to compute withredundant/irrelevant features in high dimensional sample data exploration based on feature selection calculationwith data granular described in this document. Propose aNovel Granular Feature Multi-variant Clustering basedGenetic Algorithm (NGFMCGA) model to evaluate the performance results in this implementation. This modelmain consists two phases, in first phase, based on theoretic graph grouping procedure divide features intodifferent clusters, in second phase, select strongly representative related feature from each cluster with respectto matching of subset of features. Features present in this concept are independent because of features selectfrom different clusters, proposed approach clustering have high probability in processing and increasing thequality of independent and useful features.Optimal subset feature selection improves accuracy of clustering andfeature classification, performance of proposed approach describes better accuracy with respect to optimalsubset selection is applied on publicly related data sets and it is compared with traditional supervisedevolutionary approaches.\",\"PeriodicalId\":21779,\"journal\":{\"name\":\"Solid State Technology\",\"volume\":\"63 1\",\"pages\":\"2881-2896\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-04-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"7\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Solid State Technology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.17762/TURCOMAT.V12I3.2031\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q4\",\"JCRName\":\"Materials Science\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Solid State Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.17762/TURCOMAT.V12I3.2031","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"Materials Science","Score":null,"Total":0}
A Novel Granularity Optimal Feature Selection based on Multi-Variant Clustering for High Dimensional Data
Clustering is the most complex in multi/high dimensional data because of sub feature selection fromoverall features present in categorical data sources. Sub set feature be the aggressive approach to decreasefeature dimensionality in mining of data, identification of patterns. Main aim behind selection of feature withrespect to selection of optimal feature and decrease the redundancy. In-order to compute withredundant/irrelevant features in high dimensional sample data exploration based on feature selection calculationwith data granular described in this document. Propose aNovel Granular Feature Multi-variant Clustering basedGenetic Algorithm (NGFMCGA) model to evaluate the performance results in this implementation. This modelmain consists two phases, in first phase, based on theoretic graph grouping procedure divide features intodifferent clusters, in second phase, select strongly representative related feature from each cluster with respectto matching of subset of features. Features present in this concept are independent because of features selectfrom different clusters, proposed approach clustering have high probability in processing and increasing thequality of independent and useful features.Optimal subset feature selection improves accuracy of clustering andfeature classification, performance of proposed approach describes better accuracy with respect to optimalsubset selection is applied on publicly related data sets and it is compared with traditional supervisedevolutionary approaches.