Sonali P. Kadam, Varsha S. Naikwadi, Kaveree S. Belamkar, Aruna S. Andhare, Mayuri M. Mohite
{"title":"基于聚类的高维数据快速算法","authors":"Sonali P. Kadam, Varsha S. Naikwadi, Kaveree S. Belamkar, Aruna S. Andhare, Mayuri M. Mohite","doi":"10.1109/IC3I.2014.7019751","DOIUrl":null,"url":null,"abstract":"The rapid advance of computer technologies in data processing, collection and storage has provided unparalleled opportunities to expand capabilities in production, services communication and research. However, a feature selection algorithm may be evaluated from both the efficiency and effectiveness points of view. It finds the subset of features. There are two steps of FAST algorithm. First, using graph theoretic method features are divided into clusters. Second the features which are highly related to target class are selected. We are comparing FAST algorithm with the some representative feature subset selection algorithm name as Fast correlation based filter, Relief-F, Correlation based feature selection, Consist and FOCUS-SF. The results are available on high-dimensional data, microarray, text data and image data. Experimental results show that our FAST algorithm implementation can run faster and obtain better-extracted features than other methods.","PeriodicalId":430848,"journal":{"name":"2014 International Conference on Contemporary Computing and Informatics (IC3I)","volume":"67 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Clustering based - A FAST algorithm on high dimensional data\",\"authors\":\"Sonali P. Kadam, Varsha S. Naikwadi, Kaveree S. Belamkar, Aruna S. Andhare, Mayuri M. Mohite\",\"doi\":\"10.1109/IC3I.2014.7019751\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The rapid advance of computer technologies in data processing, collection and storage has provided unparalleled opportunities to expand capabilities in production, services communication and research. However, a feature selection algorithm may be evaluated from both the efficiency and effectiveness points of view. It finds the subset of features. There are two steps of FAST algorithm. First, using graph theoretic method features are divided into clusters. Second the features which are highly related to target class are selected. We are comparing FAST algorithm with the some representative feature subset selection algorithm name as Fast correlation based filter, Relief-F, Correlation based feature selection, Consist and FOCUS-SF. The results are available on high-dimensional data, microarray, text data and image data. Experimental results show that our FAST algorithm implementation can run faster and obtain better-extracted features than other methods.\",\"PeriodicalId\":430848,\"journal\":{\"name\":\"2014 International Conference on Contemporary Computing and Informatics (IC3I)\",\"volume\":\"67 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 International Conference on Contemporary Computing and Informatics (IC3I)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IC3I.2014.7019751\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 International Conference on Contemporary Computing and Informatics (IC3I)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3I.2014.7019751","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Clustering based - A FAST algorithm on high dimensional data
The rapid advance of computer technologies in data processing, collection and storage has provided unparalleled opportunities to expand capabilities in production, services communication and research. However, a feature selection algorithm may be evaluated from both the efficiency and effectiveness points of view. It finds the subset of features. There are two steps of FAST algorithm. First, using graph theoretic method features are divided into clusters. Second the features which are highly related to target class are selected. We are comparing FAST algorithm with the some representative feature subset selection algorithm name as Fast correlation based filter, Relief-F, Correlation based feature selection, Consist and FOCUS-SF. The results are available on high-dimensional data, microarray, text data and image data. Experimental results show that our FAST algorithm implementation can run faster and obtain better-extracted features than other methods.