{"title":"非确定性局部搜索特征选择方法的实验研究","authors":"Marina P. Fernandez-Perez, F. F. González-Navarro","doi":"10.1109/MICAI.2014.16","DOIUrl":null,"url":null,"abstract":"The dimensionality reduction by feature selection is one of the fundamental steps in the pre-processing data stage in the intelligent data analysis. Feature selection (FS) literature embodies a wide spectrum of algorithms, methods and strategies, but mostly all fall into two classes, the well known wrappers and filters. The decision of which feature or variable is selected or discarded from the best current subset is still subject of research nowadays. In this paper, an experimental study about non-deterministic local search methods as main engine to this decision making is presented. The Simulated Annealing Algorithm, the Genetic Algorithm, the Tabu Search and the Threshold Accepting Algorithm are analyzed. They are used to select subset of features on several real and artificial data sets with different configurations -- i.e. Continuous and discrete data, high-low number of cases/features -- in a wrapper fashion. The Nearest Neighbor Classifier, the Linear and Quadratic Discriminant Classifier, the Naive Bayes classifier and the Support Vector Machine are evaluated as the performance function in the wrapper scheme.","PeriodicalId":189896,"journal":{"name":"2014 13th Mexican International Conference on Artificial Intelligence","volume":"266 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Non-deterministic Local Search Methods for Feature Selection: An Experimental Study\",\"authors\":\"Marina P. Fernandez-Perez, F. F. González-Navarro\",\"doi\":\"10.1109/MICAI.2014.16\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The dimensionality reduction by feature selection is one of the fundamental steps in the pre-processing data stage in the intelligent data analysis. Feature selection (FS) literature embodies a wide spectrum of algorithms, methods and strategies, but mostly all fall into two classes, the well known wrappers and filters. The decision of which feature or variable is selected or discarded from the best current subset is still subject of research nowadays. In this paper, an experimental study about non-deterministic local search methods as main engine to this decision making is presented. The Simulated Annealing Algorithm, the Genetic Algorithm, the Tabu Search and the Threshold Accepting Algorithm are analyzed. They are used to select subset of features on several real and artificial data sets with different configurations -- i.e. Continuous and discrete data, high-low number of cases/features -- in a wrapper fashion. The Nearest Neighbor Classifier, the Linear and Quadratic Discriminant Classifier, the Naive Bayes classifier and the Support Vector Machine are evaluated as the performance function in the wrapper scheme.\",\"PeriodicalId\":189896,\"journal\":{\"name\":\"2014 13th Mexican International Conference on Artificial Intelligence\",\"volume\":\"266 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-11-16\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2014 13th Mexican International Conference on Artificial Intelligence\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/MICAI.2014.16\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2014 13th Mexican International Conference on Artificial Intelligence","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/MICAI.2014.16","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Non-deterministic Local Search Methods for Feature Selection: An Experimental Study
The dimensionality reduction by feature selection is one of the fundamental steps in the pre-processing data stage in the intelligent data analysis. Feature selection (FS) literature embodies a wide spectrum of algorithms, methods and strategies, but mostly all fall into two classes, the well known wrappers and filters. The decision of which feature or variable is selected or discarded from the best current subset is still subject of research nowadays. In this paper, an experimental study about non-deterministic local search methods as main engine to this decision making is presented. The Simulated Annealing Algorithm, the Genetic Algorithm, the Tabu Search and the Threshold Accepting Algorithm are analyzed. They are used to select subset of features on several real and artificial data sets with different configurations -- i.e. Continuous and discrete data, high-low number of cases/features -- in a wrapper fashion. The Nearest Neighbor Classifier, the Linear and Quadratic Discriminant Classifier, the Naive Bayes classifier and the Support Vector Machine are evaluated as the performance function in the wrapper scheme.