{"title":"Support Vector Machines Optimization - An Income Prediction Study","authors":"A. Lazar, R. Zaremba","doi":"10.1109/ICCGI.2006.59","DOIUrl":null,"url":null,"abstract":"Relevant features selection through principal component analysis is employed to increase the efficiency of support vector machine (SVM) methods. In particular, a detailed study is presented on the effects of this statistical narrowing, when used to generate income prediction data based on the current population survey provided by the U.S. Census Bureau. A systematic analysis of the grid parameter search, training time, accuracy, and number of support vectors shows increases not only in the efficiency of the SVM methods, but also in the classification accuracy. Proper identification of the relevant features for specific problems allows accuracy values as high as 93% against a test population, to be obtained, while reducing the total computational. Tailoring computational methods around specific real data sets is critical in designing powerful algorithms","PeriodicalId":112974,"journal":{"name":"2006 International Multi-Conference on Computing in the Global Information Technology - (ICCGI'06)","volume":"41 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2006-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2006 International Multi-Conference on Computing in the Global Information Technology - (ICCGI'06)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCGI.2006.59","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5
Abstract
Relevant features selection through principal component analysis is employed to increase the efficiency of support vector machine (SVM) methods. In particular, a detailed study is presented on the effects of this statistical narrowing, when used to generate income prediction data based on the current population survey provided by the U.S. Census Bureau. A systematic analysis of the grid parameter search, training time, accuracy, and number of support vectors shows increases not only in the efficiency of the SVM methods, but also in the classification accuracy. Proper identification of the relevant features for specific problems allows accuracy values as high as 93% against a test population, to be obtained, while reducing the total computational. Tailoring computational methods around specific real data sets is critical in designing powerful algorithms