N. A. Firdausanti, Irhamah, M. Aritsugi, H. Kuswanto
{"title":"高维数据集支持向量支持机分类的蚁群与疯狂粒子群算法","authors":"N. A. Firdausanti, Irhamah, M. Aritsugi, H. Kuswanto","doi":"10.1063/1.5139759","DOIUrl":null,"url":null,"abstract":"The data generated by DNA microarray technology can be used to predict and classify genes taken from certain tissues in humans to be classified as cancer or not. Microarray data consists of thousands of variables, but limited data is available. Support Vector Machine (SVM) is a supervised learning method that can be used for classification on the high-dimensional dataset. There are two problems in SVM classifier that influence the classification accuracy, which are tuning SVM parameters and selecting the best features subset to the SVM classifier. Several approaches have been carried out for the feature selection process and tuning SVM parameter, including a wrapper-based approach. The wrapper-based algorithm used in this research is Crazy Particle Swarm Optimization (CRAZYPSO) and Ant Colony Optimization (ACO). Both algorithms are the computational intelligence-based algorithm that can be used to solve the optimization problems, such as feature selection and parameter optimization. These algorithms are inspired by animal behavior in the real world. CRAZYPSO calculations are very simple compared to other optimization algorithms. While ACO has several advantages, such as strong robustness, well-distributed computing mechanism and easily combined with other methods. This study wants to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification. The microarray datasets used in this study are the prostate dataset and colon dataset. This study uses k-fold cross-validation accuracy to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification using Support Vector Machine. The result shows that the ACO algorithm gives a better result in feature selection than the CRAZYPSO algorithm with higher accuracy rate and less selected features. This study also shows that the SVM parameter optimized using ACO algorithm gives higher classification accuracy rate than parameter optimized using CRAZYPSO algorithm.","PeriodicalId":246056,"journal":{"name":"THE 2ND INTERNATIONAL CONFERENCE ON SCIENCE, MATHEMATICS, ENVIRONMENT, AND EDUCATION","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Ant colony optimization and crazy particle swarm optimization for support vector support machine classification on high-dimensional dataset\",\"authors\":\"N. A. Firdausanti, Irhamah, M. Aritsugi, H. Kuswanto\",\"doi\":\"10.1063/1.5139759\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The data generated by DNA microarray technology can be used to predict and classify genes taken from certain tissues in humans to be classified as cancer or not. Microarray data consists of thousands of variables, but limited data is available. Support Vector Machine (SVM) is a supervised learning method that can be used for classification on the high-dimensional dataset. There are two problems in SVM classifier that influence the classification accuracy, which are tuning SVM parameters and selecting the best features subset to the SVM classifier. Several approaches have been carried out for the feature selection process and tuning SVM parameter, including a wrapper-based approach. The wrapper-based algorithm used in this research is Crazy Particle Swarm Optimization (CRAZYPSO) and Ant Colony Optimization (ACO). Both algorithms are the computational intelligence-based algorithm that can be used to solve the optimization problems, such as feature selection and parameter optimization. These algorithms are inspired by animal behavior in the real world. CRAZYPSO calculations are very simple compared to other optimization algorithms. While ACO has several advantages, such as strong robustness, well-distributed computing mechanism and easily combined with other methods. This study wants to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification. The microarray datasets used in this study are the prostate dataset and colon dataset. This study uses k-fold cross-validation accuracy to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification using Support Vector Machine. The result shows that the ACO algorithm gives a better result in feature selection than the CRAZYPSO algorithm with higher accuracy rate and less selected features. This study also shows that the SVM parameter optimized using ACO algorithm gives higher classification accuracy rate than parameter optimized using CRAZYPSO algorithm.\",\"PeriodicalId\":246056,\"journal\":{\"name\":\"THE 2ND INTERNATIONAL CONFERENCE ON SCIENCE, MATHEMATICS, ENVIRONMENT, AND EDUCATION\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"THE 2ND INTERNATIONAL CONFERENCE ON SCIENCE, MATHEMATICS, ENVIRONMENT, AND EDUCATION\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1063/1.5139759\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"THE 2ND INTERNATIONAL CONFERENCE ON SCIENCE, MATHEMATICS, ENVIRONMENT, AND EDUCATION","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1063/1.5139759","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Ant colony optimization and crazy particle swarm optimization for support vector support machine classification on high-dimensional dataset
The data generated by DNA microarray technology can be used to predict and classify genes taken from certain tissues in humans to be classified as cancer or not. Microarray data consists of thousands of variables, but limited data is available. Support Vector Machine (SVM) is a supervised learning method that can be used for classification on the high-dimensional dataset. There are two problems in SVM classifier that influence the classification accuracy, which are tuning SVM parameters and selecting the best features subset to the SVM classifier. Several approaches have been carried out for the feature selection process and tuning SVM parameter, including a wrapper-based approach. The wrapper-based algorithm used in this research is Crazy Particle Swarm Optimization (CRAZYPSO) and Ant Colony Optimization (ACO). Both algorithms are the computational intelligence-based algorithm that can be used to solve the optimization problems, such as feature selection and parameter optimization. These algorithms are inspired by animal behavior in the real world. CRAZYPSO calculations are very simple compared to other optimization algorithms. While ACO has several advantages, such as strong robustness, well-distributed computing mechanism and easily combined with other methods. This study wants to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification. The microarray datasets used in this study are the prostate dataset and colon dataset. This study uses k-fold cross-validation accuracy to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification using Support Vector Machine. The result shows that the ACO algorithm gives a better result in feature selection than the CRAZYPSO algorithm with higher accuracy rate and less selected features. This study also shows that the SVM parameter optimized using ACO algorithm gives higher classification accuracy rate than parameter optimized using CRAZYPSO algorithm.