高维数据集支持向量支持机分类的蚁群与疯狂粒子群算法

N. A. Firdausanti, Irhamah, M. Aritsugi, H. Kuswanto
{"title":"高维数据集支持向量支持机分类的蚁群与疯狂粒子群算法","authors":"N. A. Firdausanti, Irhamah, M. Aritsugi, H. Kuswanto","doi":"10.1063/1.5139759","DOIUrl":null,"url":null,"abstract":"The data generated by DNA microarray technology can be used to predict and classify genes taken from certain tissues in humans to be classified as cancer or not. Microarray data consists of thousands of variables, but limited data is available. Support Vector Machine (SVM) is a supervised learning method that can be used for classification on the high-dimensional dataset. There are two problems in SVM classifier that influence the classification accuracy, which are tuning SVM parameters and selecting the best features subset to the SVM classifier. Several approaches have been carried out for the feature selection process and tuning SVM parameter, including a wrapper-based approach. The wrapper-based algorithm used in this research is Crazy Particle Swarm Optimization (CRAZYPSO) and Ant Colony Optimization (ACO). Both algorithms are the computational intelligence-based algorithm that can be used to solve the optimization problems, such as feature selection and parameter optimization. These algorithms are inspired by animal behavior in the real world. CRAZYPSO calculations are very simple compared to other optimization algorithms. While ACO has several advantages, such as strong robustness, well-distributed computing mechanism and easily combined with other methods. This study wants to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification. The microarray datasets used in this study are the prostate dataset and colon dataset. This study uses k-fold cross-validation accuracy to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification using Support Vector Machine. The result shows that the ACO algorithm gives a better result in feature selection than the CRAZYPSO algorithm with higher accuracy rate and less selected features. This study also shows that the SVM parameter optimized using ACO algorithm gives higher classification accuracy rate than parameter optimized using CRAZYPSO algorithm.","PeriodicalId":246056,"journal":{"name":"THE 2ND INTERNATIONAL CONFERENCE ON SCIENCE, MATHEMATICS, ENVIRONMENT, AND EDUCATION","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-12-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Ant colony optimization and crazy particle swarm optimization for support vector support machine classification on high-dimensional dataset\",\"authors\":\"N. A. Firdausanti, Irhamah, M. Aritsugi, H. Kuswanto\",\"doi\":\"10.1063/1.5139759\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The data generated by DNA microarray technology can be used to predict and classify genes taken from certain tissues in humans to be classified as cancer or not. Microarray data consists of thousands of variables, but limited data is available. Support Vector Machine (SVM) is a supervised learning method that can be used for classification on the high-dimensional dataset. There are two problems in SVM classifier that influence the classification accuracy, which are tuning SVM parameters and selecting the best features subset to the SVM classifier. Several approaches have been carried out for the feature selection process and tuning SVM parameter, including a wrapper-based approach. The wrapper-based algorithm used in this research is Crazy Particle Swarm Optimization (CRAZYPSO) and Ant Colony Optimization (ACO). Both algorithms are the computational intelligence-based algorithm that can be used to solve the optimization problems, such as feature selection and parameter optimization. These algorithms are inspired by animal behavior in the real world. CRAZYPSO calculations are very simple compared to other optimization algorithms. While ACO has several advantages, such as strong robustness, well-distributed computing mechanism and easily combined with other methods. This study wants to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification. The microarray datasets used in this study are the prostate dataset and colon dataset. This study uses k-fold cross-validation accuracy to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification using Support Vector Machine. The result shows that the ACO algorithm gives a better result in feature selection than the CRAZYPSO algorithm with higher accuracy rate and less selected features. This study also shows that the SVM parameter optimized using ACO algorithm gives higher classification accuracy rate than parameter optimized using CRAZYPSO algorithm.\",\"PeriodicalId\":246056,\"journal\":{\"name\":\"THE 2ND INTERNATIONAL CONFERENCE ON SCIENCE, MATHEMATICS, ENVIRONMENT, AND EDUCATION\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-12-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"THE 2ND INTERNATIONAL CONFERENCE ON SCIENCE, MATHEMATICS, ENVIRONMENT, AND EDUCATION\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1063/1.5139759\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"THE 2ND INTERNATIONAL CONFERENCE ON SCIENCE, MATHEMATICS, ENVIRONMENT, AND EDUCATION","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1063/1.5139759","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

DNA微阵列技术产生的数据可用于预测和分类从人体某些组织中提取的基因,以归类为癌症或非癌症。微阵列数据包含数千个变量,但可用的数据有限。支持向量机(SVM)是一种用于高维数据集分类的监督学习方法。支持向量机分类器中存在两个影响分类精度的问题,即支持向量机参数的调整和SVM分类器的最佳特征子集的选取。在特征选择过程和优化支持向量机参数方面,已经提出了几种方法,其中包括基于包装器的方法。本研究使用的基于包装的算法是疯狂粒子群优化(CRAZYPSO)和蚁群优化(ACO)。这两种算法都是基于计算智能的算法,可用于解决特征选择和参数优化等优化问题。这些算法的灵感来自于现实世界中的动物行为。与其他优化算法相比,CRAZYPSO的计算非常简单。而蚁群算法具有鲁棒性强、分布式计算机制好、易于与其他方法结合等优点。本研究想比较在微阵列数据分类的情况下,CRAZYPSO和ACO算法。本研究中使用的微阵列数据集是前列腺数据集和结肠数据集。本研究使用k倍交叉验证精度来比较使用支持向量机的微阵列数据分类情况下的CRAZYPSO和ACO算法。结果表明,蚁群算法在特征选择上优于CRAZYPSO算法,准确率更高,选择的特征更少。本研究还表明,使用蚁群算法优化的SVM参数比使用CRAZYPSO算法优化的参数具有更高的分类准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Ant colony optimization and crazy particle swarm optimization for support vector support machine classification on high-dimensional dataset
The data generated by DNA microarray technology can be used to predict and classify genes taken from certain tissues in humans to be classified as cancer or not. Microarray data consists of thousands of variables, but limited data is available. Support Vector Machine (SVM) is a supervised learning method that can be used for classification on the high-dimensional dataset. There are two problems in SVM classifier that influence the classification accuracy, which are tuning SVM parameters and selecting the best features subset to the SVM classifier. Several approaches have been carried out for the feature selection process and tuning SVM parameter, including a wrapper-based approach. The wrapper-based algorithm used in this research is Crazy Particle Swarm Optimization (CRAZYPSO) and Ant Colony Optimization (ACO). Both algorithms are the computational intelligence-based algorithm that can be used to solve the optimization problems, such as feature selection and parameter optimization. These algorithms are inspired by animal behavior in the real world. CRAZYPSO calculations are very simple compared to other optimization algorithms. While ACO has several advantages, such as strong robustness, well-distributed computing mechanism and easily combined with other methods. This study wants to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification. The microarray datasets used in this study are the prostate dataset and colon dataset. This study uses k-fold cross-validation accuracy to compare the CRAZYPSO and ACO algorithm in the case of microarray data classification using Support Vector Machine. The result shows that the ACO algorithm gives a better result in feature selection than the CRAZYPSO algorithm with higher accuracy rate and less selected features. This study also shows that the SVM parameter optimized using ACO algorithm gives higher classification accuracy rate than parameter optimized using CRAZYPSO algorithm.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信