基于优化选择(进化)的数据挖掘

Jurnal Informatika Universitas Pamulang Pub Date : 2021-03-31 DOI:10.32493/informatika.v6i1.9291

Ahmad Fauzi

{"title":"基于优化选择(进化)的数据挖掘","authors":"Ahmad Fauzi","doi":"10.32493/informatika.v6i1.9291","DOIUrl":null,"url":null,"abstract":"In determining marketing strategies, the bank performs a classification from a customer database, the database will be analyzed by a decision maker and this is not easy for a decision maker, because of the complexity of the vast data and the many attributes of the data owned, so that it becomes an obstacle and obstacle. in decision making. This of course can have a negative effect on the company's business processes because there will be delays in determining marketing strategies. Data mining method is a method that can classify large data to determine the level of accuracy of a database. In overcoming these problems, it is necessary to do a database analysis to determine the accuracy level of the database classification owned by the company. For this reason, in this study a classification process will be carried out with the Bank Direct Marketing dataset taken from the UCI Machine Learning Repository web, using the Naïve Bayes algorithm, K-Nearest Neighbor, Support Vector Machine with Optimize Selection (Evolutionary) optimization, the calculation process using a data mining application. namely Rapidminer 5.3, to find the highest accuracy value from the calculation algorithm. Test method with 10-fold cross validation. In this study, the classification results with the highest level of accuracy were obtained using Optimize Selection (Evolutionary) optimization, namely the Naïve Bayes algorithm 90.18%, then K-Nearest Neighbor 86.66%, and Support Vector Machine 89.40%. ","PeriodicalId":251854,"journal":{"name":"Jurnal Informatika Universitas Pamulang","volume":"19 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Analisis Data Bank Direct Marketing dengan Perbandingan Klasifikasi Data Mining Berbasis Optimize Selection (Evolutionary)\",\"authors\":\"Ahmad Fauzi\",\"doi\":\"10.32493/informatika.v6i1.9291\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In determining marketing strategies, the bank performs a classification from a customer database, the database will be analyzed by a decision maker and this is not easy for a decision maker, because of the complexity of the vast data and the many attributes of the data owned, so that it becomes an obstacle and obstacle. in decision making. This of course can have a negative effect on the company's business processes because there will be delays in determining marketing strategies. Data mining method is a method that can classify large data to determine the level of accuracy of a database. In overcoming these problems, it is necessary to do a database analysis to determine the accuracy level of the database classification owned by the company. For this reason, in this study a classification process will be carried out with the Bank Direct Marketing dataset taken from the UCI Machine Learning Repository web, using the Naïve Bayes algorithm, K-Nearest Neighbor, Support Vector Machine with Optimize Selection (Evolutionary) optimization, the calculation process using a data mining application. namely Rapidminer 5.3, to find the highest accuracy value from the calculation algorithm. Test method with 10-fold cross validation. In this study, the classification results with the highest level of accuracy were obtained using Optimize Selection (Evolutionary) optimization, namely the Naïve Bayes algorithm 90.18%, then K-Nearest Neighbor 86.66%, and Support Vector Machine 89.40%. \",\"PeriodicalId\":251854,\"journal\":{\"name\":\"Jurnal Informatika Universitas Pamulang\",\"volume\":\"19 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-03-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Jurnal Informatika Universitas Pamulang\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.32493/informatika.v6i1.9291\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Jurnal Informatika Universitas Pamulang","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.32493/informatika.v6i1.9291","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

在确定营销策略时，银行从客户数据库中进行分类，数据库将由决策者进行分析，这对于决策者来说并不容易，因为庞大的数据的复杂性以及所拥有的数据的许多属性，使其成为障碍和障碍。在决策方面。这当然会对公司的业务流程产生负面影响，因为在确定营销策略方面会有延迟。数据挖掘方法是一种对大数据进行分类以确定数据库准确性水平的方法。在克服这些问题时，有必要进行数据库分析，以确定公司拥有的数据库分类的准确性水平。因此，在本研究中，将使用来自UCI机器学习存储库web的银行直销数据集进行分类过程，使用Naïve贝叶斯算法，k -最近邻，支持向量机优化选择(进化)优化，计算过程使用数据挖掘应用程序。即Rapidminer 5.3，从计算算法中找到最高精度值。具有10倍交叉验证的测试方法。在本研究中，使用优化选择(进化)优化获得的分类结果准确率最高，即Naïve贝叶斯算法90.18%，其次是k -最近邻算法86.66%，支持向量机算法89.40%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Analisis Data Bank Direct Marketing dengan Perbandingan Klasifikasi Data Mining Berbasis Optimize Selection (Evolutionary)

In determining marketing strategies, the bank performs a classification from a customer database, the database will be analyzed by a decision maker and this is not easy for a decision maker, because of the complexity of the vast data and the many attributes of the data owned, so that it becomes an obstacle and obstacle. in decision making. This of course can have a negative effect on the company's business processes because there will be delays in determining marketing strategies. Data mining method is a method that can classify large data to determine the level of accuracy of a database. In overcoming these problems, it is necessary to do a database analysis to determine the accuracy level of the database classification owned by the company. For this reason, in this study a classification process will be carried out with the Bank Direct Marketing dataset taken from the UCI Machine Learning Repository web, using the Naïve Bayes algorithm, K-Nearest Neighbor, Support Vector Machine with Optimize Selection (Evolutionary) optimization, the calculation process using a data mining application. namely Rapidminer 5.3, to find the highest accuracy value from the calculation algorithm. Test method with 10-fold cross validation. In this study, the classification results with the highest level of accuracy were obtained using Optimize Selection (Evolutionary) optimization, namely the Naïve Bayes algorithm 90.18%, then K-Nearest Neighbor 86.66%, and Support Vector Machine 89.40%.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Jurnal Informatika Universitas Pamulang

自引率

0.00%

发文量