Optimization of the C4.5 Algorithm Using Particle Swarm Optimization and Discretization in Predicting the Results of English Premier League Football Matches

Muhammad Bahyul Anwar Fuadi, A. Alamsyah
{"title":"Optimization of the C4.5 Algorithm Using Particle Swarm Optimization and Discretization in Predicting the Results of English Premier League Football Matches","authors":"Muhammad Bahyul Anwar Fuadi, A. Alamsyah","doi":"10.15294/jaist.v4i2.59531","DOIUrl":null,"url":null,"abstract":"Football is one of the most popular sports. One of the most competitive football competitions is the English Premier League. This study aims to determine the prediction of the results of the football match in English Premier League. The prediction results in the form of home win, away win, and draw. This prediction uses data mining techniques, namely using the C4.5 algorithm as a classification algorithm with Particle Swarm Optimization as a feature selection method and Discretization as a preprocessing method. The dataset used was obtained from the football-data.co.uk website for four league seasons from the 2017/2018 season to the 2020/2021 season with a total of 1,520 instances. In this study, a comparison was made to the methods used to determine the increase in accuracy obtained. Based on ten times the data mining process, the final result of the best accuracy from using the C4.5 algorithm is 57.24%, then the C4.5 algorithm with Discretization gets an accuracy of 65.13%, and the C4.5 algorithm with Discretization and Particle Swarm Optimization gets accuracy of 71.05%. The conclusion is that the use of Discretization and Particle Swarm Optimization can improve the performance of the C4.5 algorithm in predicting the results of English Premier League matches with an increase in accuracy of 13.81%.","PeriodicalId":418742,"journal":{"name":"Journal of Advances in Information Systems and Technology","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-03-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Advances in Information Systems and Technology","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15294/jaist.v4i2.59531","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Football is one of the most popular sports. One of the most competitive football competitions is the English Premier League. This study aims to determine the prediction of the results of the football match in English Premier League. The prediction results in the form of home win, away win, and draw. This prediction uses data mining techniques, namely using the C4.5 algorithm as a classification algorithm with Particle Swarm Optimization as a feature selection method and Discretization as a preprocessing method. The dataset used was obtained from the football-data.co.uk website for four league seasons from the 2017/2018 season to the 2020/2021 season with a total of 1,520 instances. In this study, a comparison was made to the methods used to determine the increase in accuracy obtained. Based on ten times the data mining process, the final result of the best accuracy from using the C4.5 algorithm is 57.24%, then the C4.5 algorithm with Discretization gets an accuracy of 65.13%, and the C4.5 algorithm with Discretization and Particle Swarm Optimization gets accuracy of 71.05%. The conclusion is that the use of Discretization and Particle Swarm Optimization can improve the performance of the C4.5 algorithm in predicting the results of English Premier League matches with an increase in accuracy of 13.81%.
基于粒子群优化和离散化的C4.5算法在英超比赛结果预测中的优化
足球是最受欢迎的运动之一。竞争最激烈的足球比赛之一是英超联赛。本研究旨在确定英超足球比赛结果的预测。预测结果以主场胜利、客场胜利和平局的形式出现。该预测使用了数据挖掘技术,即使用C4.5算法作为分类算法,使用粒子群优化作为特征选择方法,使用离散化作为预处理方法。所使用的数据集来自football-data.co。从2017/2018赛季到2020/2021赛季的四个联赛的英国网站,共有1520个实例。在这项研究中,进行了比较,以确定所获得的准确性增加的方法。基于10倍的数据挖掘过程,C4.5算法的最终精度为57.24%,其次是离散化的C4.5算法的精度为65.13%,离散化和粒子群优化的C4.5算法的精度为71.05%。结果表明,采用离散化和粒子群优化方法可以提高C4.5算法预测英超比赛结果的性能,准确率提高13.81%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信