Machine learning-driven market value prediction for European football players

Abdullah Tamim , Md. Wadud Jahan , Md. Rashid Shahriar Chowdhury , Ahammad Hossain , Md. Mizanur Rahman , A.H.M. Rahmatullah Imon
{"title":"Machine learning-driven market value prediction for European football players","authors":"Abdullah Tamim ,&nbsp;Md. Wadud Jahan ,&nbsp;Md. Rashid Shahriar Chowdhury ,&nbsp;Ahammad Hossain ,&nbsp;Md. Mizanur Rahman ,&nbsp;A.H.M. Rahmatullah Imon","doi":"10.1016/j.jcmds.2025.100118","DOIUrl":null,"url":null,"abstract":"<div><div>Football is globally recognized as the most widely practiced and watched sport. Precise player value is crucial for clubs seeking to maximize their player acquisition strategy and overall success in football. Conventional player valuation methodologies are mainly dependent on expert judgments and subjective assessments, missing the objectivity and precision provided by data-driven approaches. This study seeks to close this disparity by utilizing machine learning techniques to predict the market valuations of football players. The analysis is conducted using an extensive dataset sourced from the FIFA 22 video game, which was obtained via sofifa.com. The collection includes more than 16,000 players. The Machine Learning (ML) techniques used in this study are Multiple Linear Regression (MLR), Ridge Regression (RR), Support Vector Regression (SVR), and Random Forest Regression (RFR). The machine learning algorithms undergo training using 80% of the samples and are subsequently tested using the remaining 20% of the samples. We evaluate each algorithm’s performance using metrics like Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and R-squared (R<sup>2</sup>) value. Numerical results show that the RFR model demonstrates superior performance by achieving the lowest MAE, MSE, RMSE, and the highest R<sup>2</sup> value across all samples. The RFR effectively captures non-linear interactions and reliably prevents overfitting. This research finding will enhance the existing knowledge in sports economics by demonstrating how ML can be used to anticipate the market prices of football players with better accuracy. This will provide football teams with valuable insights to make more strategic decisions.</div></div>","PeriodicalId":100768,"journal":{"name":"Journal of Computational Mathematics and Data Science","volume":"15 ","pages":"Article 100118"},"PeriodicalIF":0.0000,"publicationDate":"2025-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Computational Mathematics and Data Science","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772415825000100","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Football is globally recognized as the most widely practiced and watched sport. Precise player value is crucial for clubs seeking to maximize their player acquisition strategy and overall success in football. Conventional player valuation methodologies are mainly dependent on expert judgments and subjective assessments, missing the objectivity and precision provided by data-driven approaches. This study seeks to close this disparity by utilizing machine learning techniques to predict the market valuations of football players. The analysis is conducted using an extensive dataset sourced from the FIFA 22 video game, which was obtained via sofifa.com. The collection includes more than 16,000 players. The Machine Learning (ML) techniques used in this study are Multiple Linear Regression (MLR), Ridge Regression (RR), Support Vector Regression (SVR), and Random Forest Regression (RFR). The machine learning algorithms undergo training using 80% of the samples and are subsequently tested using the remaining 20% of the samples. We evaluate each algorithm’s performance using metrics like Mean Absolute Error (MAE), Mean Squared Error (MSE), Root Mean Squared Error (RMSE), and R-squared (R2) value. Numerical results show that the RFR model demonstrates superior performance by achieving the lowest MAE, MSE, RMSE, and the highest R2 value across all samples. The RFR effectively captures non-linear interactions and reliably prevents overfitting. This research finding will enhance the existing knowledge in sports economics by demonstrating how ML can be used to anticipate the market prices of football players with better accuracy. This will provide football teams with valuable insights to make more strategic decisions.
机器学习驱动的欧洲足球运动员市场价值预测
足球是全球公认的最广泛练习和观看的运动。精确的球员价值对于寻求最大化球员获取策略和足球整体成功的俱乐部至关重要。传统的玩家评估方法主要依赖于专家判断和主观评估,缺少数据驱动方法所提供的客观性和精确性。这项研究试图通过利用机器学习技术来预测足球运动员的市场估值来缩小这种差距。该分析使用了来自FIFA 22视频游戏的广泛数据集,该数据集是通过soffifa.com获得的。收藏了超过16000名球员。本研究中使用的机器学习(ML)技术是多元线性回归(MLR)、岭回归(RR)、支持向量回归(SVR)和随机森林回归(RFR)。机器学习算法使用80%的样本进行训练,随后使用剩余20%的样本进行测试。我们使用平均绝对误差(MAE)、均方误差(MSE)、均方根误差(RMSE)和r平方(R2)值等指标来评估每种算法的性能。数值结果表明,RFR模型通过在所有样本中实现最低的MAE、MSE、RMSE和最高的R2值,显示出优越的性能。RFR有效捕获非线性相互作用,可靠地防止过拟合。这一研究发现将通过展示如何使用ML来更好地预测足球运动员的市场价格,从而增强体育经济学的现有知识。这将为足球队提供有价值的见解,以做出更多的战略决策。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.00
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信