Prediction of defensive success in elite soccer using machine learning - Tactical analysis of defensive play using tracking data and explainable AI.

Science & medicine in football Pub Date : 2024-11-01 Epub Date: 2023-08-04 DOI:10.1080/24733938.2023.2239766
Leander Forcher, Tobias Beckmann, Oliver Wohak, Christian Romeike, Ferdinand Graf, Stefan Altmann
{"title":"Prediction of defensive success in elite soccer using machine learning - Tactical analysis of defensive play using tracking data and explainable AI.","authors":"Leander Forcher, Tobias Beckmann, Oliver Wohak, Christian Romeike, Ferdinand Graf, Stefan Altmann","doi":"10.1080/24733938.2023.2239766","DOIUrl":null,"url":null,"abstract":"<p><p>The interest in sports performance analysis is rising and tracking data holds high potential for game analysis in team sports due to its accuracy and informative content. Together with machine learning approaches one can obtain deeper and more objective insights into the performance structure. In soccer, the analysis of the defense was neglected in comparison to the offense. Therefore, the aim of this study is to predict ball gains in defense using tracking data to identify tactical variables that drive defensive success. We evaluated tracking data of 153 games of German Bundesliga season 2020/21. With it, we derived player (defensive pressure, distance to the ball, & velocity) and team metrics (inter-line distances, numerical superiority, surface area, & spread) each containing a tactical idea. Afterwards, we trained supervised machine learning classifiers (logistic regression, XGBoost, & Random Forest Classifier) to predict successful (ball gain) vs. unsuccessful defensive plays (no ball gain). The expert-reduction-model (Random Forest Classifier with 16 features) showed the best and satisfying prediction performance (F1-Score (test) = 0.57). Analyzing the most important input features of this model, we are able to identify tactical principles of defensive play that appear to be related to gaining the ball: press the ball leading player, create numerical superiority in areas close to the ball (press short pass options), compact organization of defending team. Those principles are highly interesting for practitioners to gain valuable insights in the tactical behavior of soccer players that may be related to the success of defensive play.</p>","PeriodicalId":74767,"journal":{"name":"Science & medicine in football","volume":" ","pages":"317-332"},"PeriodicalIF":0.0000,"publicationDate":"2024-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Science & medicine in football","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1080/24733938.2023.2239766","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2023/8/4 0:00:00","PubModel":"Epub","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The interest in sports performance analysis is rising and tracking data holds high potential for game analysis in team sports due to its accuracy and informative content. Together with machine learning approaches one can obtain deeper and more objective insights into the performance structure. In soccer, the analysis of the defense was neglected in comparison to the offense. Therefore, the aim of this study is to predict ball gains in defense using tracking data to identify tactical variables that drive defensive success. We evaluated tracking data of 153 games of German Bundesliga season 2020/21. With it, we derived player (defensive pressure, distance to the ball, & velocity) and team metrics (inter-line distances, numerical superiority, surface area, & spread) each containing a tactical idea. Afterwards, we trained supervised machine learning classifiers (logistic regression, XGBoost, & Random Forest Classifier) to predict successful (ball gain) vs. unsuccessful defensive plays (no ball gain). The expert-reduction-model (Random Forest Classifier with 16 features) showed the best and satisfying prediction performance (F1-Score (test) = 0.57). Analyzing the most important input features of this model, we are able to identify tactical principles of defensive play that appear to be related to gaining the ball: press the ball leading player, create numerical superiority in areas close to the ball (press short pass options), compact organization of defending team. Those principles are highly interesting for practitioners to gain valuable insights in the tactical behavior of soccer players that may be related to the success of defensive play.

利用机器学习预测精英足球的防守成功率 - 利用跟踪数据和可解释的人工智能对防守战术进行分析。
人们对运动成绩分析的兴趣日益高涨,而跟踪数据因其准确性和信息量大,在团队运动的比赛分析中具有很大的潜力。结合机器学习方法,我们可以更深入、更客观地了解成绩结构。在足球比赛中,与进攻相比,对防守的分析被忽视了。因此,本研究的目的是利用跟踪数据预测防守中的得球率,以确定推动防守成功的战术变量。我们评估了 2020/21 赛季德国足球甲级联赛 153 场比赛的跟踪数据。通过这些数据,我们得出了球员(防守压力、到球距离和速度)和球队指标(线间距离、人数优势、表面积和散布),每个指标都包含一个战术思想。之后,我们训练了有监督的机器学习分类器(逻辑回归、XGBoost 和随机森林分类器),以预测成功(得球)与不成功(无球)的防守战术。专家还原模型(具有 16 个特征的随机森林分类器)显示出最佳和令人满意的预测性能(F1-分数(测试)= 0.57)。通过分析该模型最重要的输入特征,我们可以确定与得球有关的防守战术原则:压迫带球球员、在靠近球的区域创造人数优势(压迫短传选择)、防守队伍的紧凑组织。这些原则对实践者来说非常有趣,他们可以从中获得足球运动员战术行为的宝贵见解,而这些见解可能与防守战术的成功有关。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信