José M Oliva-Lozano, Miguel Vidal, Farzad Yousefian, Rick Cost, Tim J Gabbett
{"title":"Predicting the Match Outcome in the 2023 FIFA Women's World Cup and Analysis of Influential Features.","authors":"José M Oliva-Lozano, Miguel Vidal, Farzad Yousefian, Rick Cost, Tim J Gabbett","doi":"10.5114/jhk/195563","DOIUrl":null,"url":null,"abstract":"<p><p>The aim of this study was to build an XGBoost model to predict the match outcome and analyze match-related technical, tactical and physical performance features that may influence the predicted outcome of the match. This is an observational study which follows a retrospective design. The FIFA post-match summary reports were downloaded at the end of the 2023 Women's World Cup and used to create a dataset which consisted of match-related technical, tactical and physical performance variables. Then, an XGBoost model was built to predict the match outcome and investigate which performance features might influence the predicted outcome of the match. The overall model achieved accuracy of 0.58 ± 0.05. Losses and wins had similar predictive accuracy (0.67 ± 0.06 and 0.67 ± 0.08, respectively), but the prediction of draws performed was significantly worse with accuracy of 0.32 ± 0.16. The top ten features for predicting wins were: (1) out to in actions by the opponent, (2) attempts at the goal, (3) in-behind actions, (4) interceptions by the opponent, (5) loose ball receptions, (6) sprinting per minute by the opponent, (7) offers received by the opponent, (8) in-front opponent, (9) interceptions, and (10) total distance per minute. The top ten features for predicting losses were: (1) attempts at the goal by the opponent, (2) interceptions, (3) out to in actions, (4) possessions interrupted, (5) loose ball receptions by the opponent, (6) in front movements, (7) distance covered by the opponent, (8) in-behind actions by the opponent, (9) total distance, and (10) sprinting per minute. In conclusion, using an XGBoost model, this is the first study to successfully predict the match outcome for wins and losses from the FIFA Women's World Cup, but also explain which features significantly influence the prediction. This study may serve as a guide for practitioners regarding the use and application of XGBoost models in high performance.</p>","PeriodicalId":16055,"journal":{"name":"Journal of Human Kinetics","volume":"98 ","pages":"169-182"},"PeriodicalIF":2.8000,"publicationDate":"2025-05-29","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12360935/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Human Kinetics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.5114/jhk/195563","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/7/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"SPORT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
The aim of this study was to build an XGBoost model to predict the match outcome and analyze match-related technical, tactical and physical performance features that may influence the predicted outcome of the match. This is an observational study which follows a retrospective design. The FIFA post-match summary reports were downloaded at the end of the 2023 Women's World Cup and used to create a dataset which consisted of match-related technical, tactical and physical performance variables. Then, an XGBoost model was built to predict the match outcome and investigate which performance features might influence the predicted outcome of the match. The overall model achieved accuracy of 0.58 ± 0.05. Losses and wins had similar predictive accuracy (0.67 ± 0.06 and 0.67 ± 0.08, respectively), but the prediction of draws performed was significantly worse with accuracy of 0.32 ± 0.16. The top ten features for predicting wins were: (1) out to in actions by the opponent, (2) attempts at the goal, (3) in-behind actions, (4) interceptions by the opponent, (5) loose ball receptions, (6) sprinting per minute by the opponent, (7) offers received by the opponent, (8) in-front opponent, (9) interceptions, and (10) total distance per minute. The top ten features for predicting losses were: (1) attempts at the goal by the opponent, (2) interceptions, (3) out to in actions, (4) possessions interrupted, (5) loose ball receptions by the opponent, (6) in front movements, (7) distance covered by the opponent, (8) in-behind actions by the opponent, (9) total distance, and (10) sprinting per minute. In conclusion, using an XGBoost model, this is the first study to successfully predict the match outcome for wins and losses from the FIFA Women's World Cup, but also explain which features significantly influence the prediction. This study may serve as a guide for practitioners regarding the use and application of XGBoost models in high performance.
期刊介绍:
The Journal of Human Kinetics is an open access interdisciplinary periodical offering the latest research in the science of human movement studies. This comprehensive professional journal features articles and research notes encompassing such topic areas as: Kinesiology, Exercise Physiology and Nutrition, Sports Training and Behavioural Sciences in Sport, but especially considering elite and competitive aspects of sport.
The journal publishes original papers, invited reviews, short communications and letters to the Editors. Manuscripts submitted to the journal must contain novel data on theoretical or experimental research or on practical applications in the field of sport sciences.
The Journal of Human Kinetics is published in March, June, September and December.
We encourage scientists from around the world to submit their papers to our periodical.