Xiangyu Ren, Simon Boisbluche, Kilian Philippe, Mathieu Demy, Sami Äyrämö, Ilkka Rautiainen, Shuzhe Ding, Jacques Prioux
{"title":"全球定位系统衍生的指标和机器学习模型用于职业橄榄球联盟球员的损伤预测。","authors":"Xiangyu Ren, Simon Boisbluche, Kilian Philippe, Mathieu Demy, Sami Äyrämö, Ilkka Rautiainen, Shuzhe Ding, Jacques Prioux","doi":"10.1002/ejsc.70057","DOIUrl":null,"url":null,"abstract":"<p>In sports, injury prevention is a key factor for success. Although injuries are challenging to predict, new technologies and the application of data science can provide valuable insights. This study aimed to predict injury risk among professional rugby union players using machine learning (ML) models. We analyzed data from 63 professional rugby union players during three seasons, categorized them into forwards and backs, and further classified them into five specific positions (tight five, back row, scrum-half, inside backs, outside backs). The dataset included GPS data and derived metrics such as total workload in the 1, 2, and 3 weeks prior to injury, acute-to-chronic workload ratio over different time windows, monotony, and strain. Injury prediction was assessed separately for different player positions using five ML classification models: logistic regression, naïve Bayes (NB), support vector machine, random forest (RF), and eXtreme gradient boosting (XGBoost). RF performed best for forwards overall, with XGBoost excelling in the tight five and SVM in the back row, whereas among backs, RF led for inside backs and NB for outside backs. Additionally, feature importance plots were used to examine the impact of various factors on injury occurrence. In conclusion, our ML-based approach can effectively predict injuries, with average <i>F</i>1 scores up to 0.66 (± 0.14), particularly when applying a combination of GPS-derived metrics. Additionally, key characteristics indicative of injury for players in various positions have been successfully identified. These findings underscored the potential of ML to enhance injury prediction and inform tailored training strategies for athletes.</p>","PeriodicalId":93999,"journal":{"name":"European journal of sport science","volume":"25 10","pages":""},"PeriodicalIF":3.0000,"publicationDate":"2025-09-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ejsc.70057","citationCount":"0","resultStr":"{\"title\":\"Global Positioning System-Derived Metrics and Machine Learning Models for Injury Prediction in Professional Rugby Union Players\",\"authors\":\"Xiangyu Ren, Simon Boisbluche, Kilian Philippe, Mathieu Demy, Sami Äyrämö, Ilkka Rautiainen, Shuzhe Ding, Jacques Prioux\",\"doi\":\"10.1002/ejsc.70057\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p>In sports, injury prevention is a key factor for success. Although injuries are challenging to predict, new technologies and the application of data science can provide valuable insights. This study aimed to predict injury risk among professional rugby union players using machine learning (ML) models. We analyzed data from 63 professional rugby union players during three seasons, categorized them into forwards and backs, and further classified them into five specific positions (tight five, back row, scrum-half, inside backs, outside backs). The dataset included GPS data and derived metrics such as total workload in the 1, 2, and 3 weeks prior to injury, acute-to-chronic workload ratio over different time windows, monotony, and strain. Injury prediction was assessed separately for different player positions using five ML classification models: logistic regression, naïve Bayes (NB), support vector machine, random forest (RF), and eXtreme gradient boosting (XGBoost). RF performed best for forwards overall, with XGBoost excelling in the tight five and SVM in the back row, whereas among backs, RF led for inside backs and NB for outside backs. Additionally, feature importance plots were used to examine the impact of various factors on injury occurrence. In conclusion, our ML-based approach can effectively predict injuries, with average <i>F</i>1 scores up to 0.66 (± 0.14), particularly when applying a combination of GPS-derived metrics. Additionally, key characteristics indicative of injury for players in various positions have been successfully identified. These findings underscored the potential of ML to enhance injury prediction and inform tailored training strategies for athletes.</p>\",\"PeriodicalId\":93999,\"journal\":{\"name\":\"European journal of sport science\",\"volume\":\"25 10\",\"pages\":\"\"},\"PeriodicalIF\":3.0000,\"publicationDate\":\"2025-09-24\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.1002/ejsc.70057\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"European journal of sport science\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/ejsc.70057\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"European journal of sport science","FirstCategoryId":"1085","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/ejsc.70057","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Global Positioning System-Derived Metrics and Machine Learning Models for Injury Prediction in Professional Rugby Union Players
In sports, injury prevention is a key factor for success. Although injuries are challenging to predict, new technologies and the application of data science can provide valuable insights. This study aimed to predict injury risk among professional rugby union players using machine learning (ML) models. We analyzed data from 63 professional rugby union players during three seasons, categorized them into forwards and backs, and further classified them into five specific positions (tight five, back row, scrum-half, inside backs, outside backs). The dataset included GPS data and derived metrics such as total workload in the 1, 2, and 3 weeks prior to injury, acute-to-chronic workload ratio over different time windows, monotony, and strain. Injury prediction was assessed separately for different player positions using five ML classification models: logistic regression, naïve Bayes (NB), support vector machine, random forest (RF), and eXtreme gradient boosting (XGBoost). RF performed best for forwards overall, with XGBoost excelling in the tight five and SVM in the back row, whereas among backs, RF led for inside backs and NB for outside backs. Additionally, feature importance plots were used to examine the impact of various factors on injury occurrence. In conclusion, our ML-based approach can effectively predict injuries, with average F1 scores up to 0.66 (± 0.14), particularly when applying a combination of GPS-derived metrics. Additionally, key characteristics indicative of injury for players in various positions have been successfully identified. These findings underscored the potential of ML to enhance injury prediction and inform tailored training strategies for athletes.