Athletic signature: predicting the next game lineup in collegiate basketball

Srishti Sharma, Srikrishnan Divakaran, Tolga Kaya, Mehul Raval
{"title":"Athletic signature: predicting the next game lineup in collegiate basketball","authors":"Srishti Sharma, Srikrishnan Divakaran, Tolga Kaya, Mehul Raval","doi":"10.1007/s00521-024-10383-z","DOIUrl":null,"url":null,"abstract":"<p>The advances in machine learning (ML) tools and techniques have enabled the non-intrusive collection and rapid analysis of massive amounts of data involving athletes in competitive collegiate sports. It has facilitated the development of services that a coach can employ in analyzing these data into actionable insights in designing training schedules and effective strategies for maximizing an athlete’s performance, while minimizing injury risk. Collegiate sports utilize data to get a competitive advantage. While game statistics are publicly available, relying on more than one form of data can help reveal a pattern. We developed a framework that considers various modalities and creates an athletic signature to predict their future performance. Our research involves the study of 42 distinct features that quantify various internal/external stressors the athletes face to characterize and estimate their athletic readiness (in the form of reactive strength index modified—RSImod) using ML algorithms. Our study, conducted over 26 weeks with 17 collegiate women’s basketball athletes, developed a framework that first performed sensitivity analysis using a hybrid approach combining the strengths of various filter-based, wrapper-based, and embedded feature importance techniques to identify the features most significantly impacting athlete readiness. These features were then categorized into four moderating variables (MVs, i.e. factors): sleep, cardiac rhythm, training strain, and travel schedule. Further, we used factor analysis to enhance interpretability and reduce computational complexity. A hybrid boosted-decision-trees-based model designed based on athlete clusters predicted future athletic readiness based on MVs with a mean squared error (MSE) of 0.0102. Partial dependence plots (PDPs) helped qualitatively learn the relationship between the moderating variables and the RSImod score. Athletic signatures, uniquely defining athlete-specific MV patterns, account for intra-individual variability, offering a better statistical basis for predicting game lineup (green/yellow/red card assignment) in combination with model predictions. SHAP (SHapley Additive exPlanations) values suggest the causative MV in order of significance for each prediction, enabling coaches to make informed decisions about training adjustments and athlete well-being, leading to performance improvement. Using the fingerprint mechanism, we created green (within 1 Standard Deviation (SD)), yellow (&gt; 1SD and &lt; 2SD), and red card (&gt; 2SD) zones for athlete readiness assessment. While, this study was conducted on D-I women’s basketball, the modalities apply to several sports, such as soccer, volleyball, football, and ice hockey. This framework allows coaches to understand their athlete dynamics from a strictly data perspective, which helps them strategize their next moves, combined with their personal experience and interactions with the team.</p>","PeriodicalId":18925,"journal":{"name":"Neural Computing and Applications","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Neural Computing and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s00521-024-10383-z","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

The advances in machine learning (ML) tools and techniques have enabled the non-intrusive collection and rapid analysis of massive amounts of data involving athletes in competitive collegiate sports. It has facilitated the development of services that a coach can employ in analyzing these data into actionable insights in designing training schedules and effective strategies for maximizing an athlete’s performance, while minimizing injury risk. Collegiate sports utilize data to get a competitive advantage. While game statistics are publicly available, relying on more than one form of data can help reveal a pattern. We developed a framework that considers various modalities and creates an athletic signature to predict their future performance. Our research involves the study of 42 distinct features that quantify various internal/external stressors the athletes face to characterize and estimate their athletic readiness (in the form of reactive strength index modified—RSImod) using ML algorithms. Our study, conducted over 26 weeks with 17 collegiate women’s basketball athletes, developed a framework that first performed sensitivity analysis using a hybrid approach combining the strengths of various filter-based, wrapper-based, and embedded feature importance techniques to identify the features most significantly impacting athlete readiness. These features were then categorized into four moderating variables (MVs, i.e. factors): sleep, cardiac rhythm, training strain, and travel schedule. Further, we used factor analysis to enhance interpretability and reduce computational complexity. A hybrid boosted-decision-trees-based model designed based on athlete clusters predicted future athletic readiness based on MVs with a mean squared error (MSE) of 0.0102. Partial dependence plots (PDPs) helped qualitatively learn the relationship between the moderating variables and the RSImod score. Athletic signatures, uniquely defining athlete-specific MV patterns, account for intra-individual variability, offering a better statistical basis for predicting game lineup (green/yellow/red card assignment) in combination with model predictions. SHAP (SHapley Additive exPlanations) values suggest the causative MV in order of significance for each prediction, enabling coaches to make informed decisions about training adjustments and athlete well-being, leading to performance improvement. Using the fingerprint mechanism, we created green (within 1 Standard Deviation (SD)), yellow (> 1SD and < 2SD), and red card (> 2SD) zones for athlete readiness assessment. While, this study was conducted on D-I women’s basketball, the modalities apply to several sports, such as soccer, volleyball, football, and ice hockey. This framework allows coaches to understand their athlete dynamics from a strictly data perspective, which helps them strategize their next moves, combined with their personal experience and interactions with the team.

Abstract Image

竞技签名:预测大学篮球队的下场比赛阵容
机器学习(ML)工具和技术的进步使我们能够非侵入式地收集和快速分析涉及大学竞技体育运动员的海量数据。它促进了服务的发展,教练可以利用这些数据分析出可行的见解,从而设计出训练计划和有效的策略,最大限度地提高运动员的成绩,同时最大限度地降低受伤风险。大学体育利用数据获得竞争优势。虽然比赛统计数据是公开的,但依靠一种以上的数据形式有助于揭示一种模式。我们开发了一个框架,该框架考虑了各种模式,并创建了一个运动特征来预测他们未来的表现。我们的研究涉及对 42 个不同特征的研究,这些特征量化了运动员面临的各种内部/外部压力,利用 ML 算法来描述和估计他们的运动准备状态(以反应强度指数 modified-RSImod 的形式)。我们的研究以 17 名大学女子篮球运动员为对象,历时 26 周,开发了一个框架,该框架首先使用一种混合方法进行敏感性分析,该方法结合了各种基于过滤器、基于包装和嵌入式特征重要性技术的优势,以确定对运动员准备状态影响最大的特征。然后,这些特征被归类为四个调节变量(MV,即因素):睡眠、心律、训练负荷和行程安排。此外,我们还使用了因子分析来增强可解释性并降低计算复杂性。基于运动员集群设计的混合助推决策树模型根据 MV 预测了未来的运动准备情况,平均平方误差(MSE)为 0.0102。偏倚图(PDP)有助于定性地了解调节变量与 RSImod 分数之间的关系。运动特征独特地定义了运动员特有的 MV 模式,考虑了个体内部的变异性,为结合模型预测比赛阵容(绿牌/黄牌/红牌分配)提供了更好的统计基础。SHAP(SHapley Additive exPlanations)值按每个预测的显著性顺序提出了MV的成因,使教练员能够就训练调整和运动员福利做出明智的决定,从而提高成绩。利用指纹机制,我们创建了绿色(1 个标准差以内)、黄色(1 个标准差和 2 个标准差)和红牌(2 个标准差)区域,用于评估运动员的准备情况。虽然这项研究是针对大一女子篮球进行的,但其模式适用于多种运动,如足球、排球、橄榄球和冰上曲棍球。这一框架使教练员能够从严格的数据角度了解运动员的动态,这有助于他们结合个人经验和与球队的互动,制定下一步行动的战略。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信