On discretising continuous data for subgroup analysis in baseball pitching and alternative analytical methodologies.

IF 4.6 Q2 MATERIALS SCIENCE, BIOMATERIALS
Kevin A Giordano, Kyle W Wasserberger
{"title":"On discretising continuous data for subgroup analysis in baseball pitching and alternative analytical methodologies.","authors":"Kevin A Giordano, Kyle W Wasserberger","doi":"10.1080/14763141.2024.2377219","DOIUrl":null,"url":null,"abstract":"<p><p>There is a plethora of research attempting to contrast high- and low-velocity pitchers to identify traits to target for increasing velocity. However, pitch velocity exists on a continuum. Therefore, our purpose is to display the analytical discrepancies between creating velocity subgroups and leaving velocity as a continuous variable by examining the influence of ball velocity on elbow valgus torque. Motion capture data for 1315 actively competing pitchers were retrospectively extracted from a private database. We compared three analytic methods: (1) linear regression of valgus torque on ball velocity, (2) t-test between low- and high-velocity groups formed by a median split, and (3) t-test between very low- and very high-velocity groups formed by upper and lower velocity quartiles. Linear regression indicates ball velocity influenced valgus torque (<i>p</i> < 0.001, R<sup>2</sup> = 0.280). Median splitting reduced the predictability of ball velocity on valgus torque (<i>p</i> < 0.001, R<sup>2</sup> = 0.180). Conversely, extreme group splitting artificially inflated the effect size (<i>p</i> < 0.001, R<sup>2</sup> = 0.347). We recommend sports biomechanics researchers not discretise a continuous variable to form subgroups for analysis because (1) it distorts the relationship between the variables of interest and (2) a regression equation can be used to estimate the dependent variable at any value of the independent variable, not just the group means.</p>","PeriodicalId":2,"journal":{"name":"ACS Applied Bio Materials","volume":null,"pages":null},"PeriodicalIF":4.6000,"publicationDate":"2024-07-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ACS Applied Bio Materials","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1080/14763141.2024.2377219","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"MATERIALS SCIENCE, BIOMATERIALS","Score":null,"Total":0}
引用次数: 0

Abstract

There is a plethora of research attempting to contrast high- and low-velocity pitchers to identify traits to target for increasing velocity. However, pitch velocity exists on a continuum. Therefore, our purpose is to display the analytical discrepancies between creating velocity subgroups and leaving velocity as a continuous variable by examining the influence of ball velocity on elbow valgus torque. Motion capture data for 1315 actively competing pitchers were retrospectively extracted from a private database. We compared three analytic methods: (1) linear regression of valgus torque on ball velocity, (2) t-test between low- and high-velocity groups formed by a median split, and (3) t-test between very low- and very high-velocity groups formed by upper and lower velocity quartiles. Linear regression indicates ball velocity influenced valgus torque (p < 0.001, R2 = 0.280). Median splitting reduced the predictability of ball velocity on valgus torque (p < 0.001, R2 = 0.180). Conversely, extreme group splitting artificially inflated the effect size (p < 0.001, R2 = 0.347). We recommend sports biomechanics researchers not discretise a continuous variable to form subgroups for analysis because (1) it distorts the relationship between the variables of interest and (2) a regression equation can be used to estimate the dependent variable at any value of the independent variable, not just the group means.

棒球投球中用于分组分析的连续数据离散化及替代分析方法。
有大量研究试图对比高投速和低投速投手,以确定提高投速的目标特征。然而,投球速度是一个连续体。因此,我们的目的是通过研究球速对肘外翻力矩的影响,来显示创建速度分组与将速度作为连续变量之间的分析差异。我们从一个私人数据库中回顾性地提取了 1315 名积极参加比赛的投手的运动捕捉数据。我们比较了三种分析方法:(1)肘外翻力矩对球速的线性回归;(2)以中位数划分的低速组和高速组之间的 t 检验;以及(3)以上下速度四分位数划分的极低速组和极高速组之间的 t 检验。线性回归结果表明,球速对外翻力矩有影响(p 2 = 0.280)。中位数拆分降低了球速对外翻扭矩的可预测性(p 2 = 0.180)。相反,极端分组则人为地扩大了效应大小(p 2 = 0.347)。我们建议运动生物力学研究人员不要将连续变量离散化以形成分组进行分析,因为(1)这样做会扭曲相关变量之间的关系;(2)回归方程可用于估计因变量在自变量的任何值上的影响,而不仅仅是分组平均值。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
ACS Applied Bio Materials
ACS Applied Bio Materials Chemistry-Chemistry (all)
CiteScore
9.40
自引率
2.10%
发文量
464
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信