Improved prediction of swimming talent through random forest analysis of anthropometric and physiological phenotypes.

IF 3.7 Q2 GENETICS & HEREDITY
Phenomics (Cham, Switzerland) Pub Date : 2024-11-20 eCollection Date: 2024-10-01 DOI:10.1007/s43657-024-00176-8
Cheng Liu, Bingxiang Xu, Kang Wan, Qin Sun, Ruwen Wang, Yue Feng, Hui Shao, Tiemin Liu, Ru Wang
{"title":"Improved prediction of swimming talent through random forest analysis of anthropometric and physiological phenotypes.","authors":"Cheng Liu, Bingxiang Xu, Kang Wan, Qin Sun, Ruwen Wang, Yue Feng, Hui Shao, Tiemin Liu, Ru Wang","doi":"10.1007/s43657-024-00176-8","DOIUrl":null,"url":null,"abstract":"<p><p>The field of competitive swimming lacks broadly applicable predictive models for talent identification across various age groups of adolescent swimmers. This study aimed to construct a predictive model for athletic talent using machine learning methods based on anthropometric and physiological data. Baseline data were collected from 5444 participants aged 10-18 in Shanghai, China, between 2015 and 2018, with 4969 completing a 3-year follow-up. Talents were discerned based on their performance over the follow-up period, revealing age- and sex- dependent developmental differences between swimmers classified as talented versus non-talented. After controlling for confounding variables, age and sex, nine machine learning algorithms were employed, with Random Forest achieving the highest performance and being selected as the final model. The model demonstrated excellent predictive performance on both the test dataset and an independent validation dataset from Shandong (<i>n</i> = 118), indicating its strong generalizability. Furthermore, using the SHapley Additive exPlanations (SHAP) method to interpret the model, abdominal skinfold, lung capacity, chest circumference, shoulder width, and triceps skinfold were identified as the five most critical indicators for talent identification.</p><p><strong>Supplementary information: </strong>The online version contains supplementary material available at 10.1007/s43657-024-00176-8.</p>","PeriodicalId":74435,"journal":{"name":"Phenomics (Cham, Switzerland)","volume":"4 5","pages":"465-472"},"PeriodicalIF":3.7000,"publicationDate":"2024-11-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11666874/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Phenomics (Cham, Switzerland)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1007/s43657-024-00176-8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GENETICS & HEREDITY","Score":null,"Total":0}
引用次数: 0

Abstract

The field of competitive swimming lacks broadly applicable predictive models for talent identification across various age groups of adolescent swimmers. This study aimed to construct a predictive model for athletic talent using machine learning methods based on anthropometric and physiological data. Baseline data were collected from 5444 participants aged 10-18 in Shanghai, China, between 2015 and 2018, with 4969 completing a 3-year follow-up. Talents were discerned based on their performance over the follow-up period, revealing age- and sex- dependent developmental differences between swimmers classified as talented versus non-talented. After controlling for confounding variables, age and sex, nine machine learning algorithms were employed, with Random Forest achieving the highest performance and being selected as the final model. The model demonstrated excellent predictive performance on both the test dataset and an independent validation dataset from Shandong (n = 118), indicating its strong generalizability. Furthermore, using the SHapley Additive exPlanations (SHAP) method to interpret the model, abdominal skinfold, lung capacity, chest circumference, shoulder width, and triceps skinfold were identified as the five most critical indicators for talent identification.

Supplementary information: The online version contains supplementary material available at 10.1007/s43657-024-00176-8.

通过人体测量和生理表型的随机森林分析改进游泳天赋的预测。
竞技游泳领域缺乏广泛适用的预测模型来识别不同年龄段的青少年游泳运动员。本研究旨在基于人体测量学和生理学数据,利用机器学习方法构建运动人才的预测模型。基线数据收集自2015年至2018年间中国上海5444名年龄在10-18岁的参与者,其中4969人完成了为期3年的随访。天才是根据他们在随访期间的表现来识别的,揭示了被归类为天才和非天才的游泳运动员之间年龄和性别依赖的发展差异。在控制了混杂变量、年龄和性别后,采用了9种机器学习算法,其中Random Forest的性能最高,被选为最终模型。该模型在测试数据集和来自山东的独立验证数据集(n = 118)上均表现出出色的预测性能,表明其具有较强的泛化能力。此外,采用SHapley加性解释(SHAP)方法对模型进行解释,确定腹部皮褶、肺活量、胸围、肩宽和三头肌皮褶为人才识别的五个最关键指标。补充资料:在线版本包含补充资料,下载地址:10.1007/s43657-024-00176-8。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信