{"title":"\"Outlier detection: Underused in sport science despite outliers' impact on inference and prediction\".","authors":"Marco Zanin, Réka Lóczi, Alisa Zanin","doi":"10.1080/02640414.2024.2443313","DOIUrl":null,"url":null,"abstract":"<p><p>Outlier detection is critical in statistical analysis and predictive modelling, but it is often overlooked in research, leading to potentially inaccurate conclusions. This study aimed to (1) assess the prevalence of outlier detection strategies in sport science publications, (2) examine the effect of outliers on statistical inference using general linear mixed-effects models with longitudinal data, and (3) evaluate the impact of outliers on regression predictive models using the same dataset. A systematic literature search of 4,622 articles published in 2023 in Q1 journals in \"sport science\" category found that only 9.05% (99% CI: 4.87%-14.99%) reported employing outlier detection practices. A comprehensive outlier detection framework was designed to enhance data quality before formal statistical analysis using general(ised) linear mixed-effects models. The framework was applied to publicly available longitudinal data from rugby union small-sided games. Inclusion of outliers resulted in inaccurate maximum likelihood estimates, inflated standard errors, and biased conclusions regarding model parameters. Removing error outliers significantly improved the accuracy of predictive models, as evidenced by reduced root mean square errors. These findings underscore the importance of outlier detection in sport science research and demonstrate that appropriate handling of outliers enhances the validity of statistical inferences and predictions.</p>","PeriodicalId":17066,"journal":{"name":"Journal of Sports Sciences","volume":" ","pages":"1-11"},"PeriodicalIF":2.3000,"publicationDate":"2024-12-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Sports Sciences","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1080/02640414.2024.2443313","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SPORT SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Outlier detection is critical in statistical analysis and predictive modelling, but it is often overlooked in research, leading to potentially inaccurate conclusions. This study aimed to (1) assess the prevalence of outlier detection strategies in sport science publications, (2) examine the effect of outliers on statistical inference using general linear mixed-effects models with longitudinal data, and (3) evaluate the impact of outliers on regression predictive models using the same dataset. A systematic literature search of 4,622 articles published in 2023 in Q1 journals in "sport science" category found that only 9.05% (99% CI: 4.87%-14.99%) reported employing outlier detection practices. A comprehensive outlier detection framework was designed to enhance data quality before formal statistical analysis using general(ised) linear mixed-effects models. The framework was applied to publicly available longitudinal data from rugby union small-sided games. Inclusion of outliers resulted in inaccurate maximum likelihood estimates, inflated standard errors, and biased conclusions regarding model parameters. Removing error outliers significantly improved the accuracy of predictive models, as evidenced by reduced root mean square errors. These findings underscore the importance of outlier detection in sport science research and demonstrate that appropriate handling of outliers enhances the validity of statistical inferences and predictions.
期刊介绍:
The Journal of Sports Sciences has an international reputation for publishing articles of a high standard and is both Medline and Clarivate Analytics-listed. It publishes research on various aspects of the sports and exercise sciences, including anatomy, biochemistry, biomechanics, performance analysis, physiology, psychology, sports medicine and health, as well as coaching and talent identification, kinanthropometry and other interdisciplinary perspectives.
The emphasis of the Journal is on the human sciences, broadly defined and applied to sport and exercise. Besides experimental work in human responses to exercise, the subjects covered will include human responses to technologies such as the design of sports equipment and playing facilities, research in training, selection, performance prediction or modification, and stress reduction or manifestation. Manuscripts considered for publication include those dealing with original investigations of exercise, validation of technological innovations in sport or comprehensive reviews of topics relevant to the scientific study of sport.