Detecting Outliers in Cardiopulmonary Exercise Testing Data of Ski Racers – A Comparison of Methods and their Effect on the Performance of Fatigue Prediction

Q2 Computer Science
N. Baumgartner, C. Kranzinger, S. Kranzinger, C. Snyder, T. Stöggl, B. Resch
{"title":"Detecting Outliers in Cardiopulmonary Exercise Testing Data of Ski Racers – A Comparison of Methods and their Effect on the Performance of Fatigue Prediction","authors":"N. Baumgartner, C. Kranzinger, S. Kranzinger, C. Snyder, T. Stöggl, B. Resch","doi":"10.2478/ijcss-2023-0005","DOIUrl":null,"url":null,"abstract":"Abstract In sports science, cardiopulmonary data is used to assess exercise intensity, performance and health status of athletes and derive relevant target values. However, sensors may produce flawed data and data may include a wide variety of artifacts, which could potentially lead to false conclusions. Thus, appropriate and customized pre-processing algorithms are a vital prerequisite for producing reliable and valid analysis results. To find adequate outlier detection methods for this type of data, we compared three algorithms by applying them on seven ergospirometric measures of junior ski racing athletes and applied a model to predict fatigue during skiing based on the pre-processed data. While values that lie outside a realistic spectrum were consistently labelled as outliers by all methods, and mean values and standard deviations changed in similar ways, methods differed from each other when it comes to changing trends, recurring patterns, and subsequent outliers. Decomposing the sensor data into different components (trend, seasonality, remainder) before dealing with outliers increased average predictive performance the most. However, pre-processing remarkably improved prediction results for certain study participants and not for others. Thus, handling outliers correctly prior to deriving information from ergospirometric data is recommended but more research should be conducted to find methods that achieve more consistent improvement.","PeriodicalId":38466,"journal":{"name":"International Journal of Computer Science in Sport","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2023-03-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Computer Science in Sport","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2478/ijcss-2023-0005","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"Computer Science","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract In sports science, cardiopulmonary data is used to assess exercise intensity, performance and health status of athletes and derive relevant target values. However, sensors may produce flawed data and data may include a wide variety of artifacts, which could potentially lead to false conclusions. Thus, appropriate and customized pre-processing algorithms are a vital prerequisite for producing reliable and valid analysis results. To find adequate outlier detection methods for this type of data, we compared three algorithms by applying them on seven ergospirometric measures of junior ski racing athletes and applied a model to predict fatigue during skiing based on the pre-processed data. While values that lie outside a realistic spectrum were consistently labelled as outliers by all methods, and mean values and standard deviations changed in similar ways, methods differed from each other when it comes to changing trends, recurring patterns, and subsequent outliers. Decomposing the sensor data into different components (trend, seasonality, remainder) before dealing with outliers increased average predictive performance the most. However, pre-processing remarkably improved prediction results for certain study participants and not for others. Thus, handling outliers correctly prior to deriving information from ergospirometric data is recommended but more research should be conducted to find methods that achieve more consistent improvement.
滑雪运动员心肺运动测试数据异常值的检测——疲劳预测方法的比较及其对性能的影响
在运动科学中,心肺数据被用来评估运动员的运动强度、表现和健康状况,并得出相关的目标值。然而,传感器可能产生有缺陷的数据,数据可能包括各种各样的伪影,这可能导致错误的结论。因此,适当和定制的预处理算法是产生可靠和有效的分析结果的重要前提。为了找到适合这类数据的异常值检测方法,我们对三种算法进行了比较,将它们应用于初级滑雪比赛运动员的七项人体呼吸量测量,并基于预处理数据应用了一个模型来预测滑雪过程中的疲劳。虽然所有方法都将超出现实范围的值标记为异常值,并且平均值和标准差以类似的方式变化,但当涉及到变化趋势,重复模式和随后的异常值时,方法各不相同。在处理异常值之前,将传感器数据分解为不同的组成部分(趋势、季节性、剩余),可以最大程度地提高平均预测性能。然而,预处理显著改善了某些研究参与者的预测结果,而对其他参与者则没有。因此,建议在从肺活量计数据中获得信息之前正确处理异常值,但应该进行更多的研究以找到实现更一致改善的方法。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
International Journal of Computer Science in Sport
International Journal of Computer Science in Sport Computer Science-Computer Science (all)
CiteScore
2.20
自引率
0.00%
发文量
4
审稿时长
12 weeks
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信