Evaluating machine learning approaches to predict the energy expenditure of cross-national preschool children: a study of preprocessing and feature effects.

IF 2.8 3区 医学 Q2 PHYSIOLOGY
Hannah J Coyle-Asbil, Mirko Brandes, Berit Brandes, Christoph Buck, Marvin N Wright, Lori Ann Vallis
{"title":"Evaluating machine learning approaches to predict the energy expenditure of cross-national preschool children: a study of preprocessing and feature effects.","authors":"Hannah J Coyle-Asbil, Mirko Brandes, Berit Brandes, Christoph Buck, Marvin N Wright, Lori Ann Vallis","doi":"10.1007/s00421-025-05800-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aimed to examine the impact of preprocessing and inclusion of various features on predicting the energy expenditure (EE) of preschool children (3.0-6.99 years).</p><p><strong>Methods: </strong>The internal Canadian sample consisted of 36 children, equipped with accelerometers on their wrists (OPAL) and right hip (ActiGraph GT9X). The external German sample consisted of 41 children, equipped with accelerometers on their wrists (GENEActiv) and right hip (GENEActiv; ActiGraph GT3X +). Both datasets used portable metabolic units to record EE. The effects of filtering, rectifying, adding a time delay, frequency domain (FD) features, and participant features on EE prediction across linear regression, random forest (RF), and fully connected neural network models. The Canadian sample was split into training (2/3) and validation (1/3) sets, and the German sample served as an external validation set.</p><p><strong>Results: </strong>Consistently it was found that the RF with filtered, not rectified data with FD, participant features, and a time delay resulted in improved performance compared to approaches used previously. The models also performed similarly in the holdout sample but resulted in higher error when applied in the external validation dataset.</p><p><strong>Conclusions: </strong> Results attest that filtering, not rectifying, FD features and participant features result in improved model performance to predict the EE of preschool children.</p>","PeriodicalId":12005,"journal":{"name":"European Journal of Applied Physiology","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Applied Physiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00421-025-05800-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Purpose: This study aimed to examine the impact of preprocessing and inclusion of various features on predicting the energy expenditure (EE) of preschool children (3.0-6.99 years).

Methods: The internal Canadian sample consisted of 36 children, equipped with accelerometers on their wrists (OPAL) and right hip (ActiGraph GT9X). The external German sample consisted of 41 children, equipped with accelerometers on their wrists (GENEActiv) and right hip (GENEActiv; ActiGraph GT3X +). Both datasets used portable metabolic units to record EE. The effects of filtering, rectifying, adding a time delay, frequency domain (FD) features, and participant features on EE prediction across linear regression, random forest (RF), and fully connected neural network models. The Canadian sample was split into training (2/3) and validation (1/3) sets, and the German sample served as an external validation set.

Results: Consistently it was found that the RF with filtered, not rectified data with FD, participant features, and a time delay resulted in improved performance compared to approaches used previously. The models also performed similarly in the holdout sample but resulted in higher error when applied in the external validation dataset.

Conclusions:  Results attest that filtering, not rectifying, FD features and participant features result in improved model performance to predict the EE of preschool children.

评估机器学习方法预测跨国学龄前儿童的能量消耗:预处理和特征效应的研究。
目的:本研究旨在探讨各种特征的预处理和纳入对预测学龄前儿童(3.0-6.99岁)能量消耗(EE)的影响。方法:加拿大内部样本包括36名儿童,在他们的手腕(OPAL)和右髋关节(ActiGraph GT9X)上配备加速度计。外部德国样本包括41名儿童,在他们的手腕(GENEActiv)和右臀部(GENEActiv;ActiGraph GT3X +)。两个数据集都使用便携式代谢单位记录EE。滤波、整流、添加时滞、频域(FD)特征和参与者特征对跨线性回归、随机森林(RF)和全连接神经网络模型的EE预测的影响。加拿大样本分为训练集(2/3)和验证集(1/3),德国样本作为外部验证集。结果:一致地发现,与以前使用的方法相比,带有FD、参与者特征和时间延迟的过滤的、未校正的数据的RF导致了性能的提高。这些模型在holdout样本中也有类似的表现,但在外部验证数据集中应用时会导致更高的误差。结论:结果证明,过滤而非校正、FD特征和参与者特征可以提高模型预测学龄前儿童情感表达的性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
6.00
自引率
6.70%
发文量
227
审稿时长
3 months
期刊介绍: The European Journal of Applied Physiology (EJAP) aims to promote mechanistic advances in human integrative and translational physiology. Physiology is viewed broadly, having overlapping context with related disciplines such as biomechanics, biochemistry, endocrinology, ergonomics, immunology, motor control, and nutrition. EJAP welcomes studies dealing with physical exercise, training and performance. Studies addressing physiological mechanisms are preferred over descriptive studies. Papers dealing with animal models or pathophysiological conditions are not excluded from consideration, but must be clearly relevant to human physiology.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信