Evaluating machine learning approaches to predict the energy expenditure of cross-national preschool children: a study of preprocessing and feature effects.
Hannah J Coyle-Asbil, Mirko Brandes, Berit Brandes, Christoph Buck, Marvin N Wright, Lori Ann Vallis
{"title":"Evaluating machine learning approaches to predict the energy expenditure of cross-national preschool children: a study of preprocessing and feature effects.","authors":"Hannah J Coyle-Asbil, Mirko Brandes, Berit Brandes, Christoph Buck, Marvin N Wright, Lori Ann Vallis","doi":"10.1007/s00421-025-05800-7","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>This study aimed to examine the impact of preprocessing and inclusion of various features on predicting the energy expenditure (EE) of preschool children (3.0-6.99 years).</p><p><strong>Methods: </strong>The internal Canadian sample consisted of 36 children, equipped with accelerometers on their wrists (OPAL) and right hip (ActiGraph GT9X). The external German sample consisted of 41 children, equipped with accelerometers on their wrists (GENEActiv) and right hip (GENEActiv; ActiGraph GT3X +). Both datasets used portable metabolic units to record EE. The effects of filtering, rectifying, adding a time delay, frequency domain (FD) features, and participant features on EE prediction across linear regression, random forest (RF), and fully connected neural network models. The Canadian sample was split into training (2/3) and validation (1/3) sets, and the German sample served as an external validation set.</p><p><strong>Results: </strong>Consistently it was found that the RF with filtered, not rectified data with FD, participant features, and a time delay resulted in improved performance compared to approaches used previously. The models also performed similarly in the holdout sample but resulted in higher error when applied in the external validation dataset.</p><p><strong>Conclusions: </strong> Results attest that filtering, not rectifying, FD features and participant features result in improved model performance to predict the EE of preschool children.</p>","PeriodicalId":12005,"journal":{"name":"European Journal of Applied Physiology","volume":" ","pages":""},"PeriodicalIF":2.8000,"publicationDate":"2025-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Applied Physiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s00421-025-05800-7","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"PHYSIOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Purpose: This study aimed to examine the impact of preprocessing and inclusion of various features on predicting the energy expenditure (EE) of preschool children (3.0-6.99 years).
Methods: The internal Canadian sample consisted of 36 children, equipped with accelerometers on their wrists (OPAL) and right hip (ActiGraph GT9X). The external German sample consisted of 41 children, equipped with accelerometers on their wrists (GENEActiv) and right hip (GENEActiv; ActiGraph GT3X +). Both datasets used portable metabolic units to record EE. The effects of filtering, rectifying, adding a time delay, frequency domain (FD) features, and participant features on EE prediction across linear regression, random forest (RF), and fully connected neural network models. The Canadian sample was split into training (2/3) and validation (1/3) sets, and the German sample served as an external validation set.
Results: Consistently it was found that the RF with filtered, not rectified data with FD, participant features, and a time delay resulted in improved performance compared to approaches used previously. The models also performed similarly in the holdout sample but resulted in higher error when applied in the external validation dataset.
Conclusions: Results attest that filtering, not rectifying, FD features and participant features result in improved model performance to predict the EE of preschool children.
期刊介绍:
The European Journal of Applied Physiology (EJAP) aims to promote mechanistic advances in human integrative and translational physiology. Physiology is viewed broadly, having overlapping context with related disciplines such as biomechanics, biochemistry, endocrinology, ergonomics, immunology, motor control, and nutrition. EJAP welcomes studies dealing with physical exercise, training and performance. Studies addressing physiological mechanisms are preferred over descriptive studies. Papers dealing with animal models or pathophysiological conditions are not excluded from consideration, but must be clearly relevant to human physiology.