Rodrigo Yáñez-Sepúlveda, Rodrigo Olivares, Pablo Olivares, Juan Pablo Zavala-Crichton, Claudio Hinojosa-Torres, Frano Giakoni-Ramírez, Josivaldo de Souza-Lima, Matías Monsalves-Álvarez, Marcelo Tuesta, Jacqueline Páez-Herrera, Jorge Olivares-Arancibia, Tomás Reyes-Amigo, Guillermo Cortés-Roco, Juan Hurtado-Almonacid, Eduardo Guzmán-Muñoz, Nicole Aguilera-Martínez, José Francisco López-Gil, Vicente Javier Clemente-Suárez
{"title":"基于健康的青少年心脏代谢风险分类的监督机器学习算法。","authors":"Rodrigo Yáñez-Sepúlveda, Rodrigo Olivares, Pablo Olivares, Juan Pablo Zavala-Crichton, Claudio Hinojosa-Torres, Frano Giakoni-Ramírez, Josivaldo de Souza-Lima, Matías Monsalves-Álvarez, Marcelo Tuesta, Jacqueline Páez-Herrera, Jorge Olivares-Arancibia, Tomás Reyes-Amigo, Guillermo Cortés-Roco, Juan Hurtado-Almonacid, Eduardo Guzmán-Muñoz, Nicole Aguilera-Martínez, José Francisco López-Gil, Vicente Javier Clemente-Suárez","doi":"10.3390/sports13080273","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Cardiometabolic risk in adolescents represents a growing public health concern that is closely linked to modifiable factors such as physical fitness. Traditional statistical approaches often fail to capture complex, nonlinear relationships among anthropometric and fitness-related variables.</p><p><strong>Objective: </strong>To develop and evaluate supervised machine learning algorithms, including artificial neural networks and ensemble methods, for classifying cardiometabolic risk levels among Chilean adolescents based on standardized physical fitness assessments.</p><p><strong>Methods: </strong>A cross-sectional analysis was conducted using a large representative sample of school-aged adolescents. Field-based physical fitness tests, such as cardiorespiratory fitness (in terms of estimated maximal oxygen consumption [VO<sub>2max</sub>]), muscular strength (push-ups), and explosive power (horizontal jump) testing, were used as input variables. A cardiometabolic risk index was derived using international criteria. Various supervised machine learning models were trained and compared regarding accuracy, F1 score, recall, and area under the receiver operating characteristic curve (AUC-ROC).</p><p><strong>Results: </strong>Among all the models tested, the gradient boosting classifier achieved the best overall performance, with an accuracy of 77.0%, an F1 score of 67.3%, and the highest AUC-ROC (0.601). These results indicate a strong balance between sensitivity and specificity in classifying adolescents at cardiometabolic risk. Horizontal jumps and push-ups emerged as the most influential predictive variables.</p><p><strong>Conclusions: </strong>Gradient boosting proved to be the most effective model for predicting cardiometabolic risk based on physical fitness data. This approach offers a practical, data-driven tool for early risk detection in adolescent populations and may support scalable screening efforts in educational and clinical settings.</p>","PeriodicalId":53303,"journal":{"name":"Sports","volume":"13 8","pages":""},"PeriodicalIF":2.9000,"publicationDate":"2025-08-18","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12390583/pdf/","citationCount":"0","resultStr":"{\"title\":\"Supervised Machine Learning Algorithms for Fitness-Based Cardiometabolic Risk Classification in Adolescents.\",\"authors\":\"Rodrigo Yáñez-Sepúlveda, Rodrigo Olivares, Pablo Olivares, Juan Pablo Zavala-Crichton, Claudio Hinojosa-Torres, Frano Giakoni-Ramírez, Josivaldo de Souza-Lima, Matías Monsalves-Álvarez, Marcelo Tuesta, Jacqueline Páez-Herrera, Jorge Olivares-Arancibia, Tomás Reyes-Amigo, Guillermo Cortés-Roco, Juan Hurtado-Almonacid, Eduardo Guzmán-Muñoz, Nicole Aguilera-Martínez, José Francisco López-Gil, Vicente Javier Clemente-Suárez\",\"doi\":\"10.3390/sports13080273\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Cardiometabolic risk in adolescents represents a growing public health concern that is closely linked to modifiable factors such as physical fitness. Traditional statistical approaches often fail to capture complex, nonlinear relationships among anthropometric and fitness-related variables.</p><p><strong>Objective: </strong>To develop and evaluate supervised machine learning algorithms, including artificial neural networks and ensemble methods, for classifying cardiometabolic risk levels among Chilean adolescents based on standardized physical fitness assessments.</p><p><strong>Methods: </strong>A cross-sectional analysis was conducted using a large representative sample of school-aged adolescents. Field-based physical fitness tests, such as cardiorespiratory fitness (in terms of estimated maximal oxygen consumption [VO<sub>2max</sub>]), muscular strength (push-ups), and explosive power (horizontal jump) testing, were used as input variables. A cardiometabolic risk index was derived using international criteria. Various supervised machine learning models were trained and compared regarding accuracy, F1 score, recall, and area under the receiver operating characteristic curve (AUC-ROC).</p><p><strong>Results: </strong>Among all the models tested, the gradient boosting classifier achieved the best overall performance, with an accuracy of 77.0%, an F1 score of 67.3%, and the highest AUC-ROC (0.601). These results indicate a strong balance between sensitivity and specificity in classifying adolescents at cardiometabolic risk. Horizontal jumps and push-ups emerged as the most influential predictive variables.</p><p><strong>Conclusions: </strong>Gradient boosting proved to be the most effective model for predicting cardiometabolic risk based on physical fitness data. This approach offers a practical, data-driven tool for early risk detection in adolescent populations and may support scalable screening efforts in educational and clinical settings.</p>\",\"PeriodicalId\":53303,\"journal\":{\"name\":\"Sports\",\"volume\":\"13 8\",\"pages\":\"\"},\"PeriodicalIF\":2.9000,\"publicationDate\":\"2025-08-18\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12390583/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Sports\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3390/sports13080273\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"SPORT SCIENCES\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sports","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3390/sports13080273","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SPORT SCIENCES","Score":null,"Total":0}
Supervised Machine Learning Algorithms for Fitness-Based Cardiometabolic Risk Classification in Adolescents.
Background: Cardiometabolic risk in adolescents represents a growing public health concern that is closely linked to modifiable factors such as physical fitness. Traditional statistical approaches often fail to capture complex, nonlinear relationships among anthropometric and fitness-related variables.
Objective: To develop and evaluate supervised machine learning algorithms, including artificial neural networks and ensemble methods, for classifying cardiometabolic risk levels among Chilean adolescents based on standardized physical fitness assessments.
Methods: A cross-sectional analysis was conducted using a large representative sample of school-aged adolescents. Field-based physical fitness tests, such as cardiorespiratory fitness (in terms of estimated maximal oxygen consumption [VO2max]), muscular strength (push-ups), and explosive power (horizontal jump) testing, were used as input variables. A cardiometabolic risk index was derived using international criteria. Various supervised machine learning models were trained and compared regarding accuracy, F1 score, recall, and area under the receiver operating characteristic curve (AUC-ROC).
Results: Among all the models tested, the gradient boosting classifier achieved the best overall performance, with an accuracy of 77.0%, an F1 score of 67.3%, and the highest AUC-ROC (0.601). These results indicate a strong balance between sensitivity and specificity in classifying adolescents at cardiometabolic risk. Horizontal jumps and push-ups emerged as the most influential predictive variables.
Conclusions: Gradient boosting proved to be the most effective model for predicting cardiometabolic risk based on physical fitness data. This approach offers a practical, data-driven tool for early risk detection in adolescent populations and may support scalable screening efforts in educational and clinical settings.