Keisyanne Araujo-Moura, Letícia Souza, Tiago Almeida de Oliveira, Mateus Silva Rocha, Augusto César Ferreira De Moraes, Alexandre Chiavegatto Filho
{"title":"Prediction of Hypertension in the Pediatric Population Using Machine Learning and Transfer Learning: A Multicentric Analysis of the SAYCARE Study.","authors":"Keisyanne Araujo-Moura, Letícia Souza, Tiago Almeida de Oliveira, Mateus Silva Rocha, Augusto César Ferreira De Moraes, Alexandre Chiavegatto Filho","doi":"10.3389/ijph.2025.1607944","DOIUrl":null,"url":null,"abstract":"<p><strong>Objective: </strong>To develop a machine learning (ML) model utilizing transfer learning (TL) techniques to predict hypertension in children and adolescents across South America.</p><p><strong>Methods: </strong>Data from two cohorts (children and adolescents) in seven South American cities were analyzed. A TL strategy was implemented by transferring knowledge from a CatBoost model trained on the children's sample and adapting it to the adolescent sample. Model performance was evaluated using standard metrics.</p><p><strong>Results: </strong>Among children, the prevalence of normal blood pressure was 88.9% (301 participants), while 14.1% (50 participants) had elevated blood pressure (EBP). In the adolescent group, the prevalence of normal blood pressure was 92.5% (284 participants), with 7.5% (23 participants) presenting with EBP. Random Forest, XGBoost, and LightGBM achieved high accuracy (0.90) for children, with XGBoost and LightGBM demonstrating superior recall (0.50) and AUC-ROC (0.74). For adolescents, models without TL showed poor performance, with accuracy and recall values remaining low and AUC-ROC ranging from 0.46 to 0.56. After applying TL, model performance improved significantly, with CatBoost achieving an AUC-ROC of 0.82, accuracy of 1.0, and recall of 0.18.</p><p><strong>Conclusion: </strong>Soft drinks, filled cookies, and chips were key dietary predictors of elevated blood pressure, with higher intake in adolescents. Machine learning with transfer learning effectively identified these risks, emphasizing the need for early dietary interventions to prevent hypertension and support cardiovascular health in pediatric populations.</p>","PeriodicalId":14322,"journal":{"name":"International Journal of Public Health","volume":"70 ","pages":"1607944"},"PeriodicalIF":2.6000,"publicationDate":"2025-03-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11937837/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Public Health","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.3389/ijph.2025.1607944","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0
Abstract
Objective: To develop a machine learning (ML) model utilizing transfer learning (TL) techniques to predict hypertension in children and adolescents across South America.
Methods: Data from two cohorts (children and adolescents) in seven South American cities were analyzed. A TL strategy was implemented by transferring knowledge from a CatBoost model trained on the children's sample and adapting it to the adolescent sample. Model performance was evaluated using standard metrics.
Results: Among children, the prevalence of normal blood pressure was 88.9% (301 participants), while 14.1% (50 participants) had elevated blood pressure (EBP). In the adolescent group, the prevalence of normal blood pressure was 92.5% (284 participants), with 7.5% (23 participants) presenting with EBP. Random Forest, XGBoost, and LightGBM achieved high accuracy (0.90) for children, with XGBoost and LightGBM demonstrating superior recall (0.50) and AUC-ROC (0.74). For adolescents, models without TL showed poor performance, with accuracy and recall values remaining low and AUC-ROC ranging from 0.46 to 0.56. After applying TL, model performance improved significantly, with CatBoost achieving an AUC-ROC of 0.82, accuracy of 1.0, and recall of 0.18.
Conclusion: Soft drinks, filled cookies, and chips were key dietary predictors of elevated blood pressure, with higher intake in adolescents. Machine learning with transfer learning effectively identified these risks, emphasizing the need for early dietary interventions to prevent hypertension and support cardiovascular health in pediatric populations.
期刊介绍:
The International Journal of Public Health publishes scientific articles relevant to global public health, from different countries and cultures, and assembles them into issues that raise awareness and understanding of public health problems and solutions. The Journal welcomes submissions of original research, critical and relevant reviews, methodological papers and manuscripts that emphasize theoretical content. IJPH sometimes publishes commentaries and opinions. Special issues highlight key areas of current research. The Editorial Board''s mission is to provide a thoughtful forum for contemporary issues and challenges in global public health research and practice.