Hengyan Liu, Weibin Kou, Yik-Chung Wu, Pui Hing Chau, Thomas Wai Hung Chung, Daniel Yee Tak Fong
{"title":"Predicting Childhood and Adolescence Hypertension: Analysis of Predictors Using Machine Learning.","authors":"Hengyan Liu, Weibin Kou, Yik-Chung Wu, Pui Hing Chau, Thomas Wai Hung Chung, Daniel Yee Tak Fong","doi":"10.1542/peds.2024-066675","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>There has been a substantial burden of hypertension in children and adolescents. Given the availability of primary prevention strategies, it is important to determine predictors for early identification of children and adolescents at risk of hypertension. This study aims to attempt and validate machine learning (ML) algorithms for accurately predicting blood pressure (BP) status (normal, prehypertension, and hypertension) over 1- and 3-year periods, identifying key predictors without compromising model performance.</p><p><strong>Methods: </strong>We included a population-based cohort of primary 1 to secondary 6 students (typically aged 6 to 18 years) during the academic years of 1995 to 1996 and 2019 to 2020 in Hong Kong. Thirty-six easy-assessed predictors were initially model childhood BP status. Multiple ML algorithms, decision tree, random forest, k-nearest neighbor, eXtreme Gradient Boosting (XGBoost), and multinomial logistic regression (MLR), were used. Model evaluation was performed by various accuracy metrics. The Shapley Additive Explanations (SHAP) was used to identify key features for both predictions.</p><p><strong>Results: </strong>A total of 923 301 and 602 179 visit pairs were used for the 1- and 3-year predictions, respectively. XGBoost demonstrated the highest prediction accuracies for 1-year (macro-area under the receiver operating characteristic curve [AUROC] = 0.92, micro-AUROC = 0.91) and 3-year (macro-AUROC = 0.91, micro-AUROC = 0.90) periods. The traditional MLR approach had the lowest accuracies for 1- (macro-AUROC = 0.70, micro-AUROC = 0.68) and 3-year (macro-AUROC = 0.70, micro-AUROC = 0.68) predictions. The SHAP values identified 17 key predictors without the need for direct BP measurements or laboratory tests.</p><p><strong>Conclusion: </strong>ML prediction models can accurately predict childhood prehypertension and hypertension at 1 and 3 years, independent of BP and laboratory measurements. The identified key predictors may inform areas for personalized prevention in hypertension.</p>","PeriodicalId":20028,"journal":{"name":"Pediatrics","volume":" ","pages":""},"PeriodicalIF":6.2000,"publicationDate":"2025-02-04","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Pediatrics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1542/peds.2024-066675","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PEDIATRICS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: There has been a substantial burden of hypertension in children and adolescents. Given the availability of primary prevention strategies, it is important to determine predictors for early identification of children and adolescents at risk of hypertension. This study aims to attempt and validate machine learning (ML) algorithms for accurately predicting blood pressure (BP) status (normal, prehypertension, and hypertension) over 1- and 3-year periods, identifying key predictors without compromising model performance.
Methods: We included a population-based cohort of primary 1 to secondary 6 students (typically aged 6 to 18 years) during the academic years of 1995 to 1996 and 2019 to 2020 in Hong Kong. Thirty-six easy-assessed predictors were initially model childhood BP status. Multiple ML algorithms, decision tree, random forest, k-nearest neighbor, eXtreme Gradient Boosting (XGBoost), and multinomial logistic regression (MLR), were used. Model evaluation was performed by various accuracy metrics. The Shapley Additive Explanations (SHAP) was used to identify key features for both predictions.
Results: A total of 923 301 and 602 179 visit pairs were used for the 1- and 3-year predictions, respectively. XGBoost demonstrated the highest prediction accuracies for 1-year (macro-area under the receiver operating characteristic curve [AUROC] = 0.92, micro-AUROC = 0.91) and 3-year (macro-AUROC = 0.91, micro-AUROC = 0.90) periods. The traditional MLR approach had the lowest accuracies for 1- (macro-AUROC = 0.70, micro-AUROC = 0.68) and 3-year (macro-AUROC = 0.70, micro-AUROC = 0.68) predictions. The SHAP values identified 17 key predictors without the need for direct BP measurements or laboratory tests.
Conclusion: ML prediction models can accurately predict childhood prehypertension and hypertension at 1 and 3 years, independent of BP and laboratory measurements. The identified key predictors may inform areas for personalized prevention in hypertension.
期刊介绍:
The Pediatrics® journal is the official flagship journal of the American Academy of Pediatrics (AAP). It is widely cited in the field of pediatric medicine and is recognized as the leading journal in the field.
The journal publishes original research and evidence-based articles, which provide authoritative information to help readers stay up-to-date with the latest developments in pediatric medicine. The content is peer-reviewed and undergoes rigorous evaluation to ensure its quality and reliability.
Pediatrics also serves as a valuable resource for conducting new research studies and supporting education and training activities in the field of pediatrics. It aims to enhance the quality of pediatric outpatient and inpatient care by disseminating valuable knowledge and insights.
As of 2023, Pediatrics has an impressive Journal Impact Factor (IF) Score of 8.0. The IF is a measure of a journal's influence and importance in the scientific community, with higher scores indicating a greater impact. This score reflects the significance and reach of the research published in Pediatrics, further establishing its prominence in the field of pediatric medicine.