{"title":"基于机器学习的美国普通人群 N 端脑钠肽升高预测。","authors":"Yuichiro Mori, Shingo Fukuma, Kyohei Yamaji, Atsushi Mizuno, Naoki Kondo, Kosuke Inoue","doi":"10.1002/ehf2.15056","DOIUrl":null,"url":null,"abstract":"<p><strong>Aims: </strong>Natriuretic peptide-based pre-heart failure screening has been proposed in recent guidelines. However, an effective strategy to identify screening targets from the general population, more than half of which are at risk for heart failure or pre-heart failure, has not been well established. This study evaluated the performance of machine learning prediction models for predicting elevated N terminal pro brain natriuretic peptide (NT-proBNP) levels in the US general population.</p><p><strong>Methods and results: </strong>Individuals aged 20-79 years without cardiovascular disease from the nationally representative National Health and Nutrition Examination Survey 1999-2004 were included. Six prediction models (two conventional regression models and four machine learning models) were trained with the 1999-2002 cohort to predict elevated NT-proBNP levels (>125 pg/mL) using demographic, lifestyle, and commonly measured biochemical data. The model performance was tested using the 2003-2004 cohort. Of the 10 237 individuals, 1510 (14.8%) had NT-proBNP levels >125 pg/mL. The highest area under the receiver operating characteristic curve (AUC) was observed in SuperLearner (AUC [95% CI] = 0.862 [0.847-0.878], P < 0.001 compared with the logistic regression model). The logistic regression model with splines showed a comparable performance (AUC [95% CI] = 0.857 [0.841-0.874], P = 0.08). Age, albumin level, haemoglobin level, sex, estimated glomerular filtration rate, and systolic blood pressure were the most important predictors. We found a similar prediction performance even after excluding socio-economic information (marital status, family income, and education status) from the prediction models. When we used different thresholds for elevated NT-proBNP, the AUC (95% CI) in the SuperLearner models 0.846 (0.830-0.861) for NT-proBNP > 100 pg/mL and 0.866 (0.849-0.884) for NT-proBNP > 150 pg/mL.</p><p><strong>Conclusions: </strong>Using nationally representative data from the United States, both logistic regression and machine learning models well predicted elevated NT-proBNP. The predictive performance remained consistent even when the models incorporated only commonly available variables in daily clinical practice. Prediction models using regularly measured information would serve as a potentially useful tools for clinicians to effectively identify targets of natriuretic-peptide screening.</p>","PeriodicalId":11864,"journal":{"name":"ESC Heart Failure","volume":" ","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine learning-based prediction of elevated N terminal pro brain natriuretic peptide among US general population.\",\"authors\":\"Yuichiro Mori, Shingo Fukuma, Kyohei Yamaji, Atsushi Mizuno, Naoki Kondo, Kosuke Inoue\",\"doi\":\"10.1002/ehf2.15056\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Aims: </strong>Natriuretic peptide-based pre-heart failure screening has been proposed in recent guidelines. However, an effective strategy to identify screening targets from the general population, more than half of which are at risk for heart failure or pre-heart failure, has not been well established. This study evaluated the performance of machine learning prediction models for predicting elevated N terminal pro brain natriuretic peptide (NT-proBNP) levels in the US general population.</p><p><strong>Methods and results: </strong>Individuals aged 20-79 years without cardiovascular disease from the nationally representative National Health and Nutrition Examination Survey 1999-2004 were included. Six prediction models (two conventional regression models and four machine learning models) were trained with the 1999-2002 cohort to predict elevated NT-proBNP levels (>125 pg/mL) using demographic, lifestyle, and commonly measured biochemical data. The model performance was tested using the 2003-2004 cohort. Of the 10 237 individuals, 1510 (14.8%) had NT-proBNP levels >125 pg/mL. The highest area under the receiver operating characteristic curve (AUC) was observed in SuperLearner (AUC [95% CI] = 0.862 [0.847-0.878], P < 0.001 compared with the logistic regression model). The logistic regression model with splines showed a comparable performance (AUC [95% CI] = 0.857 [0.841-0.874], P = 0.08). Age, albumin level, haemoglobin level, sex, estimated glomerular filtration rate, and systolic blood pressure were the most important predictors. We found a similar prediction performance even after excluding socio-economic information (marital status, family income, and education status) from the prediction models. When we used different thresholds for elevated NT-proBNP, the AUC (95% CI) in the SuperLearner models 0.846 (0.830-0.861) for NT-proBNP > 100 pg/mL and 0.866 (0.849-0.884) for NT-proBNP > 150 pg/mL.</p><p><strong>Conclusions: </strong>Using nationally representative data from the United States, both logistic regression and machine learning models well predicted elevated NT-proBNP. The predictive performance remained consistent even when the models incorporated only commonly available variables in daily clinical practice. Prediction models using regularly measured information would serve as a potentially useful tools for clinicians to effectively identify targets of natriuretic-peptide screening.</p>\",\"PeriodicalId\":11864,\"journal\":{\"name\":\"ESC Heart Failure\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-11-19\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ESC Heart Failure\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1002/ehf2.15056\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q2\",\"JCRName\":\"CARDIAC & CARDIOVASCULAR SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ESC Heart Failure","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/ehf2.15056","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
Machine learning-based prediction of elevated N terminal pro brain natriuretic peptide among US general population.
Aims: Natriuretic peptide-based pre-heart failure screening has been proposed in recent guidelines. However, an effective strategy to identify screening targets from the general population, more than half of which are at risk for heart failure or pre-heart failure, has not been well established. This study evaluated the performance of machine learning prediction models for predicting elevated N terminal pro brain natriuretic peptide (NT-proBNP) levels in the US general population.
Methods and results: Individuals aged 20-79 years without cardiovascular disease from the nationally representative National Health and Nutrition Examination Survey 1999-2004 were included. Six prediction models (two conventional regression models and four machine learning models) were trained with the 1999-2002 cohort to predict elevated NT-proBNP levels (>125 pg/mL) using demographic, lifestyle, and commonly measured biochemical data. The model performance was tested using the 2003-2004 cohort. Of the 10 237 individuals, 1510 (14.8%) had NT-proBNP levels >125 pg/mL. The highest area under the receiver operating characteristic curve (AUC) was observed in SuperLearner (AUC [95% CI] = 0.862 [0.847-0.878], P < 0.001 compared with the logistic regression model). The logistic regression model with splines showed a comparable performance (AUC [95% CI] = 0.857 [0.841-0.874], P = 0.08). Age, albumin level, haemoglobin level, sex, estimated glomerular filtration rate, and systolic blood pressure were the most important predictors. We found a similar prediction performance even after excluding socio-economic information (marital status, family income, and education status) from the prediction models. When we used different thresholds for elevated NT-proBNP, the AUC (95% CI) in the SuperLearner models 0.846 (0.830-0.861) for NT-proBNP > 100 pg/mL and 0.866 (0.849-0.884) for NT-proBNP > 150 pg/mL.
Conclusions: Using nationally representative data from the United States, both logistic regression and machine learning models well predicted elevated NT-proBNP. The predictive performance remained consistent even when the models incorporated only commonly available variables in daily clinical practice. Prediction models using regularly measured information would serve as a potentially useful tools for clinicians to effectively identify targets of natriuretic-peptide screening.
期刊介绍:
ESC Heart Failure is the open access journal of the Heart Failure Association of the European Society of Cardiology dedicated to the advancement of knowledge in the field of heart failure. The journal aims to improve the understanding, prevention, investigation and treatment of heart failure. Molecular and cellular biology, pathology, physiology, electrophysiology, pharmacology, as well as the clinical, social and population sciences all form part of the discipline that is heart failure. Accordingly, submission of manuscripts on basic, translational, clinical and population sciences is invited. Original contributions on nursing, care of the elderly, primary care, health economics and other specialist fields related to heart failure are also welcome, as are case reports that highlight interesting aspects of heart failure care and treatment.