Machine learning-based prediction of elevated N terminal pro brain natriuretic peptide among US general population.

IF 3.2 2区医学 Q2 CARDIAC & CARDIOVASCULAR SYSTEMS

ESC Heart Failure Pub Date : 2024-11-19 DOI:10.1002/ehf2.15056

Yuichiro Mori, Shingo Fukuma, Kyohei Yamaji, Atsushi Mizuno, Naoki Kondo, Kosuke Inoue

{"title":"Machine learning-based prediction of elevated N terminal pro brain natriuretic peptide among US general population.","authors":"Yuichiro Mori, Shingo Fukuma, Kyohei Yamaji, Atsushi Mizuno, Naoki Kondo, Kosuke Inoue","doi":"10.1002/ehf2.15056","DOIUrl":null,"url":null,"abstract":"Aims: Natriuretic peptide-based pre-heart failure screening has been proposed in recent guidelines. However, an effective strategy to identify screening targets from the general population, more than half of which are at risk for heart failure or pre-heart failure, has not been well established. This study evaluated the performance of machine learning prediction models for predicting elevated N terminal pro brain natriuretic peptide (NT-proBNP) levels in the US general population.Methods and results: Individuals aged 20-79 years without cardiovascular disease from the nationally representative National Health and Nutrition Examination Survey 1999-2004 were included. Six prediction models (two conventional regression models and four machine learning models) were trained with the 1999-2002 cohort to predict elevated NT-proBNP levels (>125 pg/mL) using demographic, lifestyle, and commonly measured biochemical data. The model performance was tested using the 2003-2004 cohort. Of the 10 237 individuals, 1510 (14.8%) had NT-proBNP levels >125 pg/mL. The highest area under the receiver operating characteristic curve (AUC) was observed in SuperLearner (AUC [95% CI] = 0.862 [0.847-0.878], P < 0.001 compared with the logistic regression model). The logistic regression model with splines showed a comparable performance (AUC [95% CI] = 0.857 [0.841-0.874], P = 0.08). Age, albumin level, haemoglobin level, sex, estimated glomerular filtration rate, and systolic blood pressure were the most important predictors. We found a similar prediction performance even after excluding socio-economic information (marital status, family income, and education status) from the prediction models. When we used different thresholds for elevated NT-proBNP, the AUC (95% CI) in the SuperLearner models 0.846 (0.830-0.861) for NT-proBNP > 100 pg/mL and 0.866 (0.849-0.884) for NT-proBNP > 150 pg/mL.Conclusions: Using nationally representative data from the United States, both logistic regression and machine learning models well predicted elevated NT-proBNP. The predictive performance remained consistent even when the models incorporated only commonly available variables in daily clinical practice. Prediction models using regularly measured information would serve as a potentially useful tools for clinicians to effectively identify targets of natriuretic-peptide screening.","PeriodicalId":11864,"journal":{"name":"ESC Heart Failure","volume":" ","pages":""},"PeriodicalIF":3.2000,"publicationDate":"2024-11-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"ESC Heart Failure","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/ehf2.15056","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}

引用次数: 0

Abstract

Aims: Natriuretic peptide-based pre-heart failure screening has been proposed in recent guidelines. However, an effective strategy to identify screening targets from the general population, more than half of which are at risk for heart failure or pre-heart failure, has not been well established. This study evaluated the performance of machine learning prediction models for predicting elevated N terminal pro brain natriuretic peptide (NT-proBNP) levels in the US general population.

Methods and results: Individuals aged 20-79 years without cardiovascular disease from the nationally representative National Health and Nutrition Examination Survey 1999-2004 were included. Six prediction models (two conventional regression models and four machine learning models) were trained with the 1999-2002 cohort to predict elevated NT-proBNP levels (>125 pg/mL) using demographic, lifestyle, and commonly measured biochemical data. The model performance was tested using the 2003-2004 cohort. Of the 10 237 individuals, 1510 (14.8%) had NT-proBNP levels >125 pg/mL. The highest area under the receiver operating characteristic curve (AUC) was observed in SuperLearner (AUC [95% CI] = 0.862 [0.847-0.878], P < 0.001 compared with the logistic regression model). The logistic regression model with splines showed a comparable performance (AUC [95% CI] = 0.857 [0.841-0.874], P = 0.08). Age, albumin level, haemoglobin level, sex, estimated glomerular filtration rate, and systolic blood pressure were the most important predictors. We found a similar prediction performance even after excluding socio-economic information (marital status, family income, and education status) from the prediction models. When we used different thresholds for elevated NT-proBNP, the AUC (95% CI) in the SuperLearner models 0.846 (0.830-0.861) for NT-proBNP > 100 pg/mL and 0.866 (0.849-0.884) for NT-proBNP > 150 pg/mL.

Conclusions: Using nationally representative data from the United States, both logistic regression and machine learning models well predicted elevated NT-proBNP. The predictive performance remained consistent even when the models incorporated only commonly available variables in daily clinical practice. Prediction models using regularly measured information would serve as a potentially useful tools for clinicians to effectively identify targets of natriuretic-peptide screening.

查看原文本刊更多论文

基于机器学习的美国普通人群 N 端脑钠肽升高预测。

目的：最近的指南中提出了以利钠肽为基础的心衰前期筛查。然而，从普通人群中确定筛查目标的有效策略尚未得到很好的确立，而普通人群中有一半以上存在心力衰竭或先兆心力衰竭的风险。本研究评估了机器学习预测模型在预测美国普通人群中 N 末端脑钠肽 (NT-proBNP) 水平升高方面的性能：方法：研究人员纳入了具有全国代表性的 1999-2004 年全国健康与营养调查中 20-79 岁无心血管疾病的人群。利用 1999-2002 年队列中的人口统计学、生活方式和常用生化测量数据训练了六个预测模型（两个传统回归模型和四个机器学习模型），以预测 NT-proBNP 水平的升高（>125 pg/mL）。使用 2003-2004 年队列对模型性能进行了测试。在 10 237 人中，1510 人（14.8%）的 NT-proBNP 水平大于 125 pg/mL。SuperLearner 的接收器操作特征曲线下面积（AUC）最高（AUC [95% CI] = 0.862 [0.847-0.878], P 100 pg/mL，NT-proBNP > 150 pg/mL为 0.866 (0.849-0.884)）：结论：使用具有全国代表性的美国数据，逻辑回归和机器学习模型都能很好地预测 NT-proBNP 的升高。即使模型中只包含日常临床实践中常见的变量，预测效果也保持一致。使用定期测量信息的预测模型将成为临床医生有效确定钠尿肽筛查目标的潜在有用工具。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

ESC Heart Failure Medicine-Cardiology and Cardiovascular Medicine

CiteScore

7.00

自引率

7.90%

发文量

461

审稿时长

12 weeks

期刊介绍： ESC Heart Failure is the open access journal of the Heart Failure Association of the European Society of Cardiology dedicated to the advancement of knowledge in the field of heart failure. The journal aims to improve the understanding, prevention, investigation and treatment of heart failure. Molecular and cellular biology, pathology, physiology, electrophysiology, pharmacology, as well as the clinical, social and population sciences all form part of the discipline that is heart failure. Accordingly, submission of manuscripts on basic, translational, clinical and population sciences is invited. Original contributions on nursing, care of the elderly, primary care, health economics and other specialist fields related to heart failure are also welcome, as are case reports that highlight interesting aspects of heart failure care and treatment.