Arash A Nargesi, Philip Adejumo, Lovedeep Singh Dhingra, Benjamin Rosand, Astrid Hengartner, Andreas Coppi, Simon Benigeri, Sounok Sen, Tariq Ahmad, Girish N Nadkarni, Zhenqiu Lin, Faraz S Ahmad, Harlan M Krumholz, Rohan Khera
{"title":"利用基于深度学习的自然语言处理技术自动识别射血分数降低的心力衰竭。","authors":"Arash A Nargesi, Philip Adejumo, Lovedeep Singh Dhingra, Benjamin Rosand, Astrid Hengartner, Andreas Coppi, Simon Benigeri, Sounok Sen, Tariq Ahmad, Girish N Nadkarni, Zhenqiu Lin, Faraz S Ahmad, Harlan M Krumholz, Rohan Khera","doi":"10.1016/j.jchf.2024.08.012","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>The lack of automated tools for measuring care quality limits the implementation of a national program to assess guideline-directed care in heart failure with reduced ejection fraction (HFrEF).</p><p><strong>Objectives: </strong>The authors aimed to automate the identification of patients with HFrEF at hospital discharge, an opportunity to evaluate and improve the quality of care.</p><p><strong>Methods: </strong>The authors developed a novel deep-learning language model for identifying patients with HFrEF from discharge summaries of hospitalizations with heart failure at Yale New Haven Hospital during 2015 to 2019. HFrEF was defined by left ventricular ejection fraction <40% on antecedent echocardiography. The authors externally validated the model at Northwestern Medicine, community hospitals of Yale, and the MIMIC-III (Medical Information Mart for Intensive Care III) database.</p><p><strong>Results: </strong>A total of 13,251 notes from 5,392 unique individuals (age 73 ± 14 years, 48% women), including 2,487 patients with HFrEF (46.1%), were used for model development (train/held-out: 70%/30%). The model achieved an area under receiver-operating characteristic curve (AUROC) of 0.97 and area under precision recall curve (AUPRC) of 0.97 in detecting HFrEF on the held-out set. The model had high performance in identifying HFrEF with AUROC = 0.94 and AUPRC = 0.91 on 19,242 notes from Northwestern Medicine, AUROC = 0.95 and AUPRC = 0.96 on 139 manually abstracted notes from Yale community hospitals, and AUROC = 0.91 and AUPRC = 0.92 on 146 manually reviewed notes from MIMIC-III. Model-based predictions of HFrEF corresponded to a net reclassification improvement of 60.2% ± 1.9% compared with diagnosis codes (P < 0.001).</p><p><strong>Conclusions: </strong>The authors developed a language model that identifies HFrEF from clinical notes with high precision and accuracy, representing a key element in automating quality assessment for individuals with HFrEF.</p>","PeriodicalId":14687,"journal":{"name":"JACC. Heart failure","volume":" ","pages":"75-87"},"PeriodicalIF":10.3000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Automated Identification of Heart Failure With Reduced Ejection Fraction Using Deep Learning-Based Natural Language Processing.\",\"authors\":\"Arash A Nargesi, Philip Adejumo, Lovedeep Singh Dhingra, Benjamin Rosand, Astrid Hengartner, Andreas Coppi, Simon Benigeri, Sounok Sen, Tariq Ahmad, Girish N Nadkarni, Zhenqiu Lin, Faraz S Ahmad, Harlan M Krumholz, Rohan Khera\",\"doi\":\"10.1016/j.jchf.2024.08.012\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>The lack of automated tools for measuring care quality limits the implementation of a national program to assess guideline-directed care in heart failure with reduced ejection fraction (HFrEF).</p><p><strong>Objectives: </strong>The authors aimed to automate the identification of patients with HFrEF at hospital discharge, an opportunity to evaluate and improve the quality of care.</p><p><strong>Methods: </strong>The authors developed a novel deep-learning language model for identifying patients with HFrEF from discharge summaries of hospitalizations with heart failure at Yale New Haven Hospital during 2015 to 2019. HFrEF was defined by left ventricular ejection fraction <40% on antecedent echocardiography. The authors externally validated the model at Northwestern Medicine, community hospitals of Yale, and the MIMIC-III (Medical Information Mart for Intensive Care III) database.</p><p><strong>Results: </strong>A total of 13,251 notes from 5,392 unique individuals (age 73 ± 14 years, 48% women), including 2,487 patients with HFrEF (46.1%), were used for model development (train/held-out: 70%/30%). The model achieved an area under receiver-operating characteristic curve (AUROC) of 0.97 and area under precision recall curve (AUPRC) of 0.97 in detecting HFrEF on the held-out set. The model had high performance in identifying HFrEF with AUROC = 0.94 and AUPRC = 0.91 on 19,242 notes from Northwestern Medicine, AUROC = 0.95 and AUPRC = 0.96 on 139 manually abstracted notes from Yale community hospitals, and AUROC = 0.91 and AUPRC = 0.92 on 146 manually reviewed notes from MIMIC-III. Model-based predictions of HFrEF corresponded to a net reclassification improvement of 60.2% ± 1.9% compared with diagnosis codes (P < 0.001).</p><p><strong>Conclusions: </strong>The authors developed a language model that identifies HFrEF from clinical notes with high precision and accuracy, representing a key element in automating quality assessment for individuals with HFrEF.</p>\",\"PeriodicalId\":14687,\"journal\":{\"name\":\"JACC. Heart failure\",\"volume\":\" \",\"pages\":\"75-87\"},\"PeriodicalIF\":10.3000,\"publicationDate\":\"2025-01-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JACC. Heart failure\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.jchf.2024.08.012\",\"RegionNum\":1,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2024/10/23 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"CARDIAC & CARDIOVASCULAR SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JACC. Heart failure","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.jchf.2024.08.012","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/10/23 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
Automated Identification of Heart Failure With Reduced Ejection Fraction Using Deep Learning-Based Natural Language Processing.
Background: The lack of automated tools for measuring care quality limits the implementation of a national program to assess guideline-directed care in heart failure with reduced ejection fraction (HFrEF).
Objectives: The authors aimed to automate the identification of patients with HFrEF at hospital discharge, an opportunity to evaluate and improve the quality of care.
Methods: The authors developed a novel deep-learning language model for identifying patients with HFrEF from discharge summaries of hospitalizations with heart failure at Yale New Haven Hospital during 2015 to 2019. HFrEF was defined by left ventricular ejection fraction <40% on antecedent echocardiography. The authors externally validated the model at Northwestern Medicine, community hospitals of Yale, and the MIMIC-III (Medical Information Mart for Intensive Care III) database.
Results: A total of 13,251 notes from 5,392 unique individuals (age 73 ± 14 years, 48% women), including 2,487 patients with HFrEF (46.1%), were used for model development (train/held-out: 70%/30%). The model achieved an area under receiver-operating characteristic curve (AUROC) of 0.97 and area under precision recall curve (AUPRC) of 0.97 in detecting HFrEF on the held-out set. The model had high performance in identifying HFrEF with AUROC = 0.94 and AUPRC = 0.91 on 19,242 notes from Northwestern Medicine, AUROC = 0.95 and AUPRC = 0.96 on 139 manually abstracted notes from Yale community hospitals, and AUROC = 0.91 and AUPRC = 0.92 on 146 manually reviewed notes from MIMIC-III. Model-based predictions of HFrEF corresponded to a net reclassification improvement of 60.2% ± 1.9% compared with diagnosis codes (P < 0.001).
Conclusions: The authors developed a language model that identifies HFrEF from clinical notes with high precision and accuracy, representing a key element in automating quality assessment for individuals with HFrEF.
期刊介绍:
JACC: Heart Failure publishes crucial findings on the pathophysiology, diagnosis, treatment, and care of heart failure patients. The goal is to enhance understanding through timely scientific communication on disease, clinical trials, outcomes, and therapeutic advances. The Journal fosters interdisciplinary connections with neuroscience, pulmonary medicine, nephrology, electrophysiology, and surgery related to heart failure. It also covers articles on pharmacogenetics, biomarkers, and metabolomics.