Pablo M Marti-Castellote, Christopher Reeder, Brian L Claggett, Pulkit Singh, Emily S Lau, Shaan Khurshid, Puneet Batra, Steven A Lubitz, Mahnaz Maddah, Orly Vardeny, Eldrin F Lewis, Marc A Pfeffer, Pardeep S Jhund, Akshay S Desai, John J V McMurray, Patrick T Ellinor, Jennifer E Ho, Scott D Solomon, Jonathan W Cunningham
{"title":"Natural Language Processing to Adjudicate Heart Failure Hospitalizations in Global Clinical Trials.","authors":"Pablo M Marti-Castellote, Christopher Reeder, Brian L Claggett, Pulkit Singh, Emily S Lau, Shaan Khurshid, Puneet Batra, Steven A Lubitz, Mahnaz Maddah, Orly Vardeny, Eldrin F Lewis, Marc A Pfeffer, Pardeep S Jhund, Akshay S Desai, John J V McMurray, Patrick T Ellinor, Jennifer E Ho, Scott D Solomon, Jonathan W Cunningham","doi":"10.1161/CIRCHEARTFAILURE.124.012514","DOIUrl":null,"url":null,"abstract":"<p><p><b>Background:</b> Medical record review by a physician clinical events committee is the gold standard for identifying cardiovascular outcomes in clinical trials, but is labor-intensive and poorly reproducible. Automated outcome adjudication by artificial intelligence (AI) could enable larger and less expensive clinical trials, but has not been validated in global studies. <b>Methods:</b> We developed a novel model for automated AI-based heart failure adjudication (\"HF-NLP\") using hospitalizations from three international clinical outcomes trials. This model was tested on potential heart failure hospitalizations from the DELIVER trial, a cardiovascular outcomes trial comparing dapagliflozin with placebo in 6063 patients with heart failure with mildly reduced or preserved ejection fraction. AI-based adjudications were compared with adjudications from a clinical events committee that followed FDA-based criteria. <b>Results:</b> AI-based adjudication agreed with the clinical events committee in 83% of events. A strategy of human review for events that the AI model deemed uncertain (16%) would have achieved 91% agreement with the clinical events committee while reducing adjudication workload by 84%. The estimated effect of dapagliflozin on heart failure hospitalization was nearly identical with AI-based adjudication (hazard ratio 0.76 [95% CI 0.66-0.88]) compared to clinical events committee adjudication (hazard ratio 0.77 [95% CI 0.67-0.89]). The AI model extracted symptoms, signs, and treatments of heart failure from each medical record in tabular format and quoted sentences documenting them. <b>Conclusions:</b> AI-based adjudication of clinical outcomes has the potential to improve the efficiency of global clinical trials while preserving accuracy and interpretability.</p>","PeriodicalId":10196,"journal":{"name":"Circulation: Heart Failure","volume":" ","pages":""},"PeriodicalIF":7.8000,"publicationDate":"2024-11-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Circulation: Heart Failure","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1161/CIRCHEARTFAILURE.124.012514","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CARDIAC & CARDIOVASCULAR SYSTEMS","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Medical record review by a physician clinical events committee is the gold standard for identifying cardiovascular outcomes in clinical trials, but is labor-intensive and poorly reproducible. Automated outcome adjudication by artificial intelligence (AI) could enable larger and less expensive clinical trials, but has not been validated in global studies. Methods: We developed a novel model for automated AI-based heart failure adjudication ("HF-NLP") using hospitalizations from three international clinical outcomes trials. This model was tested on potential heart failure hospitalizations from the DELIVER trial, a cardiovascular outcomes trial comparing dapagliflozin with placebo in 6063 patients with heart failure with mildly reduced or preserved ejection fraction. AI-based adjudications were compared with adjudications from a clinical events committee that followed FDA-based criteria. Results: AI-based adjudication agreed with the clinical events committee in 83% of events. A strategy of human review for events that the AI model deemed uncertain (16%) would have achieved 91% agreement with the clinical events committee while reducing adjudication workload by 84%. The estimated effect of dapagliflozin on heart failure hospitalization was nearly identical with AI-based adjudication (hazard ratio 0.76 [95% CI 0.66-0.88]) compared to clinical events committee adjudication (hazard ratio 0.77 [95% CI 0.67-0.89]). The AI model extracted symptoms, signs, and treatments of heart failure from each medical record in tabular format and quoted sentences documenting them. Conclusions: AI-based adjudication of clinical outcomes has the potential to improve the efficiency of global clinical trials while preserving accuracy and interpretability.
期刊介绍:
Circulation: Heart Failure focuses on content related to heart failure, mechanical circulatory support, and heart transplant science and medicine. It considers studies conducted in humans or analyses of human data, as well as preclinical studies with direct clinical correlation or relevance. While primarily a clinical journal, it may publish novel basic and preclinical studies that significantly advance the field of heart failure.