Christopher L Moore, Vimig Socrates, Mina Hesami, Ryan P Denkewicz, Joe J Cavallo, Arjun K Venkatesh, R Andrew Taylor
Using natural language processing to identify emergency department patients with incidental lung nodules requiring follow-up.
Academic Emergency Medicine. Journal Article. Published 2025-01-17. DOI: 10.1111/acem.15080 (https://doi.org/10.1111/acem.15080)
Abstract
Objectives: For emergency department (ED) patients, lung cancer may be detected early through incidental lung nodules (ILNs) discovered on chest CTs. However, there are significant errors in the communication and follow-up of incidental findings on ED imaging, particularly due to unstructured radiology reports. Natural language processing (NLP) can aid in identifying ILNs requiring follow-up, potentially reducing errors from missed follow-up. We sought to develop an open-access, three-step NLP pipeline specifically for this purpose.
Methods: This retrospective study used a cohort of 26,545 chest CTs performed in three EDs from 2014 to 2021. Randomly selected chest CT reports were annotated by MD raters using Prodigy software to develop a stepwise NLP "pipeline" that first excluded prior or known malignancy, then determined the presence of a lung nodule, and finally categorized any recommended follow-up. The pipeline was developed using a RoBERTa large language model on the spaCy platform and deployed as open-access software using Docker. After development, the pipeline was applied to 1000 CT reports that were manually reviewed to determine accuracy using accepted NLP metrics of precision (positive predictive value), recall (sensitivity), and F1 score (which balances precision and recall).
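The stepwise logic described in the Methods can be sketched roughly as follows. This is an illustration only, not the authors' released software: the function and class names, the keyword-based stand-in classifiers, the follow-up labels ("recommended", "none"), and the choice to skip later steps once a report is excluded are all assumptions made for this example. In the study, each step is a trained RoBERTa-based model.

```python
from dataclasses import dataclass
from typing import Callable, Optional


@dataclass
class PipelineResult:
    malignancy: bool            # step 1: prior or known malignancy mentioned
    nodule: Optional[bool]      # step 2: None when the step was skipped
    follow_up: Optional[str]    # step 3: None when the step was skipped


def run_pipeline(
    report: str,
    has_malignancy: Callable[[str], bool],
    has_nodule: Callable[[str], bool],
    follow_up_category: Callable[[str], str],
) -> PipelineResult:
    """Apply three classifiers in sequence (hypothetical gating)."""
    # Step 1: exclude reports with prior or known malignancy.
    if has_malignancy(report):
        return PipelineResult(True, None, None)
    # Step 2: determine whether a lung nodule is present.
    if not has_nodule(report):
        return PipelineResult(False, False, None)
    # Step 3: categorize any recommended follow-up.
    return PipelineResult(False, True, follow_up_category(report))


# Toy keyword stand-ins for the trained models (assumptions, not the real classifiers).
def toy_has_malignancy(text: str) -> bool:
    lowered = text.lower()
    return "cancer" in lowered or "malignan" in lowered


def toy_has_nodule(text: str) -> bool:
    return "nodule" in text.lower()


def toy_follow_up(text: str) -> str:
    lowered = text.lower()
    return "recommended" if "follow-up" in lowered or "follow up" in lowered else "none"


if __name__ == "__main__":
    report = "6 mm nodule in right upper lobe. Follow-up CT in 6 months."
    print(run_pipeline(report, toy_has_malignancy, toy_has_nodule, toy_follow_up))
```

Passing the classifiers in as callables keeps the gating logic separate from the models, so the keyword stand-ins could be swapped for real text classifiers without changing `run_pipeline`.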
Results: Precision, recall, and F1 score were 0.85, 0.71, and 0.77, respectively, for malignancy; 0.87, 0.83, and 0.85 for nodule; and 0.82, 0.90, and 0.85 for follow-up. Overall accuracy for follow-up in the absence of malignancy with a nodule present was 93.3%. The overall recommended follow-up rate was 12.4%, with 10.1% of patients having evidence of known or prior malignancy.
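As a quick check on how the reported metrics relate, F1 is the harmonic mean of precision and recall, F1 = 2PR / (P + R). The short sketch below recomputes F1 from the rounded precision and recall values quoted above; the function name and dictionary layout are choices made for this example. Because the inputs are already rounded to two decimals, the recomputed value can differ from the reported one in the last digit (the follow-up step comes out near 0.858 against a reported 0.85).

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (precision, recall, reported F1) for each pipeline step, as quoted in the Results.
reported = {
    "malignancy": (0.85, 0.71, 0.77),
    "nodule": (0.87, 0.83, 0.85),
    "follow-up": (0.82, 0.90, 0.85),
}

if __name__ == "__main__":
    for step, (p, r, f1) in reported.items():
        print(f"{step}: recomputed F1 = {f1_score(p, r):.3f}, reported = {f1:.2f}")
```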
Conclusions: We developed an accurate, open-access pipeline to identify ILNs with recommended follow-up on ED chest CTs. While the prevalence of recommended follow-up is lower than in some prior studies, it more accurately reflects the prevalence of truly incidental findings without prior or known malignancy. Incorporating this tool could reduce errors by improving the identification, communication, and tracking of ILNs.
About the journal:
Academic Emergency Medicine (AEM) is the official monthly publication of the Society for Academic Emergency Medicine (SAEM) and publishes information relevant to the practice, educational advancements, and investigation of emergency medicine. It is the second-largest peer-reviewed scientific journal in the specialty of emergency medicine.
The goal of AEM is to advance the science, education, and clinical practice of emergency medicine, to serve as a voice for the academic emergency medicine community, and to promote SAEM's goals and objectives. Members and non-members worldwide depend on this journal for translational medicine relevant to emergency medicine, as well as for clinical news, case studies and more.
Each issue contains information relevant to the research, educational advancements, and practice in emergency medicine. Subject matter is diverse, including preclinical studies, clinical topics, health policy, and educational methods. The research of SAEM members contributes significantly to the scientific content and development of the journal.