Jessie Torgersen, Melissa Skanderson, Farah Kidwai-Khan, Dena M Carbonari, Janet P Tate, Lesley S Park, Debika Bhattacharya, Joseph K Lim, Tamar H Taddei, Amy C Justice, Vincent Lo Re
{"title":"Identification of hepatic steatosis among persons with and without HIV using natural language processing.","authors":"Jessie Torgersen, Melissa Skanderson, Farah Kidwai-Khan, Dena M Carbonari, Janet P Tate, Lesley S Park, Debika Bhattacharya, Joseph K Lim, Tamar H Taddei, Amy C Justice, Vincent Lo Re","doi":"10.1097/HC9.0000000000000468","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Steatotic liver disease (SLD) is a growing phenomenon, and our understanding of its determinants has been limited by our ability to identify it clinically. Natural language processing (NLP) can potentially identify hepatic steatosis systematically within large clinical repositories of imaging reports. We validated the performance of an NLP algorithm for the identification of SLD in clinical imaging reports and applied this tool to a large population of people with and without HIV.</p><p><strong>Methods: </strong>Patients were included in the analysis if they enrolled in the Veterans Aging Cohort Study between 2001 and 2017, had an imaging report inclusive of the liver, and had ≥2 years of observation before the imaging study. SLD was considered present when reports contained the terms \"fatty,\" \"steatosis,\" \"steatotic,\" or \"steatohepatitis.\" The performance of the SLD NLP algorithm was compared to a clinical review of 800 reports. We then applied the NLP algorithm to the first eligible imaging study and compared patient characteristics by SLD and HIV status.</p><p><strong>Results: </strong>NLP achieved 100% sensitivity and 88.5% positive predictive value for the identification of SLD. When applied to 26,706 eligible Veterans Aging Cohort Study patient imaging reports, SLD was identified in 72.2% and did not significantly differ by HIV status. SLD was associated with a higher prevalence of metabolic comorbidities, alcohol use disorder, and hepatitis B and C, but not HIV infection.</p><p><strong>Conclusions: </strong>While limited to those undergoing radiologic study, the NLP algorithm accurately identified SLD in people with and without HIV and offers a valuable tool to evaluate the determinants and consequences of hepatic steatosis.</p>","PeriodicalId":12978,"journal":{"name":"Hepatology Communications","volume":"8 7","pages":""},"PeriodicalIF":5.6000,"publicationDate":"2024-06-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11186806/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Hepatology Communications","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1097/HC9.0000000000000468","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/7/1 0:00:00","PubModel":"eCollection","JCR":"Q1","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Steatotic liver disease (SLD) is a growing phenomenon, and our understanding of its determinants has been limited by our ability to identify it clinically. Natural language processing (NLP) can potentially identify hepatic steatosis systematically within large clinical repositories of imaging reports. We validated the performance of an NLP algorithm for the identification of SLD in clinical imaging reports and applied this tool to a large population of people with and without HIV.
Methods: Patients were included in the analysis if they enrolled in the Veterans Aging Cohort Study between 2001 and 2017, had an imaging report inclusive of the liver, and had ≥2 years of observation before the imaging study. SLD was considered present when reports contained the terms "fatty," "steatosis," "steatotic," or "steatohepatitis." The performance of the SLD NLP algorithm was compared to a clinical review of 800 reports. We then applied the NLP algorithm to the first eligible imaging study and compared patient characteristics by SLD and HIV status.
Results: NLP achieved 100% sensitivity and 88.5% positive predictive value for the identification of SLD. When applied to 26,706 eligible Veterans Aging Cohort Study patient imaging reports, SLD was identified in 72.2% and did not significantly differ by HIV status. SLD was associated with a higher prevalence of metabolic comorbidities, alcohol use disorder, and hepatitis B and C, but not HIV infection.
Conclusions: While limited to those undergoing radiologic study, the NLP algorithm accurately identified SLD in people with and without HIV and offers a valuable tool to evaluate the determinants and consequences of hepatic steatosis.
期刊介绍:
Hepatology Communications is a peer-reviewed, online-only, open access journal for fast dissemination of high quality basic, translational, and clinical research in hepatology. Hepatology Communications maintains high standard and rigorous peer review. Because of its open access nature, authors retain the copyright to their works, all articles are immediately available and free to read and share, and it is fully compliant with funder and institutional mandates. The journal is committed to fast publication and author satisfaction.