Melissa Chew, Catherine Yu, Leanne Stojevski, Paul Conilione, Anthony Gust, Mani Suleiman, Will Swansson, Bennett Anderson, Mayur Garg, Diana Lewis
{"title":"A Novel Natural Language Processing Tool Improves Colonoscopy Auditing of Adenoma and Serrated Polyp Detection Rates.","authors":"Melissa Chew, Catherine Yu, Leanne Stojevski, Paul Conilione, Anthony Gust, Mani Suleiman, Will Swansson, Bennett Anderson, Mayur Garg, Diana Lewis","doi":"10.1111/jgh.16933","DOIUrl":null,"url":null,"abstract":"<p><strong>Background and study aims: </strong>Determining adenoma detection rate (ADR) and serrated polyp detection rate (SDR) can be challenging as they usually involve manual matching of colonoscopy and histology reports. This study aimed to validate a Natural Language Processing (NLP) code that enables rapid and efficient data extraction to calculate ADR and SDR.</p><p><strong>Design: </strong>A NLP code was developed to automatically extract colonoscopy quality indicators from colonoscopy and histology reports at a tertiary health service. These reports were manually reviewed to verify the concordance of ADR and SDR between the two methods. This process was applied in the initial training phase, repeated following modification of the code, and again with a validation cohort.</p><p><strong>Results: </strong>Included in the training and test phases were 5911 colonoscopies, with 2022 in the validation phase. The NLP code extracted patient names with 99.9% concordance and had a 98.9% accuracy in ADR and SDR in the training phase. Search terms were subsequently modified to take into consideration spelling variations and overlapping terminologies. Using data from the same cohort, accuracy of the NLP improved to 100%, excluding four colonoscopies that had missing histology reports in the test phase. Using a validated cohort, NLP had a 99.9% accuracy in ADR and SDR. The total time taken for auditing using NLP in the validation phase was less than 1 h.</p><p><strong>Conclusions: </strong>An automatic NLP code had an accuracy of almost 100% in determining ADR and SDR in a tertiary colonoscopy service. Wider adoption of NLP enables significant improvements in colonoscopy audits that is accurate and time efficient.</p>","PeriodicalId":15877,"journal":{"name":"Journal of Gastroenterology and Hepatology","volume":" ","pages":""},"PeriodicalIF":3.7000,"publicationDate":"2025-03-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Gastroenterology and Hepatology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1111/jgh.16933","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
Background and study aims: Determining adenoma detection rate (ADR) and serrated polyp detection rate (SDR) can be challenging as they usually involve manual matching of colonoscopy and histology reports. This study aimed to validate a Natural Language Processing (NLP) code that enables rapid and efficient data extraction to calculate ADR and SDR.
Design: A NLP code was developed to automatically extract colonoscopy quality indicators from colonoscopy and histology reports at a tertiary health service. These reports were manually reviewed to verify the concordance of ADR and SDR between the two methods. This process was applied in the initial training phase, repeated following modification of the code, and again with a validation cohort.
Results: Included in the training and test phases were 5911 colonoscopies, with 2022 in the validation phase. The NLP code extracted patient names with 99.9% concordance and had a 98.9% accuracy in ADR and SDR in the training phase. Search terms were subsequently modified to take into consideration spelling variations and overlapping terminologies. Using data from the same cohort, accuracy of the NLP improved to 100%, excluding four colonoscopies that had missing histology reports in the test phase. Using a validated cohort, NLP had a 99.9% accuracy in ADR and SDR. The total time taken for auditing using NLP in the validation phase was less than 1 h.
Conclusions: An automatic NLP code had an accuracy of almost 100% in determining ADR and SDR in a tertiary colonoscopy service. Wider adoption of NLP enables significant improvements in colonoscopy audits that is accurate and time efficient.
期刊介绍:
Journal of Gastroenterology and Hepatology is produced 12 times per year and publishes peer-reviewed original papers, reviews and editorials concerned with clinical practice and research in the fields of hepatology, gastroenterology and endoscopy. Papers cover the medical, radiological, pathological, biochemical, physiological and historical aspects of the subject areas. All submitted papers are reviewed by at least two referees expert in the field of the submitted paper.