Prospective Evaluation of Real-Time Artificial Intelligence for the Hill Classification of the Gastroesophageal Junction.

IF 5.8, Zone 2 (Medicine), Q1 GASTROENTEROLOGY & HEPATOLOGY

United European Gastroenterology Journal Pub Date : 2025-03-01 Epub Date: 2024-12-12 DOI:10.1002/ueg2.12721

Ioannis Kafetzis, Philipp Sodmann, Bianca-Elena Herghelegiu, Markus Brand, Wolfram G Zoller, Florian Seyfried, Karl-Hermann Fuchs, Alexander Meining, Alexander Hann

Pages: 240-246. Full text (PMC): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11975621/pdf/


Abstract

Background: Assessment of the gastroesophageal junction (GEJ) is an integral part of gastroscopy; however, the absence of standardized reporting hinders consistency of examination documentation. The Hill classification offers a standardized approach for evaluating the GEJ. This study aims to compare the accuracy of an artificial intelligence (AI) system with that of physicians in classifying the GEJ according to Hill in a prospective, blinded, superiority trial.

Methods: Consecutive patients scheduled for gastroscopy with an intact GEJ were recruited during clinical routine from October 2023 to December 2023. Nine physicians (six experienced, three inexperienced) assessed the Hill grade while the AI system operated in the background in real time. The gold standard was determined by a majority vote of independent assessments by three expert endoscopists who did not participate in the study. The primary outcome was accuracy. Secondary outcomes were a per-Hill-grade analysis and separate comparisons for experienced and inexperienced endoscopists.
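The gold-standard rule and the primary outcome described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the study's software: the function names `majority_grade` and `accuracy`, the encoding of Hill grades as integers 1 to 4, and the choice to return `None` when no grade reaches a majority are all hypothetical, since the abstract does not say how a three-way disagreement between experts was handled.

```python
from collections import Counter
from typing import Optional, Sequence


def majority_grade(votes: Sequence[int]) -> Optional[int]:
    """Return the Hill grade chosen by a strict majority of raters, else None.

    Assumption: grades are encoded as integers 1-4, and a case without a
    strict majority is left unresolved (None) rather than adjudicated.
    """
    if not votes:
        return None
    grade, count = Counter(votes).most_common(1)[0]
    # Strict majority: more than half of the raters must agree.
    return grade if count > len(votes) / 2 else None


def accuracy(predictions: Sequence[int], reference: Sequence[int]) -> float:
    """Fraction of examinations where the prediction equals the reference grade."""
    if len(predictions) != len(reference) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    matches = sum(p == r for p, r in zip(predictions, reference))
    return matches / len(reference)
```

With three raters, `majority_grade([2, 2, 3])` yields grade 2, while `majority_grade([1, 2, 3])` yields `None`, which makes the unresolved case explicit instead of silently picking a grade.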

Results: In 131 analysed examinations, the AI's accuracy of 84.7% (95% CI: 78.6-90.8) was significantly higher than the physicians' 62.5% (95% CI: 54.2-71) (p < 0.01). In the per-Hill-class analysis, the AI outperformed physicians in all but one case. The AI was significantly more accurate than inexperienced physicians (85% vs. 56%, p < 0.01) and showed a trend toward higher accuracy than experienced physicians (84% vs. 69.6%, p = 0.07).
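To make the reported interval easier to interpret, the sketch below computes a normal-approximation (Wald) 95% confidence interval for a proportion. Two things here are assumptions rather than facts from the abstract: the count of 111 correct classifications is inferred from 84.7% of 131 examinations, and the Wald method is only one plausible choice, as the abstract does not state how the authors computed their intervals. The function name `wald_ci` is hypothetical.

```python
import math
from typing import Tuple


def wald_ci(successes: int, n: int, z: float = 1.96) -> Tuple[float, float, float]:
    """Return (proportion, lower, upper) using the normal approximation."""
    if n <= 0 or not 0 <= successes <= n:
        raise ValueError("need n > 0 and 0 <= successes <= n")
    p = successes / n
    half_width = z * math.sqrt(p * (1 - p) / n)
    # Clamp to [0, 1] because the Wald interval can overshoot near the bounds.
    return p, max(0.0, p - half_width), min(1.0, p + half_width)


# Assumption: 111 of 131 correct is inferred from the reported 84.7%,
# it is not a count given in the abstract.
p, lower, upper = wald_ci(111, 131)
print(f"accuracy {p:.1%}, 95% CI {lower:.1%} to {upper:.1%}")
```

Under these assumptions the interval lands within about 0.1 percentage point of the reported 78.6-90.8, which suggests a normal approximation or a close relative was used, although the abstract alone cannot confirm that.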

Conclusions: AI was significantly more accurate than examiners in assessing the Hill classification. This superior model performance can prove beneficial for endoscopists, especially those with limited experience.

Trial registration: ClinicalTrials.gov identifier: NCT06040723.


Source journal

United European Gastroenterology Journal (GASTROENTEROLOGY & HEPATOLOGY)

CiteScore: 10.50

Self-citation rate: 13.30%

Articles published: 147

About the journal: United European Gastroenterology Journal (UEG Journal) is the official journal of United European Gastroenterology (UEG), a professional non-profit organisation combining all the leading European societies concerned with digestive disease. UEG’s member societies represent over 22,000 specialists working across medicine, surgery, paediatrics, GI oncology and endoscopy, which makes UEG a unique platform for collaboration and the exchange of knowledge.