Rahul Ramakrishnan BS , Kevin Kuang BA , Vijay Rajput MD , Mark Benson MD , Sachin Mohan MD, PhD
{"title":"Esophageal varices detection and bleeding risk assessment with artificial intelligence: a systematic review","authors":"Rahul Ramakrishnan BS , Kevin Kuang BA , Vijay Rajput MD , Mark Benson MD , Sachin Mohan MD, PhD","doi":"10.1016/j.igie.2024.10.001","DOIUrl":null,"url":null,"abstract":"<div><h3>Background and Aims</h3><div>Esophageal varices (EVs) result from portal hypertension in decompensated liver disease secondary to liver cirrhosis. Diagnosis and grading is done using EGD and mucosal examination. However, interobserver differences may occur when analyzing EGD results. Recent application of artificial intelligence (AI) algorithms yielded mixed results for varices detection and bleeding risk estimation. The capabilities of AI in the detection and grading of EVs need to be evaluated.</div></div><div><h3>Methods</h3><div>A systematic review was conducted with Preferred Reporting Items for Systematic reviews and Meta-Analyses guidelines. PubMed, EMBASE, and MEDLINE were searched from January 1956 to July 2023. Data were manually identified and extracted by reviewers, assessing outcomes of AI tools used, EV detection accuracies, and bleeding risk prediction accuracies. Average accuracies were derived from result sections or manual calculations.</div></div><div><h3>Results</h3><div>Sixteen studies with 26,170 patients and 19 AI systems and algorithms were included after a review of 1670 studies. The most common AI systems were artificial neural network and random forest. The categorical boosting machine learning (ML) algorithm was most accurate for prediction of bleeding (100%), whereas the radiomic model ML tool was the least accurate for EV detection (49%). Overall, AI had an average EV detection accuracy of 78.67% and variceal bleed accuracy of 83.2%.</div></div><div><h3>Conclusions</h3><div>AI could provide an accessible interface for EV prediction and estimation of bleeding risk. Limitations include the dependence on a single dataset for efficacy, assessment with specific AI tools, and potential overinterpretation of broad geographic variances. Newer algorithms should have larger datasets with reproducible validity to strengthen the predictive value in clinical practice.</div></div>","PeriodicalId":100652,"journal":{"name":"iGIE","volume":"3 4","pages":"Pages 478-486"},"PeriodicalIF":0.0000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"iGIE","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S294970862400133X","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Background and Aims
Esophageal varices (EVs) result from portal hypertension in decompensated liver disease secondary to liver cirrhosis. Diagnosis and grading is done using EGD and mucosal examination. However, interobserver differences may occur when analyzing EGD results. Recent application of artificial intelligence (AI) algorithms yielded mixed results for varices detection and bleeding risk estimation. The capabilities of AI in the detection and grading of EVs need to be evaluated.
Methods
A systematic review was conducted with Preferred Reporting Items for Systematic reviews and Meta-Analyses guidelines. PubMed, EMBASE, and MEDLINE were searched from January 1956 to July 2023. Data were manually identified and extracted by reviewers, assessing outcomes of AI tools used, EV detection accuracies, and bleeding risk prediction accuracies. Average accuracies were derived from result sections or manual calculations.
Results
Sixteen studies with 26,170 patients and 19 AI systems and algorithms were included after a review of 1670 studies. The most common AI systems were artificial neural network and random forest. The categorical boosting machine learning (ML) algorithm was most accurate for prediction of bleeding (100%), whereas the radiomic model ML tool was the least accurate for EV detection (49%). Overall, AI had an average EV detection accuracy of 78.67% and variceal bleed accuracy of 83.2%.
Conclusions
AI could provide an accessible interface for EV prediction and estimation of bleeding risk. Limitations include the dependence on a single dataset for efficacy, assessment with specific AI tools, and potential overinterpretation of broad geographic variances. Newer algorithms should have larger datasets with reproducible validity to strengthen the predictive value in clinical practice.