Michelle Chae Min Lee, Armin Farahvash, Petros Zezos
Artificial Intelligence for Classification of Endoscopic Severity of Inflammatory Bowel Disease: A Systematic Review and Critical Appraisal
Inflammatory Bowel Diseases (impact factor 4.5; JCR Q1, Gastroenterology & Hepatology). Journal article, published 2025-03-31.
DOI: 10.1093/ibd/izaf050 (https://doi.org/10.1093/ibd/izaf050)
Citation count: 0
Abstract
Background: Endoscopic scoring indices for ulcerative colitis and Crohn's disease are subject to inter-endoscopist variability. There is increasing interest in the development of deep learning models to standardize endoscopic assessment of intestinal diseases. Here, we summarize and critically appraise the literature on artificial intelligence-assisted endoscopic characterization of inflammatory bowel disease severity.
Methods: A systematic search of Ovid MEDLINE, EMBASE, Cochrane Central Register of Controlled Trials, and IEEE Xplore was performed to identify reports of AI systems used for endoscopic severity classification of IBD. Selected studies were critically appraised for methodological and reporting quality using APPRAISE-AI.
Results: Thirty-one studies published between 2019 and 2024 were included. Of these, 28 examined endoscopic classification of ulcerative colitis and 3 examined Crohn's disease. Researchers sought to accomplish a wide range of classification tasks, including binary and multilevel classification, based on still images or full-length colonoscopy videos. Overall scores for study quality ranged from 41 (moderate quality) to 64 (high quality) out of 100, with 28 of the 31 studies falling within the moderate quality range. The highest-scoring domains were clinical relevance and reporting quality, while the lowest-scoring domains were robustness of results and reproducibility.
Conclusions: Multiple AI models have demonstrated the potential for clinical translation for ulcerative colitis. Research concerning the endoscopic severity assessment of Crohn's disease is limited and should be further explored. More rigorous external validation of AI models and increased transparency of data and code are needed to improve the quality of AI studies.
About the journal:
Inflammatory Bowel Diseases® supports the mission of the Crohn's & Colitis Foundation by bringing the most impactful and cutting-edge clinical topics and research findings related to inflammatory bowel diseases to clinicians and researchers working in IBD and related fields. The Journal is committed to publishing on innovative topics that influence the future of clinical care, treatment, and research.