使用多中心临床试验数据训练和部署溃疡性结肠炎内镜严重程度分级的深度学习模型。

IF 2.4 Q2 GASTROENTEROLOGY & HEPATOLOGY

Therapeutic Advances in Gastrointestinal Endoscopy Pub Date : 2021-02-25 eCollection Date: 2021-01-01 DOI:10.1177/2631774521990623

Benjamin Gutierrez Becker, Filippo Arcadu, Andreas Thalhammer, Citlalli Gamez Serna, Owen Feehan, Faye Drawnel, Young S Oh, Marco Prunotto

{"title":"使用多中心临床试验数据训练和部署溃疡性结肠炎内镜严重程度分级的深度学习模型。","authors":"Benjamin Gutierrez Becker, Filippo Arcadu, Andreas Thalhammer, Citlalli Gamez Serna, Owen Feehan, Faye Drawnel, Young S Oh, Marco Prunotto","doi":"10.1177/2631774521990623","DOIUrl":null,"url":null,"abstract":"Introduction: The Mayo Clinic Endoscopic Subscore is a commonly used grading system to assess the severity of ulcerative colitis. Correctly grading colonoscopies using the Mayo Clinic Endoscopic Subscore is a challenging task, with suboptimal rates of interrater and intrarater variability observed even among experienced and sufficiently trained experts. In recent years, several machine learning algorithms have been proposed in an effort to improve the standardization and reproducibility of Mayo Clinic Endoscopic Subscore grading.Methods: Here we propose an end-to-end fully automated system based on deep learning to predict a binary version of the Mayo Clinic Endoscopic Subscore directly from raw colonoscopy videos. Differently from previous studies, the proposed method mimics the assessment done in practice by a gastroenterologist, that is, traversing the whole colonoscopy video, identifying visually informative regions and computing an overall Mayo Clinic Endoscopic Subscore. The proposed deep learning-based system has been trained and deployed on raw colonoscopies using Mayo Clinic Endoscopic Subscore ground truth provided only at the colon section level, without manually selecting frames driving the severity scoring of ulcerative colitis.Results and conclusion: Our evaluation on 1672 endoscopic videos obtained from a multisite data set obtained from the etrolizumab Phase II Eucalyptus and Phase III Hickory and Laurel clinical trials, show that our proposed methodology can grade endoscopic videos with a high degree of accuracy and robustness (Area Under the Receiver Operating Characteristic Curve = 0.84 for Mayo Clinic Endoscopic Subscore ⩾ 1, 0.85 for Mayo Clinic Endoscopic Subscore ⩾ 2 and 0.85 for Mayo Clinic Endoscopic Subscore ⩾ 3) and reduced amounts of manual annotation.Plain language summary: Patient, caregiver and provider thoughts on educational materials about prescribing and medication safety Artificial intelligence can be used to automatically assess full endoscopic videos and estimate the severity of ulcerative colitis. In this work, we present an artificial intelligence algorithm for the automatic grading of ulcerative colitis in full endoscopic videos. Our artificial intelligence models were trained and evaluated on a large and diverse set of colonoscopy videos obtained from concluded clinical trials. We demonstrate not only that artificial intelligence is able to accurately grade full endoscopic videos, but also that using diverse data sets obtained from multiple sites is critical to train robust AI models that could potentially be deployed on real-world data.","PeriodicalId":40947,"journal":{"name":"Therapeutic Advances in Gastrointestinal Endoscopy","volume":"14 ","pages":"2631774521990623"},"PeriodicalIF":2.4000,"publicationDate":"2021-02-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://sci-hub-pdf.com/10.1177/2631774521990623","citationCount":"32","resultStr":"{\"title\":\"Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data.\",\"authors\":\"Benjamin Gutierrez Becker, Filippo Arcadu, Andreas Thalhammer, Citlalli Gamez Serna, Owen Feehan, Faye Drawnel, Young S Oh, Marco Prunotto\",\"doi\":\"10.1177/2631774521990623\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Introduction: The Mayo Clinic Endoscopic Subscore is a commonly used grading system to assess the severity of ulcerative colitis. Correctly grading colonoscopies using the Mayo Clinic Endoscopic Subscore is a challenging task, with suboptimal rates of interrater and intrarater variability observed even among experienced and sufficiently trained experts. In recent years, several machine learning algorithms have been proposed in an effort to improve the standardization and reproducibility of Mayo Clinic Endoscopic Subscore grading.Methods: Here we propose an end-to-end fully automated system based on deep learning to predict a binary version of the Mayo Clinic Endoscopic Subscore directly from raw colonoscopy videos. Differently from previous studies, the proposed method mimics the assessment done in practice by a gastroenterologist, that is, traversing the whole colonoscopy video, identifying visually informative regions and computing an overall Mayo Clinic Endoscopic Subscore. The proposed deep learning-based system has been trained and deployed on raw colonoscopies using Mayo Clinic Endoscopic Subscore ground truth provided only at the colon section level, without manually selecting frames driving the severity scoring of ulcerative colitis.Results and conclusion: Our evaluation on 1672 endoscopic videos obtained from a multisite data set obtained from the etrolizumab Phase II Eucalyptus and Phase III Hickory and Laurel clinical trials, show that our proposed methodology can grade endoscopic videos with a high degree of accuracy and robustness (Area Under the Receiver Operating Characteristic Curve = 0.84 for Mayo Clinic Endoscopic Subscore ⩾ 1, 0.85 for Mayo Clinic Endoscopic Subscore ⩾ 2 and 0.85 for Mayo Clinic Endoscopic Subscore ⩾ 3) and reduced amounts of manual annotation.Plain language summary: Patient, caregiver and provider thoughts on educational materials about prescribing and medication safety Artificial intelligence can be used to automatically assess full endoscopic videos and estimate the severity of ulcerative colitis. In this work, we present an artificial intelligence algorithm for the automatic grading of ulcerative colitis in full endoscopic videos. Our artificial intelligence models were trained and evaluated on a large and diverse set of colonoscopy videos obtained from concluded clinical trials. We demonstrate not only that artificial intelligence is able to accurately grade full endoscopic videos, but also that using diverse data sets obtained from multiple sites is critical to train robust AI models that could potentially be deployed on real-world data.\",\"PeriodicalId\":40947,\"journal\":{\"name\":\"Therapeutic Advances in Gastrointestinal Endoscopy\",\"volume\":\"14 \",\"pages\":\"2631774521990623\"},\"PeriodicalIF\":2.4000,\"publicationDate\":\"2021-02-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://sci-hub-pdf.com/10.1177/2631774521990623\",\"citationCount\":\"32\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Therapeutic Advances in Gastrointestinal Endoscopy\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1177/2631774521990623\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2021/1/1 0:00:00\",\"PubModel\":\"eCollection\",\"JCR\":\"Q2\",\"JCRName\":\"GASTROENTEROLOGY & HEPATOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Therapeutic Advances in Gastrointestinal Endoscopy","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1177/2631774521990623","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2021/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"GASTROENTEROLOGY & HEPATOLOGY","Score":null,"Total":0}

引用次数: 32

摘要

梅奥诊所内窥镜评分是一种常用的评估溃疡性结肠炎严重程度的分级系统。使用梅奥诊所内窥镜评分对结肠镜检查进行正确分级是一项具有挑战性的任务，即使在经验丰富且训练有素的专家中，也会观察到镜间和镜内变异性的次优率。近年来，人们提出了几种机器学习算法，以提高梅奥诊所内镜评分的标准化和可重复性。方法:在这里，我们提出了一个基于深度学习的端到端全自动系统，直接从原始结肠镜检查视频中预测梅奥诊所内窥镜评分的二进制版本。与以往的研究不同，本文提出的方法模拟了胃肠病学家在实践中所做的评估，即遍历整个结肠镜检查视频，识别视觉信息区域，并计算梅奥诊所的整体内窥镜评分。所提出的基于深度学习的系统已经在原始结肠镜检查中进行了训练和部署，使用的是仅在结肠切片水平提供的梅奥诊所内镜Subscore基础事实，而不是手动选择驱动溃疡性结肠炎严重程度评分的框架。结果与结论:我们对从etrolizumab II期桉树和III期山核桃和月桂临床试验获得的多站点数据集获得的1672个内视镜视频的评估表明，我们提出的方法可以对内视镜视频进行分级，具有高度的准确性和稳健性(对于梅奥诊所内视镜亚评分大于或等于1的接受者操作特征曲线下面积= 0.84，梅奥诊所内窥镜子评分大于或等于2的0.85和梅奥诊所内窥镜子评分大于或等于3的0.85)，减少了人工注释的数量。简单的语言总结:患者，护理人员和提供者对处方和药物安全教育材料的想法人工智能可用于自动评估完整的内镜视频并估计溃疡性结肠炎的严重程度。在这项工作中，我们提出了一种用于溃疡性结肠炎全内镜视频自动分级的人工智能算法。我们的人工智能模型是在大量不同的结肠镜检查视频上进行训练和评估的，这些视频来自于已结束的临床试验。我们不仅证明了人工智能能够准确地对完整的内镜视频进行分级，而且还证明了使用从多个站点获得的不同数据集对于训练可能部署在现实世界数据上的强大人工智能模型至关重要。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data.

查看原文本刊更多论文

Training and deploying a deep learning model for endoscopic severity grading in ulcerative colitis using multicenter clinical trial data.

Introduction: The Mayo Clinic Endoscopic Subscore is a commonly used grading system to assess the severity of ulcerative colitis. Correctly grading colonoscopies using the Mayo Clinic Endoscopic Subscore is a challenging task, with suboptimal rates of interrater and intrarater variability observed even among experienced and sufficiently trained experts. In recent years, several machine learning algorithms have been proposed in an effort to improve the standardization and reproducibility of Mayo Clinic Endoscopic Subscore grading.

Methods: Here we propose an end-to-end fully automated system based on deep learning to predict a binary version of the Mayo Clinic Endoscopic Subscore directly from raw colonoscopy videos. Differently from previous studies, the proposed method mimics the assessment done in practice by a gastroenterologist, that is, traversing the whole colonoscopy video, identifying visually informative regions and computing an overall Mayo Clinic Endoscopic Subscore. The proposed deep learning-based system has been trained and deployed on raw colonoscopies using Mayo Clinic Endoscopic Subscore ground truth provided only at the colon section level, without manually selecting frames driving the severity scoring of ulcerative colitis.

Results and conclusion: Our evaluation on 1672 endoscopic videos obtained from a multisite data set obtained from the etrolizumab Phase II Eucalyptus and Phase III Hickory and Laurel clinical trials, show that our proposed methodology can grade endoscopic videos with a high degree of accuracy and robustness (Area Under the Receiver Operating Characteristic Curve = 0.84 for Mayo Clinic Endoscopic Subscore ⩾ 1, 0.85 for Mayo Clinic Endoscopic Subscore ⩾ 2 and 0.85 for Mayo Clinic Endoscopic Subscore ⩾ 3) and reduced amounts of manual annotation.

Plain language summary: Patient, caregiver and provider thoughts on educational materials about prescribing and medication safety Artificial intelligence can be used to automatically assess full endoscopic videos and estimate the severity of ulcerative colitis. In this work, we present an artificial intelligence algorithm for the automatic grading of ulcerative colitis in full endoscopic videos. Our artificial intelligence models were trained and evaluated on a large and diverse set of colonoscopy videos obtained from concluded clinical trials. We demonstrate not only that artificial intelligence is able to accurately grade full endoscopic videos, but also that using diverse data sets obtained from multiple sites is critical to train robust AI models that could potentially be deployed on real-world data.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Therapeutic Advances in Gastrointestinal Endoscopy GASTROENTEROLOGY & HEPATOLOGY-

CiteScore

4.80

自引率

0.00%

发文量

审稿时长

13 weeks