IPNet: An Interpretable Network With Progressive Loss for Whole-Stage Colorectal Disease Diagnosis

Junhu Fu, Ke Chen, Qi Dou, Yun Gao, Yiping He, Pinghong Zhou, Shengli Lin, Yuanyuan Wang, Yi Guo
IEEE Transactions on Medical Imaging, vol. 44, no. 2, pp. 789-800. Journal article, published 2024-09-19.
DOI: 10.1109/TMI.2024.3459910
https://ieeexplore.ieee.org/document/10684448/

Abstract

Colorectal cancer is a leading cause of cancer-related death, primarily because early-stage disease lacks obvious symptoms. Whole-stage colorectal disease diagnosis is crucial for assessing lesion evolution and determining treatment plans. However, locality differences and disease progression lead to intra-class disparities and inter-class similarities in colorectal lesion representations. In addition, interpretable algorithms that explain lesion progression are still lacking, leaving the prediction process a “black box”. In this paper, we propose IPNet, a dual-branch interpretable network with progressive loss for whole-stage colorectal disease diagnosis. The dual-branch architecture captures unbiased features representing diverse localities to suppress intra-class variation. The progressive loss function accounts for inter-class relationships, using prior knowledge of disease evolution to guide classification. Furthermore, a novel Grain-CAM is designed to interpret IPNet by visualizing pixel-wise attention maps from shallow to deep layers, highlighting regions semantically related to IPNet’s progressive classification. We conducted whole-stage diagnosis on two image modalities: colorectal lesion classification on 129,893 endoscopic optical images and rectal tumor T-staging on 11,072 endoscopic ultrasound images. IPNet surpasses other state-of-the-art algorithms, achieving accuracies of 93.15% and 89.62%, respectively. In particular, it establishes effective decision boundaries for challenging distinctions such as polyp vs. adenoma and T2 vs. T3. The results demonstrate an explainable approach to colorectal lesion classification at the whole-stage level, and rectal tumor T-staging by endoscopic ultrasound is explored for the first time. IPNet is expected to see further application, assisting physicians in whole-stage disease diagnosis and enhancing diagnostic interpretability.
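
The abstract does not give the form of the progressive loss, so the following is only an illustrative sketch of the general idea of letting class ordering guide classification. It assumes the disease stages can be indexed in order of progression and uses cross-entropy against soft targets that decay with stage distance, which is a common ordinal-classification technique and not necessarily what IPNet does. The function names and the `sharpness` parameter are hypothetical.

```python
import math


def ordinal_soft_targets(true_stage, num_stages, sharpness=1.0):
    """Soft label whose mass decays with distance from the true stage.

    Hypothetical helper: stages are assumed to be indexed 0..num_stages-1
    in order of disease progression.
    """
    weights = [math.exp(-sharpness * abs(k - true_stage))
               for k in range(num_stages)]
    total = sum(weights)
    return [w / total for w in weights]


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(z - peak) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def stage_aware_loss(logits, true_stage, sharpness=1.0):
    """Cross-entropy against distance-decayed soft targets.

    Illustrative stand-in for a loss that encodes stage ordering;
    not the formulation used in the paper.
    """
    probs = softmax(logits)
    targets = ordinal_soft_targets(true_stage, len(logits), sharpness)
    return -sum(t * math.log(p) for t, p in zip(targets, probs))


if __name__ == "__main__":
    # Four ordered stages, true stage is index 1.
    correct = stage_aware_loss([0.0, 4.0, 0.0, 0.0], true_stage=1)
    adjacent = stage_aware_loss([0.0, 0.0, 4.0, 0.0], true_stage=1)
    distant = stage_aware_loss([0.0, 0.0, 0.0, 4.0], true_stage=1)
    print(correct, adjacent, distant)
```

Under this sketch, a confident prediction one stage away from the truth is penalized less than an equally confident prediction two stages away, which is one way prior knowledge of disease evolution could shape the decision boundaries between neighboring classes.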