Robust and explainable framework to address data scarcity in diagnostic imaging

IF 6.3 2区 医学 Q1 BIOLOGY
Zehui Zhao , Laith Alzubaidi , Jinglan Zhang , Ye Duan , Usman Naseem , Yuantong Gu
{"title":"Robust and explainable framework to address data scarcity in diagnostic imaging","authors":"Zehui Zhao ,&nbsp;Laith Alzubaidi ,&nbsp;Jinglan Zhang ,&nbsp;Ye Duan ,&nbsp;Usman Naseem ,&nbsp;Yuantong Gu","doi":"10.1016/j.compbiomed.2025.111052","DOIUrl":null,"url":null,"abstract":"<div><div>Deep learning has significantly advanced automatic medical diagnostics, releasing human resources from clinical pressure, yet the persistent challenge of data scarcity in this area hampers its further improvements and applications. To address this gap, we introduce a novel ensemble framework called ‘Efficient Transfer and Self-supervised Learning based Ensemble Framework’ (ETSEF). ETSEF leverages features from multiple pre-trained deep learning models to efficiently learn powerful representations from a limited number of data samples. To the best of our knowledge, ETSEF is the first strategy that combines two pre-training methodologies (Transfer Learning and Self-supervised Learning) with ensemble learning approaches. Various data enhancement techniques, including data augmentation, feature fusion, feature selection, and decision fusion, have also been deployed to maximise the efficiency and robustness of the ETSEF model. Five independent medical imaging tasks, including endoscopy, breast cancer detection, monkeypox detection, brain tumour detection, and glaucoma detection, were tested to demonstrate ETSEF’s effectiveness and robustness. Facing limited sample numbers and challenging medical tasks, ETSEF has demonstrated its effectiveness by improving diagnostic accuracy by up to 13.3% compared to strong ensemble baseline models and up to 14.4% compared with recent state-of-the-art methods. Moreover, we emphasise the robustness and trustworthiness of the ETSEF method through various vision-explainable artificial intelligence techniques, including Grad-CAM, SHAP, and t-SNE. Compared to large-scale deep learning models, ETSEF can be flexibly deployed and maintain superior performance for challenging medical imaging tasks, demonstrating potential for application in areas lacking training data. The code is available at <span><span>Github ETSEF</span><svg><path></path></svg></span>.</div></div>","PeriodicalId":10578,"journal":{"name":"Computers in biology and medicine","volume":"197 ","pages":"Article 111052"},"PeriodicalIF":6.3000,"publicationDate":"2025-09-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computers in biology and medicine","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0010482525014040","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Deep learning has significantly advanced automatic medical diagnostics, releasing human resources from clinical pressure, yet the persistent challenge of data scarcity in this area hampers its further improvements and applications. To address this gap, we introduce a novel ensemble framework called ‘Efficient Transfer and Self-supervised Learning based Ensemble Framework’ (ETSEF). ETSEF leverages features from multiple pre-trained deep learning models to efficiently learn powerful representations from a limited number of data samples. To the best of our knowledge, ETSEF is the first strategy that combines two pre-training methodologies (Transfer Learning and Self-supervised Learning) with ensemble learning approaches. Various data enhancement techniques, including data augmentation, feature fusion, feature selection, and decision fusion, have also been deployed to maximise the efficiency and robustness of the ETSEF model. Five independent medical imaging tasks, including endoscopy, breast cancer detection, monkeypox detection, brain tumour detection, and glaucoma detection, were tested to demonstrate ETSEF’s effectiveness and robustness. Facing limited sample numbers and challenging medical tasks, ETSEF has demonstrated its effectiveness by improving diagnostic accuracy by up to 13.3% compared to strong ensemble baseline models and up to 14.4% compared with recent state-of-the-art methods. Moreover, we emphasise the robustness and trustworthiness of the ETSEF method through various vision-explainable artificial intelligence techniques, including Grad-CAM, SHAP, and t-SNE. Compared to large-scale deep learning models, ETSEF can be flexibly deployed and maintain superior performance for challenging medical imaging tasks, demonstrating potential for application in areas lacking training data. The code is available at Github ETSEF.
稳健和可解释的框架,以解决诊断成像中的数据稀缺问题。
深度学习极大地推动了医疗自动诊断的发展,将人力资源从临床压力中解放出来,但该领域数据稀缺的持续挑战阻碍了其进一步的改进和应用。为了解决这一差距,我们引入了一种新的集成框架,称为“基于高效迁移和自我监督学习的集成框架”(ETSEF)。ETSEF利用来自多个预训练深度学习模型的特征,从有限数量的数据样本中有效地学习强大的表示。据我们所知,ETSEF是第一个将两种预训练方法(迁移学习和自监督学习)与集成学习方法相结合的策略。各种数据增强技术,包括数据增强、特征融合、特征选择和决策融合,也被用于最大限度地提高ETSEF模型的效率和鲁棒性。测试了5个独立的医学成像任务,包括内窥镜检查、乳腺癌检测、猴痘检测、脑肿瘤检测和青光眼检测,以证明ETSEF的有效性和稳健性。面对有限的样本数量和具有挑战性的医疗任务,ETSEF已经证明了其有效性,与强集合基线模型相比,其诊断准确率提高了13.3%,与最近最先进的方法相比,其诊断准确率提高了14.4%。此外,我们通过各种视觉可解释的人工智能技术(包括grada - cam, SHAP和t-SNE)强调ETSEF方法的鲁棒性和可信度。与大规模深度学习模型相比,ETSEF可以灵活部署,并在具有挑战性的医学成像任务中保持优越的性能,显示出在缺乏训练数据的领域的应用潜力。代码可在Github ETSEF获得。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Computers in biology and medicine
Computers in biology and medicine 工程技术-工程:生物医学
CiteScore
11.70
自引率
10.40%
发文量
1086
审稿时长
74 days
期刊介绍: Computers in Biology and Medicine is an international forum for sharing groundbreaking advancements in the use of computers in bioscience and medicine. This journal serves as a medium for communicating essential research, instruction, ideas, and information regarding the rapidly evolving field of computer applications in these domains. By encouraging the exchange of knowledge, we aim to facilitate progress and innovation in the utilization of computers in biology and medicine.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信