通过双重蒸馏实现医学图像分割的多尺度情境学习。

IF 3.2 2区医学 Q1 RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING

Medical physics Pub Date : 2024-11-11 DOI:10.1002/mp.17506

Ruize Cui, Lanqing Liu, Youyi Song, Ge Ren, Xiaowei Hu, Jing Qin

{"title":"通过双重蒸馏实现医学图像分割的多尺度情境学习。","authors":"Ruize Cui, Lanqing Liu, Youyi Song, Ge Ren, Xiaowei Hu, Jing Qin","doi":"10.1002/mp.17506","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> Background</h3>\n \n <p>Recently, many studies have explored fusing features extracted from Convolutional neural networks (CNNs) and transformers to integrate multi-scale representations for better performance in medical image segmentation tasks. Although these hybrid models have achieved better results than previous CNN-based and transformer-based methods, they suffer from high computation and space complexities.</p>\n </section>\n \n <section>\n \n <h3> Purpose</h3>\n \n <p>The purpose of this research is to address the prohibitive computation and space complexities of hybrid models, which limit their application in clinical practice where computational resources are usually constrained.</p>\n </section>\n \n <section>\n \n <h3> Methods</h3>\n \n <p>We propose a novel model equipped with a dual distillation scheme to sufficiently harness the complementary advantages of CNNs and transformers without compromising model efficiency. We further propose a multi-scale prior-knowledge distillation (MPD) module to effectively distill multi-scale knowledge from features extracted from transformers. In addition, to cooperate with the knowledge distillation scheme, we also propose an efficient and robust Selective Fusion module in the student network.</p>\n </section>\n \n <section>\n \n <h3> Results</h3>\n \n <p>We extensively evaluate the proposed model against fourteen different network frameworks on two representative datasets: SipakMed and ISIC 2017. In the SipakMed dataset, 3037 Pap smear images are used for training and 1012 for testing. In the ISIC 2017 dataset, 2000 dermoscopic images are used for training, 150 for validation, and 600 for testing. Experimental results demonstrate that our method not only surpasses existing methods by a considerable margin with respect to the evaluation metrics of mean Intersection over Union, mean Dice coefficient, mean average symmetric surface distance, but also requires fewer computational resources in terms of model parameters and floating-point operations per second.</p>\n </section>\n \n <section>\n \n <h3> Conclusions</h3>\n \n <p>Comprehensive comparisons in terms of segmentation accuracy and computational complexity unequivocally confirm that our method effectively and efficiently integrates the advantages of both CNNs and transformers, showing its suitability and significance for clinical applications.</p>\n </section>\n </div>","PeriodicalId":18384,"journal":{"name":"Medical physics","volume":"52 2","pages":"787-800"},"PeriodicalIF":3.2000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Multi-scale contextual learning for medical image segmentation via dual distillation\",\"authors\":\"Ruize Cui, Lanqing Liu, Youyi Song, Ge Ren, Xiaowei Hu, Jing Qin\",\"doi\":\"10.1002/mp.17506\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n \\n \\n <section>\\n \\n <h3> Background</h3>\\n \\n <p>Recently, many studies have explored fusing features extracted from Convolutional neural networks (CNNs) and transformers to integrate multi-scale representations for better performance in medical image segmentation tasks. Although these hybrid models have achieved better results than previous CNN-based and transformer-based methods, they suffer from high computation and space complexities.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Purpose</h3>\\n \\n <p>The purpose of this research is to address the prohibitive computation and space complexities of hybrid models, which limit their application in clinical practice where computational resources are usually constrained.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Methods</h3>\\n \\n <p>We propose a novel model equipped with a dual distillation scheme to sufficiently harness the complementary advantages of CNNs and transformers without compromising model efficiency. We further propose a multi-scale prior-knowledge distillation (MPD) module to effectively distill multi-scale knowledge from features extracted from transformers. In addition, to cooperate with the knowledge distillation scheme, we also propose an efficient and robust Selective Fusion module in the student network.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Results</h3>\\n \\n <p>We extensively evaluate the proposed model against fourteen different network frameworks on two representative datasets: SipakMed and ISIC 2017. In the SipakMed dataset, 3037 Pap smear images are used for training and 1012 for testing. In the ISIC 2017 dataset, 2000 dermoscopic images are used for training, 150 for validation, and 600 for testing. Experimental results demonstrate that our method not only surpasses existing methods by a considerable margin with respect to the evaluation metrics of mean Intersection over Union, mean Dice coefficient, mean average symmetric surface distance, but also requires fewer computational resources in terms of model parameters and floating-point operations per second.</p>\\n </section>\\n \\n <section>\\n \\n <h3> Conclusions</h3>\\n \\n <p>Comprehensive comparisons in terms of segmentation accuracy and computational complexity unequivocally confirm that our method effectively and efficiently integrates the advantages of both CNNs and transformers, showing its suitability and significance for clinical applications.</p>\\n </section>\\n </div>\",\"PeriodicalId\":18384,\"journal\":{\"name\":\"Medical physics\",\"volume\":\"52 2\",\"pages\":\"787-800\"},\"PeriodicalIF\":3.2000,\"publicationDate\":\"2024-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Medical physics\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.1002/mp.17506\",\"RegionNum\":2,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Medical physics","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/mp.17506","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"RADIOLOGY, NUCLEAR MEDICINE & MEDICAL IMAGING","Score":null,"Total":0}

引用次数: 0

摘要

背景：最近，许多研究都在探索如何融合从卷积神经网络（CNN）和变换器中提取的特征，以整合多尺度表征，从而在医学图像分割任务中获得更好的性能。目的：本研究的目的是解决混合模型令人望而却步的计算和空间复杂性问题，这些问题限制了混合模型在临床实践中的应用，因为在临床实践中计算资源通常是有限的：我们提出了一种配备双重蒸馏方案的新型模型，以充分发挥 CNN 和变压器的互补优势，同时不影响模型的效率。我们进一步提出了多尺度先验知识蒸馏（MPD）模块，以便从变换器提取的特征中有效地蒸馏出多尺度知识。此外，为了配合知识提炼方案，我们还在学生网络中提出了一个高效、稳健的选择性融合模块：我们在两个具有代表性的数据集上针对 14 种不同的网络框架对所提出的模型进行了广泛评估：SipakMed 和 ISIC 2017。在 SipakMed 数据集中，3037 张巴氏涂片图像用于训练，1012 张用于测试。在 ISIC 2017 数据集中，2000 张皮肤镜图像用于训练，150 张用于验证，600 张用于测试。实验结果表明，我们的方法不仅在平均联合相交、平均骰子系数、平均对称面距离等评价指标上大大超过了现有方法，而且在模型参数和每秒浮点运算方面所需的计算资源也更少：从分割准确性和计算复杂性两方面进行的综合比较明确证实，我们的方法有效、高效地整合了 CNN 和变换器的优势，显示了其在临床应用中的适用性和重要性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Multi-scale contextual learning for medical image segmentation via dual distillation

Background

Recently, many studies have explored fusing features extracted from Convolutional neural networks (CNNs) and transformers to integrate multi-scale representations for better performance in medical image segmentation tasks. Although these hybrid models have achieved better results than previous CNN-based and transformer-based methods, they suffer from high computation and space complexities.

Purpose

The purpose of this research is to address the prohibitive computation and space complexities of hybrid models, which limit their application in clinical practice where computational resources are usually constrained.

Methods

We propose a novel model equipped with a dual distillation scheme to sufficiently harness the complementary advantages of CNNs and transformers without compromising model efficiency. We further propose a multi-scale prior-knowledge distillation (MPD) module to effectively distill multi-scale knowledge from features extracted from transformers. In addition, to cooperate with the knowledge distillation scheme, we also propose an efficient and robust Selective Fusion module in the student network.

Results

We extensively evaluate the proposed model against fourteen different network frameworks on two representative datasets: SipakMed and ISIC 2017. In the SipakMed dataset, 3037 Pap smear images are used for training and 1012 for testing. In the ISIC 2017 dataset, 2000 dermoscopic images are used for training, 150 for validation, and 600 for testing. Experimental results demonstrate that our method not only surpasses existing methods by a considerable margin with respect to the evaluation metrics of mean Intersection over Union, mean Dice coefficient, mean average symmetric surface distance, but also requires fewer computational resources in terms of model parameters and floating-point operations per second.

Conclusions

Comprehensive comparisons in terms of segmentation accuracy and computational complexity unequivocally confirm that our method effectively and efficiently integrates the advantages of both CNNs and transformers, showing its suitability and significance for clinical applications.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Medical physics 医学-核医学

CiteScore

6.80

自引率

15.80%

发文量

660

审稿时长

1.7 months

期刊介绍： Medical Physics publishes original, high impact physics, imaging science, and engineering research that advances patient diagnosis and therapy through contributions in 1) Basic science developments with high potential for clinical translation 2) Clinical applications of cutting edge engineering and physics innovations 3) Broadly applicable and innovative clinical physics developments Medical Physics is a journal of global scope and reach. By publishing in Medical Physics your research will reach an international, multidisciplinary audience including practicing medical physicists as well as physics- and engineering based translational scientists. We work closely with authors of promising articles to improve their quality.