MVKD-Trans:一种基于超声图像的乳腺癌分类的多视图知识蒸馏视觉转换架构。

IF 2.5 4区 医学 Q1 ACOUSTICS
Ultrasonic Imaging Pub Date : 2025-09-01 Epub Date: 2025-06-20 DOI:10.1177/01617346251346060
Dongchen Ling, Xiong Jiao
{"title":"MVKD-Trans:一种基于超声图像的乳腺癌分类的多视图知识蒸馏视觉转换架构。","authors":"Dongchen Ling, Xiong Jiao","doi":"10.1177/01617346251346060","DOIUrl":null,"url":null,"abstract":"<p><p>Breast cancer is the leading cancer threatening women's health. In recent years, deep neural networks have outperformed traditional methods in terms of both accuracy and efficiency for breast cancer classification. However, most ultrasound-based breast cancer classification methods rely on single-perspective information, which may lead to higher misdiagnosis rates. In this study, we propose a multi-view knowledge distillation vision transformer architecture (MVKD-Trans) for the classification of benign and malignant breast tumors. We utilize multi-view ultrasound images of the same tumor to capture diverse features. Additionally, we employ a shuffle module for feature fusion, extracting channel and spatial dual-attention information to improve the model's representational capability. Given the limited computational capacity of ultrasound devices, we also utilize knowledge distillation (KD) techniques to compress the multi-view network into a single-view network. The results show that the accuracy, area under the ROC curve (AUC), sensitivity, specificity, precision, and F1 score of the model are 88.15%, 91.23%, 81.41%, 90.73%, 78.29%, and 79.69%, respectively. The superior performance of our approach, compared to several existing models, highlights its potential to significantly enhance the understanding and classification of breast cancer.</p>","PeriodicalId":49401,"journal":{"name":"Ultrasonic Imaging","volume":" ","pages":"171-181"},"PeriodicalIF":2.5000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MVKD-Trans: A Multi-View Knowledge Distillation Vision Transformer Architecture for Breast Cancer Classification Based on Ultrasound Images.\",\"authors\":\"Dongchen Ling, Xiong Jiao\",\"doi\":\"10.1177/01617346251346060\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>Breast cancer is the leading cancer threatening women's health. In recent years, deep neural networks have outperformed traditional methods in terms of both accuracy and efficiency for breast cancer classification. However, most ultrasound-based breast cancer classification methods rely on single-perspective information, which may lead to higher misdiagnosis rates. In this study, we propose a multi-view knowledge distillation vision transformer architecture (MVKD-Trans) for the classification of benign and malignant breast tumors. We utilize multi-view ultrasound images of the same tumor to capture diverse features. Additionally, we employ a shuffle module for feature fusion, extracting channel and spatial dual-attention information to improve the model's representational capability. Given the limited computational capacity of ultrasound devices, we also utilize knowledge distillation (KD) techniques to compress the multi-view network into a single-view network. The results show that the accuracy, area under the ROC curve (AUC), sensitivity, specificity, precision, and F1 score of the model are 88.15%, 91.23%, 81.41%, 90.73%, 78.29%, and 79.69%, respectively. The superior performance of our approach, compared to several existing models, highlights its potential to significantly enhance the understanding and classification of breast cancer.</p>\",\"PeriodicalId\":49401,\"journal\":{\"name\":\"Ultrasonic Imaging\",\"volume\":\" \",\"pages\":\"171-181\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ultrasonic Imaging\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1177/01617346251346060\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/20 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ultrasonic Imaging","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1177/01617346251346060","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/20 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}
引用次数: 0

摘要

乳腺癌是威胁妇女健康的主要癌症。近年来,深度神经网络在乳腺癌分类的准确性和效率方面都优于传统方法。然而,大多数基于超声的乳腺癌分类方法依赖于单一视角的信息,这可能导致更高的误诊率。在这项研究中,我们提出了一个多视图知识蒸馏视觉转换器架构(MVKD-Trans)用于乳腺良恶性肿瘤的分类。我们利用同一肿瘤的多视点超声图像来捕捉不同的特征。此外,我们采用shuffle模块进行特征融合,提取通道和空间双注意信息,以提高模型的表征能力。鉴于超声设备的计算能力有限,我们还利用知识蒸馏(KD)技术将多视图网络压缩为单视图网络。结果表明,该模型的准确率为88.15%,ROC曲线下面积(AUC)为91.23%,灵敏度为81.41%,特异性为90.73%,精密度为78.29%,F1评分为79.69%。与几个现有的模型相比,我们的方法的优越性能突出了它在显著提高对乳腺癌的理解和分类方面的潜力。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
MVKD-Trans: A Multi-View Knowledge Distillation Vision Transformer Architecture for Breast Cancer Classification Based on Ultrasound Images.

Breast cancer is the leading cancer threatening women's health. In recent years, deep neural networks have outperformed traditional methods in terms of both accuracy and efficiency for breast cancer classification. However, most ultrasound-based breast cancer classification methods rely on single-perspective information, which may lead to higher misdiagnosis rates. In this study, we propose a multi-view knowledge distillation vision transformer architecture (MVKD-Trans) for the classification of benign and malignant breast tumors. We utilize multi-view ultrasound images of the same tumor to capture diverse features. Additionally, we employ a shuffle module for feature fusion, extracting channel and spatial dual-attention information to improve the model's representational capability. Given the limited computational capacity of ultrasound devices, we also utilize knowledge distillation (KD) techniques to compress the multi-view network into a single-view network. The results show that the accuracy, area under the ROC curve (AUC), sensitivity, specificity, precision, and F1 score of the model are 88.15%, 91.23%, 81.41%, 90.73%, 78.29%, and 79.69%, respectively. The superior performance of our approach, compared to several existing models, highlights its potential to significantly enhance the understanding and classification of breast cancer.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Ultrasonic Imaging
Ultrasonic Imaging 医学-工程:生物医学
CiteScore
5.10
自引率
8.70%
发文量
15
审稿时长
>12 weeks
期刊介绍: Ultrasonic Imaging provides rapid publication for original and exceptional papers concerned with the development and application of ultrasonic-imaging technology. Ultrasonic Imaging publishes articles in the following areas: theoretical and experimental aspects of advanced methods and instrumentation for imaging
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信