MVKD-Trans：一种基于超声图像的乳腺癌分类的多视图知识蒸馏视觉转换架构。

IF 2.5 4区医学 Q1 ACOUSTICS

Ultrasonic Imaging Pub Date : 2025-09-01 Epub Date: 2025-06-20 DOI:10.1177/01617346251346060

Dongchen Ling, Xiong Jiao

{"title":"MVKD-Trans：一种基于超声图像的乳腺癌分类的多视图知识蒸馏视觉转换架构。","authors":"Dongchen Ling, Xiong Jiao","doi":"10.1177/01617346251346060","DOIUrl":null,"url":null,"abstract":"Breast cancer is the leading cancer threatening women's health. In recent years, deep neural networks have outperformed traditional methods in terms of both accuracy and efficiency for breast cancer classification. However, most ultrasound-based breast cancer classification methods rely on single-perspective information, which may lead to higher misdiagnosis rates. In this study, we propose a multi-view knowledge distillation vision transformer architecture (MVKD-Trans) for the classification of benign and malignant breast tumors. We utilize multi-view ultrasound images of the same tumor to capture diverse features. Additionally, we employ a shuffle module for feature fusion, extracting channel and spatial dual-attention information to improve the model's representational capability. Given the limited computational capacity of ultrasound devices, we also utilize knowledge distillation (KD) techniques to compress the multi-view network into a single-view network. The results show that the accuracy, area under the ROC curve (AUC), sensitivity, specificity, precision, and F1 score of the model are 88.15%, 91.23%, 81.41%, 90.73%, 78.29%, and 79.69%, respectively. The superior performance of our approach, compared to several existing models, highlights its potential to significantly enhance the understanding and classification of breast cancer.","PeriodicalId":49401,"journal":{"name":"Ultrasonic Imaging","volume":" ","pages":"171-181"},"PeriodicalIF":2.5000,"publicationDate":"2025-09-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"MVKD-Trans: A Multi-View Knowledge Distillation Vision Transformer Architecture for Breast Cancer Classification Based on Ultrasound Images.\",\"authors\":\"Dongchen Ling, Xiong Jiao\",\"doi\":\"10.1177/01617346251346060\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Breast cancer is the leading cancer threatening women's health. In recent years, deep neural networks have outperformed traditional methods in terms of both accuracy and efficiency for breast cancer classification. However, most ultrasound-based breast cancer classification methods rely on single-perspective information, which may lead to higher misdiagnosis rates. In this study, we propose a multi-view knowledge distillation vision transformer architecture (MVKD-Trans) for the classification of benign and malignant breast tumors. We utilize multi-view ultrasound images of the same tumor to capture diverse features. Additionally, we employ a shuffle module for feature fusion, extracting channel and spatial dual-attention information to improve the model's representational capability. Given the limited computational capacity of ultrasound devices, we also utilize knowledge distillation (KD) techniques to compress the multi-view network into a single-view network. The results show that the accuracy, area under the ROC curve (AUC), sensitivity, specificity, precision, and F1 score of the model are 88.15%, 91.23%, 81.41%, 90.73%, 78.29%, and 79.69%, respectively. The superior performance of our approach, compared to several existing models, highlights its potential to significantly enhance the understanding and classification of breast cancer.\",\"PeriodicalId\":49401,\"journal\":{\"name\":\"Ultrasonic Imaging\",\"volume\":\" \",\"pages\":\"171-181\"},\"PeriodicalIF\":2.5000,\"publicationDate\":\"2025-09-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Ultrasonic Imaging\",\"FirstCategoryId\":\"5\",\"ListUrlMain\":\"https://doi.org/10.1177/01617346251346060\",\"RegionNum\":4,\"RegionCategory\":\"医学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/20 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q1\",\"JCRName\":\"ACOUSTICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Ultrasonic Imaging","FirstCategoryId":"5","ListUrlMain":"https://doi.org/10.1177/01617346251346060","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/20 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ACOUSTICS","Score":null,"Total":0}

引用次数: 0

摘要

乳腺癌是威胁妇女健康的主要癌症。近年来，深度神经网络在乳腺癌分类的准确性和效率方面都优于传统方法。然而，大多数基于超声的乳腺癌分类方法依赖于单一视角的信息，这可能导致更高的误诊率。在这项研究中，我们提出了一个多视图知识蒸馏视觉转换器架构（MVKD-Trans）用于乳腺良恶性肿瘤的分类。我们利用同一肿瘤的多视点超声图像来捕捉不同的特征。此外，我们采用shuffle模块进行特征融合，提取通道和空间双注意信息，以提高模型的表征能力。鉴于超声设备的计算能力有限，我们还利用知识蒸馏（KD）技术将多视图网络压缩为单视图网络。结果表明，该模型的准确率为88.15%，ROC曲线下面积（AUC）为91.23%，灵敏度为81.41%，特异性为90.73%，精密度为78.29%，F1评分为79.69%。与几个现有的模型相比，我们的方法的优越性能突出了它在显著提高对乳腺癌的理解和分类方面的潜力。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

MVKD-Trans: A Multi-View Knowledge Distillation Vision Transformer Architecture for Breast Cancer Classification Based on Ultrasound Images.

Breast cancer is the leading cancer threatening women's health. In recent years, deep neural networks have outperformed traditional methods in terms of both accuracy and efficiency for breast cancer classification. However, most ultrasound-based breast cancer classification methods rely on single-perspective information, which may lead to higher misdiagnosis rates. In this study, we propose a multi-view knowledge distillation vision transformer architecture (MVKD-Trans) for the classification of benign and malignant breast tumors. We utilize multi-view ultrasound images of the same tumor to capture diverse features. Additionally, we employ a shuffle module for feature fusion, extracting channel and spatial dual-attention information to improve the model's representational capability. Given the limited computational capacity of ultrasound devices, we also utilize knowledge distillation (KD) techniques to compress the multi-view network into a single-view network. The results show that the accuracy, area under the ROC curve (AUC), sensitivity, specificity, precision, and F1 score of the model are 88.15%, 91.23%, 81.41%, 90.73%, 78.29%, and 79.69%, respectively. The superior performance of our approach, compared to several existing models, highlights its potential to significantly enhance the understanding and classification of breast cancer.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Ultrasonic Imaging 医学-工程：生物医学

CiteScore

5.10

自引率

8.70%

发文量

审稿时长

>12 weeks

期刊介绍： Ultrasonic Imaging provides rapid publication for original and exceptional papers concerned with the development and application of ultrasonic-imaging technology. Ultrasonic Imaging publishes articles in the following areas: theoretical and experimental aspects of advanced methods and instrumentation for imaging