{"title":"联邦超声影像联合学习在乳腺癌诊断中的应用","authors":"Tianpeng Deng;Chunwang Huang;Ming Cai;Yu Liu;Min Liu;Jiatai Lin;Zhenwei Shi;Bingchao Zhao;Jingqi Huang;Changhong Liang;Guoqiang Han;Zaiyi Liu;Ying Wang;Chu Han","doi":"10.1109/TMI.2025.3532474","DOIUrl":null,"url":null,"abstract":"Ultrasonography plays an essential role in breast cancer diagnosis. Current deep learning based studies train the models on either images or videos in a centralized learning manner, lacking consideration of joint benefits between two different modality models or the privacy issue of data centralization. In this study, we propose the first decentralized learning solution for joint learning with breast ultrasound video and image, called FedBCD. To enable the model to learn from images and videos simultaneously and seamlessly in client-level local training, we propose a Joint Ultrasound Video and Image Learning (JUVIL) model to bridge the dimension gap between video and image data by incorporating temporal and spatial adapters. The parameter-efficient design of JUVIL with trainable adapters and frozen backbone further reduces the computational cost and communication burden of federated learning, finally improving the overall efficiency. Moreover, considering conventional model-wise aggregation may lead to unstable federated training due to different modalities, data capacities in different clients, and different functionalities across layers. We further propose a Fisher information matrix (<underline>FI</u>M) guided <underline>L</u>ayer-wise <underline>A</u>ggregation method named FILA. By measuring layer-wise sensitivity with FIM, FILA assigns higher contributions to the clients with lower sensitivity, improving personalized performance during federated training. Extensive experiments on three image clients and one video client demonstrate the benefits of joint learning architecture, especially for the ones with small-scale data. FedBCD significantly outperforms nine federated learning methods on both video-based and image-based diagnoses, demonstrating the superiority and potential for clinical practice. Code is released at <uri>https://github.com/tianpeng-deng/FedBCD</uri>.","PeriodicalId":94033,"journal":{"name":"IEEE transactions on medical imaging","volume":"44 6","pages":"2395-2407"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-20","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"FedBCD: Federated Ultrasound Video and Image Joint Learning for Breast Cancer Diagnosis\",\"authors\":\"Tianpeng Deng;Chunwang Huang;Ming Cai;Yu Liu;Min Liu;Jiatai Lin;Zhenwei Shi;Bingchao Zhao;Jingqi Huang;Changhong Liang;Guoqiang Han;Zaiyi Liu;Ying Wang;Chu Han\",\"doi\":\"10.1109/TMI.2025.3532474\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Ultrasonography plays an essential role in breast cancer diagnosis. Current deep learning based studies train the models on either images or videos in a centralized learning manner, lacking consideration of joint benefits between two different modality models or the privacy issue of data centralization. In this study, we propose the first decentralized learning solution for joint learning with breast ultrasound video and image, called FedBCD. 
To enable the model to learn from images and videos simultaneously and seamlessly in client-level local training, we propose a Joint Ultrasound Video and Image Learning (JUVIL) model to bridge the dimension gap between video and image data by incorporating temporal and spatial adapters. The parameter-efficient design of JUVIL with trainable adapters and frozen backbone further reduces the computational cost and communication burden of federated learning, finally improving the overall efficiency. Moreover, considering conventional model-wise aggregation may lead to unstable federated training due to different modalities, data capacities in different clients, and different functionalities across layers. We further propose a Fisher information matrix (<underline>FI</u>M) guided <underline>L</u>ayer-wise <underline>A</u>ggregation method named FILA. By measuring layer-wise sensitivity with FIM, FILA assigns higher contributions to the clients with lower sensitivity, improving personalized performance during federated training. Extensive experiments on three image clients and one video client demonstrate the benefits of joint learning architecture, especially for the ones with small-scale data. FedBCD significantly outperforms nine federated learning methods on both video-based and image-based diagnoses, demonstrating the superiority and potential for clinical practice. Code is released at <uri>https://github.com/tianpeng-deng/FedBCD</uri>.\",\"PeriodicalId\":94033,\"journal\":{\"name\":\"IEEE transactions on medical imaging\",\"volume\":\"44 6\",\"pages\":\"2395-2407\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-01-20\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE transactions on medical imaging\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10848115/\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE transactions on medical imaging","FirstCategoryId":"1085","ListUrlMain":"https://ieeexplore.ieee.org/document/10848115/","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
FedBCD: Federated Ultrasound Video and Image Joint Learning for Breast Cancer Diagnosis
Ultrasonography plays an essential role in breast cancer diagnosis. Current deep-learning-based studies train models on either images or videos in a centralized manner, without considering the joint benefits between the two modality-specific models or the privacy issues raised by data centralization. In this study, we propose FedBCD, the first decentralized learning solution for joint learning from breast ultrasound videos and images. To enable the model to learn from images and videos simultaneously and seamlessly during client-level local training, we propose a Joint Ultrasound Video and Image Learning (JUVIL) model that bridges the dimension gap between video and image data by incorporating temporal and spatial adapters. The parameter-efficient design of JUVIL, with trainable adapters and a frozen backbone, further reduces the computational cost and communication burden of federated learning, improving overall efficiency. Moreover, conventional model-wise aggregation may lead to unstable federated training because of the different modalities, the varying data capacities across clients, and the different functionalities across layers. We therefore propose FILA, a Fisher Information Matrix (FIM) guided Layer-wise Aggregation method. By measuring layer-wise sensitivity with the FIM, FILA assigns higher contributions to the clients with lower sensitivity, improving personalized performance during federated training. Extensive experiments on three image clients and one video client demonstrate the benefits of the joint learning architecture, especially for clients with small-scale data. FedBCD significantly outperforms nine federated learning methods on both video-based and image-based diagnosis, demonstrating its superiority and potential for clinical practice. Code is released at https://github.com/tianpeng-deng/FedBCD.
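The abstract only names JUVIL's ingredients: a shared backbone that is kept frozen, plus trainable spatial and temporal adapters that let one model consume both images and videos. The following is a minimal sketch of that general idea under stated assumptions (a ResNet-18 backbone, bottleneck spatial adapters, a depth-wise temporal convolution); every class and parameter name here is hypothetical and not taken from the paper.

```python
# Illustrative sketch only: a frozen 2-D backbone shared by image and video
# clients, with small trainable adapters, loosely following the abstract's
# description of JUVIL. Module names and the bottleneck/conv designs are
# assumptions, not the authors' architecture.
import torch
import torch.nn as nn
import torchvision.models as models


class SpatialAdapter(nn.Module):
    """Trainable bottleneck adapter with a residual connection."""
    def __init__(self, dim, reduction=4):
        super().__init__()
        self.down = nn.Linear(dim, dim // reduction)
        self.up = nn.Linear(dim // reduction, dim)
        self.act = nn.GELU()

    def forward(self, x):
        return x + self.up(self.act(self.down(x)))


class TemporalAdapter(nn.Module):
    """Trainable temporal adapter: residual depth-wise 1-D conv over frames."""
    def __init__(self, dim, kernel_size=3):
        super().__init__()
        self.conv = nn.Conv1d(dim, dim, kernel_size,
                              padding=kernel_size // 2, groups=dim)

    def forward(self, x):                      # x: (B, T, D)
        return x + self.conv(x.transpose(1, 2)).transpose(1, 2)


class JointUltrasoundModel(nn.Module):
    """Frozen backbone; only adapters and the classification head are trained."""
    def __init__(self, num_classes=2, feat_dim=512):
        super().__init__()
        self.backbone = models.resnet18(weights=None)
        self.backbone.fc = nn.Identity()       # expose 512-d features
        for p in self.backbone.parameters():
            p.requires_grad = False            # frozen: parameter-efficient
        self.spatial_adapter = SpatialAdapter(feat_dim)
        self.temporal_adapter = TemporalAdapter(feat_dim)
        self.head = nn.Linear(feat_dim, num_classes)

    def forward(self, x):
        # Images arrive as (B, C, H, W); videos as (B, T, C, H, W).
        is_video = x.dim() == 5
        if is_video:
            b, t = x.shape[:2]
            x = x.flatten(0, 1)                # treat frames as a batch of images
        feats = self.spatial_adapter(self.backbone(x))
        if is_video:
            feats = self.temporal_adapter(feats.view(b, t, -1)).mean(dim=1)
        return self.head(feats)
```

Because only the adapters and the head carry gradients in a design like this, a federated client would only need to train and exchange those few parameters each round, which is the mechanism behind the reduced computational cost and communication burden claimed in the abstract.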
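FILA is described only as measuring layer-wise sensitivity with the Fisher information matrix and assigning higher aggregation contributions to clients with lower sensitivity. The sketch below is one plausible reading under stated assumptions: a diagonal empirical-Fisher estimate of sensitivity (mean squared gradient per parameter tensor, with a log-likelihood loss such as cross-entropy) and normalized inverse-sensitivity weights per layer. The function names and the exact weighting rule are illustrative assumptions rather than the paper's formulation.

```python
# Illustrative sketch only: FIM-guided layer-wise aggregation in the spirit of
# FILA. The diagonal empirical-Fisher estimate and inverse-sensitivity weights
# are assumptions; the paper may define sensitivity and weights differently.
import torch


def layer_sensitivity(model, loss_fn, batch):
    """Per-parameter-tensor sensitivity: mean squared gradient of the loss,
    i.e. a diagonal empirical-Fisher approximation for a log-likelihood loss."""
    model.zero_grad()
    inputs, targets = batch
    loss_fn(model(inputs), targets).backward()
    return {name: p.grad.detach().pow(2).mean().item()
            for name, p in model.named_parameters() if p.grad is not None}


def fila_aggregate(client_states, client_sensitivities, eps=1e-8):
    """Aggregate layer by layer, weighting each client in inverse proportion to
    its sensitivity for that layer (lower sensitivity -> larger contribution).

    client_states: list of dicts mapping parameter name -> tensor (e.g. only
    the trainable adapter/head parameters that clients actually communicate).
    client_sensitivities: list of dicts from layer_sensitivity(), same keys.
    """
    aggregated = {}
    for name in client_states[0]:
        inv = torch.tensor([1.0 / (s[name] + eps) for s in client_sensitivities])
        weights = inv / inv.sum()
        aggregated[name] = sum(w * state[name]
                               for w, state in zip(weights, client_states))
    return aggregated
```

The abstract also mentions improved personalized performance, which suggests the weights may be computed per client rather than once globally; the sketch keeps a single server-side aggregation for brevity.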