{"title":"Secure and privacy-preserving surgical instrument segmentation in minimally invasive surgeries using federated differential privacy approach","authors":"Bakiya. K, Nickolas Savarimuthu","doi":"10.1016/j.compmedimag.2025.102637","DOIUrl":null,"url":null,"abstract":"<div><div>Accurate segmentation of surgical instruments is essential for practical intraoperative guidance in robot-assisted procedures, contributing to improved surgical navigation and enhanced patient safety. Federated Learning is a decentralized approach that enables collaborative model training across institutions without sharing raw data, thereby ensuring data privacy, which is particularly crucial in healthcare. This paper introduces the Federated Averaging algorithm to address the quantity skew by aggregating client model weights centrally. In parallel, the Federated Differential Privacy algorithm was implemented to enhance data privacy by introducing controlled noise to gradients at the client level. For segmentation, we evaluated a U-Net, a Multi-head Attention U-Net for small instruments, and a Squeeze-and-Excitation U-Net for overall accuracy. These models were benchmarked on the datasets of the Kvasir-Instrument (gastrointestinal endoscopy) and RoboTool (20 diverse surgical procedures). Quantitative evaluations using FedAvg, FedSGD, and FedDP across U-Net variants demonstrated that SE-UNet with FedDP at 60 epochs yielded the best results with Dice Score: 99.00 % ± 0.01, Accuracy: 99.68 % ± 0.25, and mIoU: 98.05 % ± 0.01, highlighting superior generalization and convergence stability. Across all architectures, FedDP consistently outperformed FedAvg and FedSGD, with accuracy improvements ranging from 0.3 % to 2.0 % and mIoU gains up to 6.8 %, especially pronounced in SE-UNet. Extending training from 40 to 60 epochs enhanced model stability, with standard deviations reducing from as high as ±3.28 % to as low as ±0.01 %. Statistical analysis confirmed this benefit, with 83.3 % of configurations showing improved p-values, and overall significance rates increasing from 84.4 % to 91.1 %. SE-UNet exhibited the most consistent and robust performance improvements, with an average p-value reduction of 40.7 %, affirming its reliability under federated settings.</div></div>","PeriodicalId":50631,"journal":{"name":"Computerized Medical Imaging and Graphics","volume":"125 ","pages":"Article 102637"},"PeriodicalIF":4.9000,"publicationDate":"2025-08-22","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Computerized Medical Imaging and Graphics","FirstCategoryId":"5","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S0895611125001466","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ENGINEERING, BIOMEDICAL","Score":null,"Total":0}
引用次数: 0
Abstract
Accurate segmentation of surgical instruments is essential for practical intraoperative guidance in robot-assisted procedures, contributing to improved surgical navigation and enhanced patient safety. Federated Learning is a decentralized approach that enables collaborative model training across institutions without sharing raw data, thereby ensuring data privacy, which is particularly crucial in healthcare. This paper introduces the Federated Averaging algorithm to address the quantity skew by aggregating client model weights centrally. In parallel, the Federated Differential Privacy algorithm was implemented to enhance data privacy by introducing controlled noise to gradients at the client level. For segmentation, we evaluated a U-Net, a Multi-head Attention U-Net for small instruments, and a Squeeze-and-Excitation U-Net for overall accuracy. These models were benchmarked on the datasets of the Kvasir-Instrument (gastrointestinal endoscopy) and RoboTool (20 diverse surgical procedures). Quantitative evaluations using FedAvg, FedSGD, and FedDP across U-Net variants demonstrated that SE-UNet with FedDP at 60 epochs yielded the best results with Dice Score: 99.00 % ± 0.01, Accuracy: 99.68 % ± 0.25, and mIoU: 98.05 % ± 0.01, highlighting superior generalization and convergence stability. Across all architectures, FedDP consistently outperformed FedAvg and FedSGD, with accuracy improvements ranging from 0.3 % to 2.0 % and mIoU gains up to 6.8 %, especially pronounced in SE-UNet. Extending training from 40 to 60 epochs enhanced model stability, with standard deviations reducing from as high as ±3.28 % to as low as ±0.01 %. Statistical analysis confirmed this benefit, with 83.3 % of configurations showing improved p-values, and overall significance rates increasing from 84.4 % to 91.1 %. SE-UNet exhibited the most consistent and robust performance improvements, with an average p-value reduction of 40.7 %, affirming its reliability under federated settings.
期刊介绍:
The purpose of the journal Computerized Medical Imaging and Graphics is to act as a source for the exchange of research results concerning algorithmic advances, development, and application of digital imaging in disease detection, diagnosis, intervention, prevention, precision medicine, and population health. Included in the journal will be articles on novel computerized imaging or visualization techniques, including artificial intelligence and machine learning, augmented reality for surgical planning and guidance, big biomedical data visualization, computer-aided diagnosis, computerized-robotic surgery, image-guided therapy, imaging scanning and reconstruction, mobile and tele-imaging, radiomics, and imaging integration and modeling with other information relevant to digital health. The types of biomedical imaging include: magnetic resonance, computed tomography, ultrasound, nuclear medicine, X-ray, microwave, optical and multi-photon microscopy, video and sensory imaging, and the convergence of biomedical images with other non-imaging datasets.