Allan G. Duah , Roland V. Bumbuc , H. Ibrahim Korkmaz , Rory Wilding , Vivek M. Sheraton
{"title":"FedDeepInsight—A privacy-first federated learning architecture for medical data","authors":"Allan G. Duah , Roland V. Bumbuc , H. Ibrahim Korkmaz , Rory Wilding , Vivek M. Sheraton","doi":"10.1016/j.imu.2025.101691","DOIUrl":null,"url":null,"abstract":"<div><div>Medical data, hospital patient-specific data, are highly sensitive to privacy and are essential for research in the biomedical field. Although there are many new approaches to creating databases that ensure data must be FAIR and GDPR compliant, these approaches require the intervention of secured data handlers. To address this gap, this study investigates and designs a standardized Federated Learning (FL) architecture for medical data. Specifically, we examine traditional and novel methods for preprocessing, handling, and utilizing such data in FL. We develop “FedDeepInsight”, a novel data transformation framework that enables tabular data augmentation and transformation into image data prior to neural network training and FL. Additionally, we analyze how the type of dataset influences the performance of federated learning algorithms and machine learning models in terms of accuracy and efficiency. Our results indicate that FedAvg is the most reliable aggregation algorithm, providing superior accuracy, stability, and convergence, and FedYogi is also viable with well-tuned hyperparameters. For privacy protection, we recommend Differential Privacy (DP) with calibrated noise multipliers and initial upper and lower bounds for stability. Ultimately, we emerge as a promising solution for secure, privacy-preserving federation learning in healthcare.</div></div>","PeriodicalId":13953,"journal":{"name":"Informatics in Medicine Unlocked","volume":"58 ","pages":"Article 101691"},"PeriodicalIF":0.0000,"publicationDate":"2025-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Informatics in Medicine Unlocked","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2352914825000802","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"Medicine","Score":null,"Total":0}
引用次数: 0
Abstract
Medical data, hospital patient-specific data, are highly sensitive to privacy and are essential for research in the biomedical field. Although there are many new approaches to creating databases that ensure data must be FAIR and GDPR compliant, these approaches require the intervention of secured data handlers. To address this gap, this study investigates and designs a standardized Federated Learning (FL) architecture for medical data. Specifically, we examine traditional and novel methods for preprocessing, handling, and utilizing such data in FL. We develop “FedDeepInsight”, a novel data transformation framework that enables tabular data augmentation and transformation into image data prior to neural network training and FL. Additionally, we analyze how the type of dataset influences the performance of federated learning algorithms and machine learning models in terms of accuracy and efficiency. Our results indicate that FedAvg is the most reliable aggregation algorithm, providing superior accuracy, stability, and convergence, and FedYogi is also viable with well-tuned hyperparameters. For privacy protection, we recommend Differential Privacy (DP) with calibrated noise multipliers and initial upper and lower bounds for stability. Ultimately, we emerge as a promising solution for secure, privacy-preserving federation learning in healthcare.
期刊介绍:
Informatics in Medicine Unlocked (IMU) is an international gold open access journal covering a broad spectrum of topics within medical informatics, including (but not limited to) papers focusing on imaging, pathology, teledermatology, public health, ophthalmological, nursing and translational medicine informatics. The full papers that are published in the journal are accessible to all who visit the website.