Leveraging federated learning for privacy-preserving analysis of multi-institutional electronic health records in rare disease research

Karthik Meduri , Geeta Sandeep Nadella , Akhila Reddy Yadulla , Vinay Kumar Kasula , Mohan Harish Maturi , Steven Brown , Snehal Satish , Hari Gonaygunta
{"title":"Leveraging federated learning for privacy-preserving analysis of multi-institutional electronic health records in rare disease research","authors":"Karthik Meduri ,&nbsp;Geeta Sandeep Nadella ,&nbsp;Akhila Reddy Yadulla ,&nbsp;Vinay Kumar Kasula ,&nbsp;Mohan Harish Maturi ,&nbsp;Steven Brown ,&nbsp;Snehal Satish ,&nbsp;Hari Gonaygunta","doi":"10.1016/j.ject.2024.11.001","DOIUrl":null,"url":null,"abstract":"<div><div>This research announces that the fresh federated learning structure is designed to enhance the privacy-preserving analysis of electronic health records (EHRs), and multiple institutions in this framework permit secure collaboration among institutions, allowing them to train machine-learning replicas without directly sharing patient data. We implemented and evaluated numerous machine-learning models to forecast patient treatment needs, including Logistic Regression, Decision-Tree-Classifiers, Support-Vectors-Classifiers, Random-Forests, and Stacking-Classifiers. The Random Forest classifier achieved the best performance with an accuracy of 90 % and an F1 score of 80 %, demonstrating that it handled complex and imbalanced datasets. This FL-based approach not only complies with privacy regulations such as HIPAA and GDPR but also overcomes significant challenges in data sharing, making it ideal for rare disease research. By enabling secure data aggregation across institutions, the framework significantly enhances the ability to study rare diseases and accelerates the discovery of new treatments. Future directions include extending this framework to other areas of healthcare and incorporating advanced machine-learning techniques to enhance its capabilities further. This research sets the new standard for secure and collaborative healthcare data analysis and promotes innovation and ethical practices in rare disease research.</div></div>","PeriodicalId":100776,"journal":{"name":"Journal of Economy and Technology","volume":"3 ","pages":"Pages 177-189"},"PeriodicalIF":0.0000,"publicationDate":"2024-11-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Economy and Technology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2949948824000544","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

This research announces that the fresh federated learning structure is designed to enhance the privacy-preserving analysis of electronic health records (EHRs), and multiple institutions in this framework permit secure collaboration among institutions, allowing them to train machine-learning replicas without directly sharing patient data. We implemented and evaluated numerous machine-learning models to forecast patient treatment needs, including Logistic Regression, Decision-Tree-Classifiers, Support-Vectors-Classifiers, Random-Forests, and Stacking-Classifiers. The Random Forest classifier achieved the best performance with an accuracy of 90 % and an F1 score of 80 %, demonstrating that it handled complex and imbalanced datasets. This FL-based approach not only complies with privacy regulations such as HIPAA and GDPR but also overcomes significant challenges in data sharing, making it ideal for rare disease research. By enabling secure data aggregation across institutions, the framework significantly enhances the ability to study rare diseases and accelerates the discovery of new treatments. Future directions include extending this framework to other areas of healthcare and incorporating advanced machine-learning techniques to enhance its capabilities further. This research sets the new standard for secure and collaborative healthcare data analysis and promotes innovation and ethical practices in rare disease research.
利用联邦学习对罕见疾病研究中的多机构电子健康记录进行隐私保护分析
这项研究宣布,新的联邦学习结构旨在增强电子健康记录(EHRs)的隐私保护分析,并且该框架中的多个机构允许机构之间的安全协作,允许他们在不直接共享患者数据的情况下训练机器学习副本。我们实施并评估了许多机器学习模型来预测患者的治疗需求,包括逻辑回归、决策树分类器、支持向量分类器、随机森林和堆叠分类器。随机森林分类器以90 %的准确率和80 %的F1分数取得了最好的性能,表明它可以处理复杂和不平衡的数据集。这种基于fl的方法不仅符合HIPAA和GDPR等隐私法规,而且克服了数据共享方面的重大挑战,使其成为罕见疾病研究的理想选择。通过实现跨机构的安全数据聚合,该框架大大提高了研究罕见疾病的能力,并加速了新疗法的发现。未来的方向包括将该框架扩展到医疗保健的其他领域,并结合先进的机器学习技术来进一步增强其能力。这项研究为安全和协作医疗数据分析设定了新标准,并促进了罕见病研究的创新和伦理实践。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信