Siqi Li, Pinyan Liu, G. G. Nascimento, Xinru Wang, F. Leite, Bibhas Chakraborty, Chuan Hong, Yilin Ning, F. Xie, Zhen Ling Teo, D. Ting, Hamed Haddadi, M. Ong, Marco Aur'elio Peres, Nan Liu
{"title":"用于电子健康记录和结构化医疗数据的联邦和分布式学习应用程序:范围审查","authors":"Siqi Li, Pinyan Liu, G. G. Nascimento, Xinru Wang, F. Leite, Bibhas Chakraborty, Chuan Hong, Yilin Ning, F. Xie, Zhen Ling Teo, D. Ting, Hamed Haddadi, M. Ong, Marco Aur'elio Peres, Nan Liu","doi":"10.48550/arXiv.2304.07310","DOIUrl":null,"url":null,"abstract":"OBJECTIVES\nFederated learning (FL) has gained popularity in clinical research in recent years to facilitate privacy-preserving collaboration. Structured data, one of the most prevalent forms of clinical data, has experienced significant growth in volume concurrently, notably with the widespread adoption of electronic health records in clinical practice. This review examines FL applications on structured medical data, identifies contemporary limitations, and discusses potential innovations.\n\n\nMATERIALS AND METHODS\nWe searched 5 databases, SCOPUS, MEDLINE, Web of Science, Embase, and CINAHL, to identify articles that applied FL to structured medical data and reported results following the PRISMA guidelines. Each selected publication was evaluated from 3 primary perspectives, including data quality, modeling strategies, and FL frameworks.\n\n\nRESULTS\nOut of the 1193 papers screened, 34 met the inclusion criteria, with each article consisting of one or more studies that used FL to handle structured clinical/medical data. Of these, 24 utilized data acquired from electronic health records, with clinical predictions and association studies being the most common clinical research tasks that FL was applied to. Only one article exclusively explored the vertical FL setting, while the remaining 33 explored the horizontal FL setting, with only 14 discussing comparisons between single-site (local) and FL (global) analysis.\n\n\nCONCLUSIONS\nThe existing FL applications on structured medical data lack sufficient evaluations of clinically meaningful benefits, particularly when compared to single-site analyses. Therefore, it is crucial for future FL applications to prioritize clinical motivations and develop designs and methodologies that can effectively support and aid clinical practice and research.","PeriodicalId":236137,"journal":{"name":"Journal of the American Medical Informatics Association : JAMIA","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-04-14","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Federated and distributed learning applications for electronic health records and structured medical data: A scoping review\",\"authors\":\"Siqi Li, Pinyan Liu, G. G. Nascimento, Xinru Wang, F. Leite, Bibhas Chakraborty, Chuan Hong, Yilin Ning, F. Xie, Zhen Ling Teo, D. Ting, Hamed Haddadi, M. Ong, Marco Aur'elio Peres, Nan Liu\",\"doi\":\"10.48550/arXiv.2304.07310\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"OBJECTIVES\\nFederated learning (FL) has gained popularity in clinical research in recent years to facilitate privacy-preserving collaboration. Structured data, one of the most prevalent forms of clinical data, has experienced significant growth in volume concurrently, notably with the widespread adoption of electronic health records in clinical practice. This review examines FL applications on structured medical data, identifies contemporary limitations, and discusses potential innovations.\\n\\n\\nMATERIALS AND METHODS\\nWe searched 5 databases, SCOPUS, MEDLINE, Web of Science, Embase, and CINAHL, to identify articles that applied FL to structured medical data and reported results following the PRISMA guidelines. Each selected publication was evaluated from 3 primary perspectives, including data quality, modeling strategies, and FL frameworks.\\n\\n\\nRESULTS\\nOut of the 1193 papers screened, 34 met the inclusion criteria, with each article consisting of one or more studies that used FL to handle structured clinical/medical data. Of these, 24 utilized data acquired from electronic health records, with clinical predictions and association studies being the most common clinical research tasks that FL was applied to. Only one article exclusively explored the vertical FL setting, while the remaining 33 explored the horizontal FL setting, with only 14 discussing comparisons between single-site (local) and FL (global) analysis.\\n\\n\\nCONCLUSIONS\\nThe existing FL applications on structured medical data lack sufficient evaluations of clinically meaningful benefits, particularly when compared to single-site analyses. Therefore, it is crucial for future FL applications to prioritize clinical motivations and develop designs and methodologies that can effectively support and aid clinical practice and research.\",\"PeriodicalId\":236137,\"journal\":{\"name\":\"Journal of the American Medical Informatics Association : JAMIA\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-04-14\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Journal of the American Medical Informatics Association : JAMIA\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.48550/arXiv.2304.07310\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the American Medical Informatics Association : JAMIA","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2304.07310","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
摘要
目的联邦学习(FL)近年来在临床研究中越来越受欢迎,以促进隐私保护合作。结构化数据是最普遍的临床数据形式之一,随着电子健康记录在临床实践中的广泛采用,其数量也出现了显著增长。本文综述了FL在结构化医疗数据中的应用,确定了当前的局限性,并讨论了潜在的创新。材料和方法我们检索了5个数据库,SCOPUS, MEDLINE, Web of Science, Embase和CINAHL,以确定将FL应用于结构化医疗数据并按照PRISMA指南报告结果的文章。每个选定的出版物从3个主要角度进行评估,包括数据质量、建模策略和FL框架。结果在筛选的1193篇论文中,34篇符合纳入标准,每篇文章由一项或多项使用FL处理结构化临床/医学数据的研究组成。其中,24项利用了从电子健康记录中获得的数据,其中临床预测和关联研究是FL应用的最常见的临床研究任务。只有一篇文章专门探讨了垂直FL设置,而其余33篇文章探讨了水平FL设置,只有14篇文章讨论了单站点(本地)和FL(全球)分析的比较。结论现有的FL在结构化医学数据上的应用缺乏对临床有意义的益处的充分评估,特别是与单点分析相比。因此,对于未来的FL应用来说,优先考虑临床动机,开发能够有效支持和帮助临床实践和研究的设计和方法至关重要。
Federated and distributed learning applications for electronic health records and structured medical data: A scoping review
OBJECTIVES
Federated learning (FL) has gained popularity in clinical research in recent years to facilitate privacy-preserving collaboration. Structured data, one of the most prevalent forms of clinical data, has experienced significant growth in volume concurrently, notably with the widespread adoption of electronic health records in clinical practice. This review examines FL applications on structured medical data, identifies contemporary limitations, and discusses potential innovations.
MATERIALS AND METHODS
We searched 5 databases, SCOPUS, MEDLINE, Web of Science, Embase, and CINAHL, to identify articles that applied FL to structured medical data and reported results following the PRISMA guidelines. Each selected publication was evaluated from 3 primary perspectives, including data quality, modeling strategies, and FL frameworks.
RESULTS
Out of the 1193 papers screened, 34 met the inclusion criteria, with each article consisting of one or more studies that used FL to handle structured clinical/medical data. Of these, 24 utilized data acquired from electronic health records, with clinical predictions and association studies being the most common clinical research tasks that FL was applied to. Only one article exclusively explored the vertical FL setting, while the remaining 33 explored the horizontal FL setting, with only 14 discussing comparisons between single-site (local) and FL (global) analysis.
CONCLUSIONS
The existing FL applications on structured medical data lack sufficient evaluations of clinically meaningful benefits, particularly when compared to single-site analyses. Therefore, it is crucial for future FL applications to prioritize clinical motivations and develop designs and methodologies that can effectively support and aid clinical practice and research.