Wentai Zhang, Xueyang Wu, He Wang, Ruopei Wu, Congcong Deng, Qian Xu, Xiaohai Liu, Xuexue Bai, Shuangjian Yang, Xiaoxu Li, Ming Feng, Qiang Yang, Renzhi Wang
{"title":"用于预测肢端肥大症患者术后缓解的联合学习:一项多中心研究","authors":"Wentai Zhang, Xueyang Wu, He Wang, Ruopei Wu, Congcong Deng, Qian Xu, Xiaohai Liu, Xuexue Bai, Shuangjian Yang, Xiaoxu Li, Ming Feng, Qiang Yang, Renzhi Wang","doi":"10.1016/j.wneu.2024.10.091","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Decentralized federated learning (DFL) may serve as a useful framework for machine learning (ML) tasks in multicentered studies, maximizing the use of clinical data without data sharing. We aim to propose the first workflow of DFL for ML tasks in multicentered studies, which can be as powerful as those using centralized data.</p><p><strong>Methods: </strong>A DFL workflow was developed with four steps: registration, local computation, model update, and inspection. A total of 598 participants with acromegaly from PUMCH, and 120 participants from XWH were enrolled. The cohort from PUMCH was further split into five centers. Nine clinical features were incorporated into ML-based models trained based on four algorithms: LR, GBDT, SVM, and DNN. The area under the curve (AUC) of receiver operating characteristic curves was used to evaluate the performance of the models.</p><p><strong>Results: </strong>Models trained based on DFL workflow performed better than most models in LR (P<0.05), all models in DNN, SVM, and GBDT (P<0.05). Models trained on DFL workflow performed as powerful as models trained on centralized data in LR, DNN, and SVM (P>0.05).</p><p><strong>Conclusions: </strong>We demonstrate that the DFL workflow without data sharing should be a more appropriate method in ML tasks in multicentered studies. And the DFL workflow should be further exploited in clinical researches in other departments and it can encourage and facilitate multicentered studies.</p>","PeriodicalId":1,"journal":{"name":"Accounts of Chemical Research","volume":null,"pages":null},"PeriodicalIF":16.4000,"publicationDate":"2024-10-30","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Federated Learning for Predicting Postoperative Remission of Patients with Acromegaly: A Multicentered study.\",\"authors\":\"Wentai Zhang, Xueyang Wu, He Wang, Ruopei Wu, Congcong Deng, Qian Xu, Xiaohai Liu, Xuexue Bai, Shuangjian Yang, Xiaoxu Li, Ming Feng, Qiang Yang, Renzhi Wang\",\"doi\":\"10.1016/j.wneu.2024.10.091\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Background: </strong>Decentralized federated learning (DFL) may serve as a useful framework for machine learning (ML) tasks in multicentered studies, maximizing the use of clinical data without data sharing. We aim to propose the first workflow of DFL for ML tasks in multicentered studies, which can be as powerful as those using centralized data.</p><p><strong>Methods: </strong>A DFL workflow was developed with four steps: registration, local computation, model update, and inspection. A total of 598 participants with acromegaly from PUMCH, and 120 participants from XWH were enrolled. The cohort from PUMCH was further split into five centers. Nine clinical features were incorporated into ML-based models trained based on four algorithms: LR, GBDT, SVM, and DNN. The area under the curve (AUC) of receiver operating characteristic curves was used to evaluate the performance of the models.</p><p><strong>Results: </strong>Models trained based on DFL workflow performed better than most models in LR (P<0.05), all models in DNN, SVM, and GBDT (P<0.05). Models trained on DFL workflow performed as powerful as models trained on centralized data in LR, DNN, and SVM (P>0.05).</p><p><strong>Conclusions: </strong>We demonstrate that the DFL workflow without data sharing should be a more appropriate method in ML tasks in multicentered studies. And the DFL workflow should be further exploited in clinical researches in other departments and it can encourage and facilitate multicentered studies.</p>\",\"PeriodicalId\":1,\"journal\":{\"name\":\"Accounts of Chemical Research\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":16.4000,\"publicationDate\":\"2024-10-30\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Accounts of Chemical Research\",\"FirstCategoryId\":\"3\",\"ListUrlMain\":\"https://doi.org/10.1016/j.wneu.2024.10.091\",\"RegionNum\":1,\"RegionCategory\":\"化学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"CHEMISTRY, MULTIDISCIPLINARY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Accounts of Chemical Research","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1016/j.wneu.2024.10.091","RegionNum":1,"RegionCategory":"化学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CHEMISTRY, MULTIDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
摘要
背景:分散联合学习(DFL)可作为多中心研究中机器学习(ML)任务的有用框架,在不共享数据的情况下最大限度地利用临床数据。我们的目标是为多中心研究中的机器学习任务提出首个 DFL 工作流程,其功能与使用集中数据的工作流程一样强大:方法:开发的 DFL 工作流程包括四个步骤:注册、局部计算、模型更新和检查。共有 598 名来自 PUMCH 的肢端肥大症患者和 120 名来自 XWH 的患者参与了研究。来自 PUMCH 的队列进一步分为五个中心。九个临床特征被纳入基于四种算法训练的 ML 模型:LR、GBDT、SVM 和 DNN。接受者操作特征曲线的曲线下面积(AUC)用于评估模型的性能:结果:基于 DFL 工作流程训练的模型在 LR 中的表现优于大多数模型(P0.05):我们证明,在多中心研究的 ML 任务中,不共享数据的 DFL 工作流应该是一种更合适的方法。而且,DFL 工作流应在其他部门的临床研究中得到进一步利用,它可以鼓励和促进多中心研究。
Federated Learning for Predicting Postoperative Remission of Patients with Acromegaly: A Multicentered study.
Background: Decentralized federated learning (DFL) may serve as a useful framework for machine learning (ML) tasks in multicentered studies, maximizing the use of clinical data without data sharing. We aim to propose the first workflow of DFL for ML tasks in multicentered studies, which can be as powerful as those using centralized data.
Methods: A DFL workflow was developed with four steps: registration, local computation, model update, and inspection. A total of 598 participants with acromegaly from PUMCH, and 120 participants from XWH were enrolled. The cohort from PUMCH was further split into five centers. Nine clinical features were incorporated into ML-based models trained based on four algorithms: LR, GBDT, SVM, and DNN. The area under the curve (AUC) of receiver operating characteristic curves was used to evaluate the performance of the models.
Results: Models trained based on DFL workflow performed better than most models in LR (P<0.05), all models in DNN, SVM, and GBDT (P<0.05). Models trained on DFL workflow performed as powerful as models trained on centralized data in LR, DNN, and SVM (P>0.05).
Conclusions: We demonstrate that the DFL workflow without data sharing should be a more appropriate method in ML tasks in multicentered studies. And the DFL workflow should be further exploited in clinical researches in other departments and it can encourage and facilitate multicentered studies.
期刊介绍:
Accounts of Chemical Research presents short, concise and critical articles offering easy-to-read overviews of basic research and applications in all areas of chemistry and biochemistry. These short reviews focus on research from the author’s own laboratory and are designed to teach the reader about a research project. In addition, Accounts of Chemical Research publishes commentaries that give an informed opinion on a current research problem. Special Issues online are devoted to a single topic of unusual activity and significance.
Accounts of Chemical Research replaces the traditional article abstract with an article "Conspectus." These entries synopsize the research affording the reader a closer look at the content and significance of an article. Through this provision of a more detailed description of the article contents, the Conspectus enhances the article's discoverability by search engines and the exposure for the research.