线性联邦学习在肝癌放疗预后预测中的应用。

IF 2.8 Q2 ONCOLOGY

JCO Clinical Cancer Informatics Pub Date : 2025-07-01 Epub Date: 2025-06-30 DOI:10.1200/CCI-25-00074

Keyur D Shah, Harald Paganetti, Pablo Yepes, Theodore S Hong, Jennifer Y Wo, J Hannah Roberts, Eugene J Koay, Christian V Guthier, Ibrahim Chamseddine

{"title":"线性联邦学习在肝癌放疗预后预测中的应用。","authors":"Keyur D Shah, Harald Paganetti, Pablo Yepes, Theodore S Hong, Jennifer Y Wo, J Hannah Roberts, Eugene J Koay, Christian V Guthier, Ibrahim Chamseddine","doi":"10.1200/CCI-25-00074","DOIUrl":null,"url":null,"abstract":"Purpose: Federated learning (FL) enables multi-institutional predictive modeling without sharing raw patient data, preserving privacy while leveraging diverse data sets. This study evaluates the use of linear FL (LFL) as an interpretable approach to enhance sample size and generalizability in outcome prediction. As a proof of concept, we applied LFL to patients with hepatocellular carcinoma (HCC) undergoing external beam radiotherapy (EBRT), predicting hepatic toxicity and 1-year survival (SRVy1).Methods: Patient data from Massachusetts General Hospital (MGH) and Brigham and Women's Hospital (BWH) were used to train models, whereas an independent validation data set from MD Anderson Cancer Center assessed generalizability. Logistic regression was developed to predict hepatic toxicity and SRVy1 using key clinical features, including baseline albumin, bilirubin, Child-Pugh score, liver size, and mean liver dose. The LFL approach allowed each institution to train models locally without sharing raw patient data. Model performance was evaluated using the AUC and compared between the LFL model and institution-specific models.Results: For survival prediction, single-institution models were limited, with AUC = 0.55-0.63, with LFL increasing it to 0.67. For toxicity prediction, external validation showed AUC = 0.68 for the MGH model and 0.69 for the BWH model, with LFL maintaining the AUC at 0.7. The model coefficients were moderate in the LFL compared with the single-institution models, indicating an ability to mitigate bias, which was also reflected by better performance on the validation data set.Conclusion: LFL maintained or improved predictive performance over single-institution models for survival and hepatic toxicity in patients with HCC treated with EBRT while preserving model interpretability and patient privacy. These findings support LFL's role in multi-institutional collaborations.","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500074"},"PeriodicalIF":2.8000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12319563/pdf/","citationCount":"0","resultStr":"{\"title\":\"Linear Federated Learning for Outcome Prediction With Application to Hepatocellular Carcinoma Radiotherapy.\",\"authors\":\"Keyur D Shah, Harald Paganetti, Pablo Yepes, Theodore S Hong, Jennifer Y Wo, J Hannah Roberts, Eugene J Koay, Christian V Guthier, Ibrahim Chamseddine\",\"doi\":\"10.1200/CCI-25-00074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Purpose: Federated learning (FL) enables multi-institutional predictive modeling without sharing raw patient data, preserving privacy while leveraging diverse data sets. This study evaluates the use of linear FL (LFL) as an interpretable approach to enhance sample size and generalizability in outcome prediction. As a proof of concept, we applied LFL to patients with hepatocellular carcinoma (HCC) undergoing external beam radiotherapy (EBRT), predicting hepatic toxicity and 1-year survival (SRVy1).Methods: Patient data from Massachusetts General Hospital (MGH) and Brigham and Women's Hospital (BWH) were used to train models, whereas an independent validation data set from MD Anderson Cancer Center assessed generalizability. Logistic regression was developed to predict hepatic toxicity and SRVy1 using key clinical features, including baseline albumin, bilirubin, Child-Pugh score, liver size, and mean liver dose. The LFL approach allowed each institution to train models locally without sharing raw patient data. Model performance was evaluated using the AUC and compared between the LFL model and institution-specific models.Results: For survival prediction, single-institution models were limited, with AUC = 0.55-0.63, with LFL increasing it to 0.67. For toxicity prediction, external validation showed AUC = 0.68 for the MGH model and 0.69 for the BWH model, with LFL maintaining the AUC at 0.7. The model coefficients were moderate in the LFL compared with the single-institution models, indicating an ability to mitigate bias, which was also reflected by better performance on the validation data set.Conclusion: LFL maintained or improved predictive performance over single-institution models for survival and hepatic toxicity in patients with HCC treated with EBRT while preserving model interpretability and patient privacy. These findings support LFL's role in multi-institutional collaborations.\",\"PeriodicalId\":51626,\"journal\":{\"name\":\"JCO Clinical Cancer Informatics\",\"volume\":\"9 \",\"pages\":\"e2500074\"},\"PeriodicalIF\":2.8000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12319563/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JCO Clinical Cancer Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1200/CCI-25-00074\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/30 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI-25-00074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/30 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

目的：联邦学习（FL）可以在不共享原始患者数据的情况下实现多机构预测建模，在利用不同数据集的同时保护隐私。本研究评估了线性FL （LFL）作为一种可解释的方法来增加样本量和结果预测的普遍性。作为概念验证，我们将LFL应用于接受外束放疗（EBRT）的肝细胞癌（HCC）患者，预测肝毒性和1年生存率（SRVy1）。方法：使用来自马萨诸塞州总医院（MGH）和布莱根妇女医院（BWH）的患者数据来训练模型，而来自MD安德森癌症中心的独立验证数据集评估泛化性。采用Logistic回归方法预测肝毒性和SRVy1的主要临床特征，包括基线白蛋白、胆红素、Child-Pugh评分、肝脏大小和平均肝脏剂量。LFL方法允许每个机构在不共享原始患者数据的情况下在本地训练模型。使用AUC评估模型的性能，并将LFL模型与机构特定模型进行比较。结果：对于生存预测，单机构模型是有限的，AUC = 0.55-0.63， LFL使其增加到0.67。对于毒性预测，外部验证显示MGH模型的AUC = 0.68， BWH模型的AUC为0.69，LFL将AUC维持在0.7。与单一机构模型相比，LFL模型系数适中，表明有能力减轻偏差，这也反映在验证数据集上的更好性能上。结论：对于接受EBRT治疗的HCC患者，LFL维持或提高了单机构模型的生存和肝毒性预测性能，同时保留了模型的可解释性和患者隐私。这些发现支持了LFL在多机构合作中的作用。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Linear Federated Learning for Outcome Prediction With Application to Hepatocellular Carcinoma Radiotherapy.

Purpose: Federated learning (FL) enables multi-institutional predictive modeling without sharing raw patient data, preserving privacy while leveraging diverse data sets. This study evaluates the use of linear FL (LFL) as an interpretable approach to enhance sample size and generalizability in outcome prediction. As a proof of concept, we applied LFL to patients with hepatocellular carcinoma (HCC) undergoing external beam radiotherapy (EBRT), predicting hepatic toxicity and 1-year survival (SRVy1).

Methods: Patient data from Massachusetts General Hospital (MGH) and Brigham and Women's Hospital (BWH) were used to train models, whereas an independent validation data set from MD Anderson Cancer Center assessed generalizability. Logistic regression was developed to predict hepatic toxicity and SRVy1 using key clinical features, including baseline albumin, bilirubin, Child-Pugh score, liver size, and mean liver dose. The LFL approach allowed each institution to train models locally without sharing raw patient data. Model performance was evaluated using the AUC and compared between the LFL model and institution-specific models.

Results: For survival prediction, single-institution models were limited, with AUC = 0.55-0.63, with LFL increasing it to 0.67. For toxicity prediction, external validation showed AUC = 0.68 for the MGH model and 0.69 for the BWH model, with LFL maintaining the AUC at 0.7. The model coefficients were moderate in the LFL compared with the single-institution models, indicating an ability to mitigate bias, which was also reflected by better performance on the validation data set.

Conclusion: LFL maintained or improved predictive performance over single-institution models for survival and hepatic toxicity in patients with HCC treated with EBRT while preserving model interpretability and patient privacy. These findings support LFL's role in multi-institutional collaborations.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

JCO Clinical Cancer Informatics ONCOLOGY-

CiteScore

6.20

自引率

4.80%

发文量

190