Keyur D Shah, Harald Paganetti, Pablo Yepes, Theodore S Hong, Jennifer Y Wo, J Hannah Roberts, Eugene J Koay, Christian V Guthier, Ibrahim Chamseddine
{"title":"线性联邦学习在肝癌放疗预后预测中的应用。","authors":"Keyur D Shah, Harald Paganetti, Pablo Yepes, Theodore S Hong, Jennifer Y Wo, J Hannah Roberts, Eugene J Koay, Christian V Guthier, Ibrahim Chamseddine","doi":"10.1200/CCI-25-00074","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Federated learning (FL) enables multi-institutional predictive modeling without sharing raw patient data, preserving privacy while leveraging diverse data sets. This study evaluates the use of linear FL (LFL) as an interpretable approach to enhance sample size and generalizability in outcome prediction. As a proof of concept, we applied LFL to patients with hepatocellular carcinoma (HCC) undergoing external beam radiotherapy (EBRT), predicting hepatic toxicity and 1-year survival (SRVy1).</p><p><strong>Methods: </strong>Patient data from Massachusetts General Hospital (MGH) and Brigham and Women's Hospital (BWH) were used to train models, whereas an independent validation data set from MD Anderson Cancer Center assessed generalizability. Logistic regression was developed to predict hepatic toxicity and SRVy1 using key clinical features, including baseline albumin, bilirubin, Child-Pugh score, liver size, and mean liver dose. The LFL approach allowed each institution to train models locally without sharing raw patient data. Model performance was evaluated using the AUC and compared between the LFL model and institution-specific models.</p><p><strong>Results: </strong>For survival prediction, single-institution models were limited, with AUC = 0.55-0.63, with LFL increasing it to 0.67. For toxicity prediction, external validation showed AUC = 0.68 for the MGH model and 0.69 for the BWH model, with LFL maintaining the AUC at 0.7. The model coefficients were moderate in the LFL compared with the single-institution models, indicating an ability to mitigate bias, which was also reflected by better performance on the validation data set.</p><p><strong>Conclusion: </strong>LFL maintained or improved predictive performance over single-institution models for survival and hepatic toxicity in patients with HCC treated with EBRT while preserving model interpretability and patient privacy. These findings support LFL's role in multi-institutional collaborations.</p>","PeriodicalId":51626,"journal":{"name":"JCO Clinical Cancer Informatics","volume":"9 ","pages":"e2500074"},"PeriodicalIF":3.3000,"publicationDate":"2025-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Linear Federated Learning for Outcome Prediction With Application to Hepatocellular Carcinoma Radiotherapy.\",\"authors\":\"Keyur D Shah, Harald Paganetti, Pablo Yepes, Theodore S Hong, Jennifer Y Wo, J Hannah Roberts, Eugene J Koay, Christian V Guthier, Ibrahim Chamseddine\",\"doi\":\"10.1200/CCI-25-00074\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><strong>Purpose: </strong>Federated learning (FL) enables multi-institutional predictive modeling without sharing raw patient data, preserving privacy while leveraging diverse data sets. This study evaluates the use of linear FL (LFL) as an interpretable approach to enhance sample size and generalizability in outcome prediction. As a proof of concept, we applied LFL to patients with hepatocellular carcinoma (HCC) undergoing external beam radiotherapy (EBRT), predicting hepatic toxicity and 1-year survival (SRVy1).</p><p><strong>Methods: </strong>Patient data from Massachusetts General Hospital (MGH) and Brigham and Women's Hospital (BWH) were used to train models, whereas an independent validation data set from MD Anderson Cancer Center assessed generalizability. Logistic regression was developed to predict hepatic toxicity and SRVy1 using key clinical features, including baseline albumin, bilirubin, Child-Pugh score, liver size, and mean liver dose. The LFL approach allowed each institution to train models locally without sharing raw patient data. Model performance was evaluated using the AUC and compared between the LFL model and institution-specific models.</p><p><strong>Results: </strong>For survival prediction, single-institution models were limited, with AUC = 0.55-0.63, with LFL increasing it to 0.67. For toxicity prediction, external validation showed AUC = 0.68 for the MGH model and 0.69 for the BWH model, with LFL maintaining the AUC at 0.7. The model coefficients were moderate in the LFL compared with the single-institution models, indicating an ability to mitigate bias, which was also reflected by better performance on the validation data set.</p><p><strong>Conclusion: </strong>LFL maintained or improved predictive performance over single-institution models for survival and hepatic toxicity in patients with HCC treated with EBRT while preserving model interpretability and patient privacy. These findings support LFL's role in multi-institutional collaborations.</p>\",\"PeriodicalId\":51626,\"journal\":{\"name\":\"JCO Clinical Cancer Informatics\",\"volume\":\"9 \",\"pages\":\"e2500074\"},\"PeriodicalIF\":3.3000,\"publicationDate\":\"2025-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"JCO Clinical Cancer Informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1200/CCI-25-00074\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"2025/6/30 0:00:00\",\"PubModel\":\"Epub\",\"JCR\":\"Q2\",\"JCRName\":\"ONCOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"JCO Clinical Cancer Informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1200/CCI-25-00074","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/30 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"ONCOLOGY","Score":null,"Total":0}
Linear Federated Learning for Outcome Prediction With Application to Hepatocellular Carcinoma Radiotherapy.
Purpose: Federated learning (FL) enables multi-institutional predictive modeling without sharing raw patient data, preserving privacy while leveraging diverse data sets. This study evaluates the use of linear FL (LFL) as an interpretable approach to enhance sample size and generalizability in outcome prediction. As a proof of concept, we applied LFL to patients with hepatocellular carcinoma (HCC) undergoing external beam radiotherapy (EBRT), predicting hepatic toxicity and 1-year survival (SRVy1).
Methods: Patient data from Massachusetts General Hospital (MGH) and Brigham and Women's Hospital (BWH) were used to train models, whereas an independent validation data set from MD Anderson Cancer Center assessed generalizability. Logistic regression was developed to predict hepatic toxicity and SRVy1 using key clinical features, including baseline albumin, bilirubin, Child-Pugh score, liver size, and mean liver dose. The LFL approach allowed each institution to train models locally without sharing raw patient data. Model performance was evaluated using the AUC and compared between the LFL model and institution-specific models.
Results: For survival prediction, single-institution models were limited, with AUC = 0.55-0.63, with LFL increasing it to 0.67. For toxicity prediction, external validation showed AUC = 0.68 for the MGH model and 0.69 for the BWH model, with LFL maintaining the AUC at 0.7. The model coefficients were moderate in the LFL compared with the single-institution models, indicating an ability to mitigate bias, which was also reflected by better performance on the validation data set.
Conclusion: LFL maintained or improved predictive performance over single-institution models for survival and hepatic toxicity in patients with HCC treated with EBRT while preserving model interpretability and patient privacy. These findings support LFL's role in multi-institutional collaborations.