The development and validation of a privacy-preserving model based on federated learning for diagnosing severe pediatric pneumonia.

IF 1.7 4区 医学 Q2 PEDIATRICS
Translational pediatrics Pub Date : 2025-06-27 Epub Date: 2025-06-25 DOI:10.21037/tp-2025-349
Dejian Wang, Guoqiang Qi, Jing Li, Yuqi Wang, Kexiong Dong, Jian Ding, Chen Zhu, Jun Zhu, Beiyan Li, Gang Yu, Shuiguang Deng
{"title":"The development and validation of a privacy-preserving model based on federated learning for diagnosing severe pediatric pneumonia.","authors":"Dejian Wang, Guoqiang Qi, Jing Li, Yuqi Wang, Kexiong Dong, Jian Ding, Chen Zhu, Jun Zhu, Beiyan Li, Gang Yu, Shuiguang Deng","doi":"10.21037/tp-2025-349","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>There is a challenge of in diagnostic testing of pneumonia in children, especially severe pneumonia. Thus, developing an auxiliary diagnostic model to help identify severe pneumonia in pediatric patients at an early stage would be highly valuable to address the issues. To overcome the issue of privacy protection, we applied a privacy-preserving machine learning framework to build a multicenter diagnostic model based on federated learning technology.</p><p><strong>Methods: </strong>Based on Arya, a novel privacy computing platform developed by Hangzhou Healink Technology Corporation, several privacy-preserving federated learning models were developed using datasets from one, two, or four medical centers. A total of 5,091 records were included in this multicenter retrospective study, with 2,484 pediatric patients with severe pneumonia and 2,607 with common pneumonia. Among the records, 80% were used in model training for the diagnosis of severe pneumonia, with 11 common indicators, including white blood cell count (WBC), high-sensitivity C-reactive protein (hs-CRP), hemoglobin (Hb), platelet count (PLT), lymphocyte percentage (L%), monocyte percentage (M%), neutrophil percentage (N%), prothrombin time (PT), alanine aminotransferase (ALT), aspartate aminotransferase (AST), and lactic dehydrogenase (LDH), while the other 20% records were used for model efficacy evaluation. During the process, the original data were stored in the individual hospitals without transmission.</p><p><strong>Results: </strong>Based on privacy-preserving federated learning technology, the developed models provided reliable diagnostic efficacy for severe pneumonia. Among these models, the four-center model achieved the highest diagnostic efficacy (95.10% sensitivity, 82.70% specificity, and 85.80% accuracy). Although the two-center models achieved a relatively low diagnostic efficacy, they still surpassed the diagnostic efficacy of the single-center model (88.10% sensitivity, 74.60% specificity, and 81.00% accuracy).</p><p><strong>Conclusions: </strong>Privacy-preserving federated learning technology can facilitate the performance of multicenter studies and was used to develop a high-performance diagnostic model for severe pneumonia in pediatric patients, which can benefit doctors and patients as an auxiliary diagnostic tool.</p>","PeriodicalId":23294,"journal":{"name":"Translational pediatrics","volume":"14 6","pages":"1287-1295"},"PeriodicalIF":1.7000,"publicationDate":"2025-06-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12268547/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Translational pediatrics","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.21037/tp-2025-349","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/6/25 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"PEDIATRICS","Score":null,"Total":0}
引用次数: 0

Abstract

Background: There is a challenge of in diagnostic testing of pneumonia in children, especially severe pneumonia. Thus, developing an auxiliary diagnostic model to help identify severe pneumonia in pediatric patients at an early stage would be highly valuable to address the issues. To overcome the issue of privacy protection, we applied a privacy-preserving machine learning framework to build a multicenter diagnostic model based on federated learning technology.

Methods: Based on Arya, a novel privacy computing platform developed by Hangzhou Healink Technology Corporation, several privacy-preserving federated learning models were developed using datasets from one, two, or four medical centers. A total of 5,091 records were included in this multicenter retrospective study, with 2,484 pediatric patients with severe pneumonia and 2,607 with common pneumonia. Among the records, 80% were used in model training for the diagnosis of severe pneumonia, with 11 common indicators, including white blood cell count (WBC), high-sensitivity C-reactive protein (hs-CRP), hemoglobin (Hb), platelet count (PLT), lymphocyte percentage (L%), monocyte percentage (M%), neutrophil percentage (N%), prothrombin time (PT), alanine aminotransferase (ALT), aspartate aminotransferase (AST), and lactic dehydrogenase (LDH), while the other 20% records were used for model efficacy evaluation. During the process, the original data were stored in the individual hospitals without transmission.

Results: Based on privacy-preserving federated learning technology, the developed models provided reliable diagnostic efficacy for severe pneumonia. Among these models, the four-center model achieved the highest diagnostic efficacy (95.10% sensitivity, 82.70% specificity, and 85.80% accuracy). Although the two-center models achieved a relatively low diagnostic efficacy, they still surpassed the diagnostic efficacy of the single-center model (88.10% sensitivity, 74.60% specificity, and 81.00% accuracy).

Conclusions: Privacy-preserving federated learning technology can facilitate the performance of multicenter studies and was used to develop a high-performance diagnostic model for severe pneumonia in pediatric patients, which can benefit doctors and patients as an auxiliary diagnostic tool.

Abstract Image

Abstract Image

Abstract Image

基于联邦学习的儿童重症肺炎诊断隐私保护模型的开发与验证。
背景:儿童肺炎特别是重症肺炎的诊断检测存在挑战。因此,开发一种辅助诊断模型来帮助在早期阶段识别儿科患者的严重肺炎,对于解决这些问题将是非常有价值的。为了克服隐私保护问题,我们应用隐私保护机器学习框架构建了基于联邦学习技术的多中心诊断模型。方法:基于杭州海联科技有限公司开发的新型隐私计算平台Arya,利用一、二、四家医疗中心的数据集,构建了多个隐私保护联邦学习模型。这项多中心回顾性研究共纳入5091例记录,其中重症肺炎患儿2484例,普通肺炎患儿2607例。其中80%用于重症肺炎的模型训练,常用指标包括白细胞计数(WBC)、高敏c反应蛋白(hs-CRP)、血红蛋白(Hb)、血小板计数(PLT)、淋巴细胞百分比(L%)、单核细胞百分比(M%)、中性粒细胞百分比(N%)、凝血酶原时间(PT)、丙氨酸转氨酶(ALT)、天冬氨酸转氨酶(AST)、乳酸脱氢酶(LDH)等11项指标。其余20%的记录用于模型疗效评价。在此过程中,原始数据存储在各个医院,不进行传输。结果:基于隐私保护的联邦学习技术,所建立的模型对重症肺炎具有可靠的诊断效果。其中,四中心模型的诊断效能最高(敏感性95.10%,特异性82.70%,准确率85.80%)。虽然双中心模型的诊断效能相对较低,但仍优于单中心模型的诊断效能(敏感性88.10%,特异性74.60%,准确率81.00%)。结论:隐私保护联邦学习技术可以促进多中心研究的开展,并用于开发儿科重症肺炎患者的高性能诊断模型,作为辅助诊断工具,使医生和患者受益。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Translational pediatrics
Translational pediatrics Medicine-Pediatrics, Perinatology and Child Health
CiteScore
4.50
自引率
5.00%
发文量
108
期刊介绍: Information not localized
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信