Jinqian Pan, Zhengkang Fan, Glenn E. Smith, Yi Guo, Jiang Bian, Jie Xu
{"title":"Federated learning with multi-cohort real-world data for predicting the progression from mild cognitive impairment to Alzheimer's disease","authors":"Jinqian Pan, Zhengkang Fan, Glenn E. Smith, Yi Guo, Jiang Bian, Jie Xu","doi":"10.1002/alz.70128","DOIUrl":null,"url":null,"abstract":"<div>\n \n \n <section>\n \n <h3> INTRODUCTION</h3>\n \n <p>Leveraging routinely collected electronic health records (EHRs) from multiple health-care institutions, this approach aims to assess the feasibility of using federated learning (FL) to predict the progression from mild cognitive impairment (MCI) to Alzheimer's disease (AD).</p>\n </section>\n \n <section>\n \n <h3> METHODS</h3>\n \n <p>We analyzed EHR data from the OneFlorida+ consortium, simulating six sites, and used a long short-term memory (LSTM) model with a federated averaging (FedAvg) algorithm. A personalized FL approach was used to address between-site heterogeneity. Model performance was assessed using the area under the receiver operating characteristic curve (AUC) and feature importance techniques.</p>\n </section>\n \n <section>\n \n <h3> RESULTS</h3>\n \n <p>Of 44,899 MCI patients, 6391 progressed to AD. FL models achieved a 6% improvement in AUC compared to local models. Key predictive features included body mass index, vitamin B12, blood pressure, and others.</p>\n </section>\n \n <section>\n \n <h3> DISCUSSION</h3>\n \n <p>FL showed promise in predicting AD progression by integrating heterogeneous data across multiple institutions while preserving privacy. Despite limitations, it offers potential for future clinical applications.</p>\n </section>\n \n <section>\n \n <h3> Highlights</h3>\n \n <div>\n <ul>\n \n <li>We applied long short-term memory and federated learning (FL) to predict mild cognitive impairment to Alzheimer's disease progression using electronic health record data from multiple institutions.</li>\n \n <li>FL improved prediction performance, with a 6% increase in area under the receiver operating characteristic curve compared to local models.</li>\n \n <li>We identified key predictive features, such as body mass index, vitamin B12, and blood pressure.</li>\n \n <li>FL shows effectiveness in handling data heterogeneity across multiple sites while ensuring data privacy.</li>\n \n <li>Personalized and pooled FL models generally performed better than global and local models.</li>\n </ul>\n </div>\n </section>\n </div>","PeriodicalId":7471,"journal":{"name":"Alzheimer's & Dementia","volume":"21 4","pages":""},"PeriodicalIF":13.0000,"publicationDate":"2025-04-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1002/alz.70128","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Alzheimer's & Dementia","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1002/alz.70128","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"CLINICAL NEUROLOGY","Score":null,"Total":0}
引用次数: 0
Abstract
INTRODUCTION
Leveraging routinely collected electronic health records (EHRs) from multiple health-care institutions, this approach aims to assess the feasibility of using federated learning (FL) to predict the progression from mild cognitive impairment (MCI) to Alzheimer's disease (AD).
METHODS
We analyzed EHR data from the OneFlorida+ consortium, simulating six sites, and used a long short-term memory (LSTM) model with a federated averaging (FedAvg) algorithm. A personalized FL approach was used to address between-site heterogeneity. Model performance was assessed using the area under the receiver operating characteristic curve (AUC) and feature importance techniques.
RESULTS
Of 44,899 MCI patients, 6391 progressed to AD. FL models achieved a 6% improvement in AUC compared to local models. Key predictive features included body mass index, vitamin B12, blood pressure, and others.
DISCUSSION
FL showed promise in predicting AD progression by integrating heterogeneous data across multiple institutions while preserving privacy. Despite limitations, it offers potential for future clinical applications.
Highlights
We applied long short-term memory and federated learning (FL) to predict mild cognitive impairment to Alzheimer's disease progression using electronic health record data from multiple institutions.
FL improved prediction performance, with a 6% increase in area under the receiver operating characteristic curve compared to local models.
We identified key predictive features, such as body mass index, vitamin B12, and blood pressure.
FL shows effectiveness in handling data heterogeneity across multiple sites while ensuring data privacy.
Personalized and pooled FL models generally performed better than global and local models.
期刊介绍:
Alzheimer's & Dementia is a peer-reviewed journal that aims to bridge knowledge gaps in dementia research by covering the entire spectrum, from basic science to clinical trials to social and behavioral investigations. It provides a platform for rapid communication of new findings and ideas, optimal translation of research into practical applications, increasing knowledge across diverse disciplines for early detection, diagnosis, and intervention, and identifying promising new research directions. In July 2008, Alzheimer's & Dementia was accepted for indexing by MEDLINE, recognizing its scientific merit and contribution to Alzheimer's research.