Johanna Schwinn, Seyedmostafa Sheikhalishahi, Matthaeus Morhart, Iñaki Soto-Rey, Mathias Kaspar, Ludwig Christian Hinske
{"title":"Comparative Federated Analytics of Blood Transfused Patients in Five ICU Databases: Using Kullback-Leibler Divergence.","authors":"Johanna Schwinn, Seyedmostafa Sheikhalishahi, Matthaeus Morhart, Iñaki Soto-Rey, Mathias Kaspar, Ludwig Christian Hinske","doi":"10.3233/SHTI251495","DOIUrl":null,"url":null,"abstract":"<p><p>This study assesses feature distribution differences across five intensive care databases using Kullback-Leibler Divergence (KLD). Analyzing bidirectional KLD patterns between individual databases and the composite of others, stratifying by transfusion status. Results reveal heterogeneity: HiRID shows highest divergence in both directions, particularly among transfusion cases; UKA exhibits moderate overall divergence but pronounced differences in transfusion scenarios; MIMIC-IV shows minimal divergence, indicating closest alignment with group distributions. Notably, transfusion cases consistently display higher divergence than non-transfusion cases across all databases, highlighting institution-specific practices. These findings stress the importance of assessing data heterogeneity before implementing federated learning models to understand generalization capabilities.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"332 ","pages":"57-61"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Studies in health technology and informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/SHTI251495","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
This study assesses feature distribution differences across five intensive care databases using Kullback-Leibler Divergence (KLD). Analyzing bidirectional KLD patterns between individual databases and the composite of others, stratifying by transfusion status. Results reveal heterogeneity: HiRID shows highest divergence in both directions, particularly among transfusion cases; UKA exhibits moderate overall divergence but pronounced differences in transfusion scenarios; MIMIC-IV shows minimal divergence, indicating closest alignment with group distributions. Notably, transfusion cases consistently display higher divergence than non-transfusion cases across all databases, highlighting institution-specific practices. These findings stress the importance of assessing data heterogeneity before implementing federated learning models to understand generalization capabilities.