Johanna Schwinn, Seyedmostafa Sheikhalishahi, Matthaeus Morhart, Iñaki Soto-Rey, Mathias Kaspar, Ludwig Christian Hinske
{"title":"五个ICU数据库中输血患者的比较联邦分析:使用Kullback-Leibler散度。","authors":"Johanna Schwinn, Seyedmostafa Sheikhalishahi, Matthaeus Morhart, Iñaki Soto-Rey, Mathias Kaspar, Ludwig Christian Hinske","doi":"10.3233/SHTI251495","DOIUrl":null,"url":null,"abstract":"<p><p>This study assesses feature distribution differences across five intensive care databases using Kullback-Leibler Divergence (KLD). Analyzing bidirectional KLD patterns between individual databases and the composite of others, stratifying by transfusion status. Results reveal heterogeneity: HiRID shows highest divergence in both directions, particularly among transfusion cases; UKA exhibits moderate overall divergence but pronounced differences in transfusion scenarios; MIMIC-IV shows minimal divergence, indicating closest alignment with group distributions. Notably, transfusion cases consistently display higher divergence than non-transfusion cases across all databases, highlighting institution-specific practices. These findings stress the importance of assessing data heterogeneity before implementing federated learning models to understand generalization capabilities.</p>","PeriodicalId":94357,"journal":{"name":"Studies in health technology and informatics","volume":"332 ","pages":"57-61"},"PeriodicalIF":0.0000,"publicationDate":"2025-10-02","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Comparative Federated Analytics of Blood Transfused Patients in Five ICU Databases: Using Kullback-Leibler Divergence.\",\"authors\":\"Johanna Schwinn, Seyedmostafa Sheikhalishahi, Matthaeus Morhart, Iñaki Soto-Rey, Mathias Kaspar, Ludwig Christian Hinske\",\"doi\":\"10.3233/SHTI251495\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>This study assesses feature distribution differences across five intensive care databases using Kullback-Leibler Divergence (KLD). Analyzing bidirectional KLD patterns between individual databases and the composite of others, stratifying by transfusion status. Results reveal heterogeneity: HiRID shows highest divergence in both directions, particularly among transfusion cases; UKA exhibits moderate overall divergence but pronounced differences in transfusion scenarios; MIMIC-IV shows minimal divergence, indicating closest alignment with group distributions. Notably, transfusion cases consistently display higher divergence than non-transfusion cases across all databases, highlighting institution-specific practices. These findings stress the importance of assessing data heterogeneity before implementing federated learning models to understand generalization capabilities.</p>\",\"PeriodicalId\":94357,\"journal\":{\"name\":\"Studies in health technology and informatics\",\"volume\":\"332 \",\"pages\":\"57-61\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2025-10-02\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Studies in health technology and informatics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3233/SHTI251495\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Studies in health technology and informatics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/SHTI251495","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Comparative Federated Analytics of Blood Transfused Patients in Five ICU Databases: Using Kullback-Leibler Divergence.
This study assesses feature distribution differences across five intensive care databases using Kullback-Leibler Divergence (KLD). Analyzing bidirectional KLD patterns between individual databases and the composite of others, stratifying by transfusion status. Results reveal heterogeneity: HiRID shows highest divergence in both directions, particularly among transfusion cases; UKA exhibits moderate overall divergence but pronounced differences in transfusion scenarios; MIMIC-IV shows minimal divergence, indicating closest alignment with group distributions. Notably, transfusion cases consistently display higher divergence than non-transfusion cases across all databases, highlighting institution-specific practices. These findings stress the importance of assessing data heterogeneity before implementing federated learning models to understand generalization capabilities.