泛欧洲数据驱动中心的疾病管理联合人工智能平台

V. Pezoulas, F. Kalatzis, T. Exarchos, Antreas Goules, A. Tzioufas, D. Fotiadis
{"title":"泛欧洲数据驱动中心的疾病管理联合人工智能平台","authors":"V. Pezoulas, F. Kalatzis, T. Exarchos, Antreas Goules, A. Tzioufas, D. Fotiadis","doi":"10.1109/BHI56158.2022.9926957","DOIUrl":null,"url":null,"abstract":"Nowadays there is an intensive need to move towards a universal health data ecosystem by breaking down data silos. Faced with a wealth of dispersed health data, there are still critical open issues and unmet needs to make this feasible, varying from secure data sharing to data quality and heterogeneity. Considering these challenges, we propose a novel federated platform to unlock the full potential of data from health data intermediaries through the secure sharing, curation, and Natural Language Processing (NLP)-based harmonization of dispersed and complex clinical data structures. The platform was deployed to establish a first Pan-European data hub on rare autoimmune and chronic diseases with 7551 harmonized patient records across 21 European countries with a 90% terminology overlap. An advanced data driven imputer was built to predict missing records in the real patient data based on high-quality synthetic data profiles (with Kullback-Leibler divergence less than 0.01). with reduced fault detection rate (less than 2%) compared to conventional imputers, such as, the kNN imputer. Customized and explainable federated AI algorithms were trained on top of the established data hub for lymphomagenesis modeling with 0.87 sensitivity and 0.74 specificity along with a set of validated biomarkers for disease onset and progression.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A federated AI-empowered platform for disease management across a Pan-European data driven hub\",\"authors\":\"V. Pezoulas, F. Kalatzis, T. Exarchos, Antreas Goules, A. Tzioufas, D. Fotiadis\",\"doi\":\"10.1109/BHI56158.2022.9926957\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays there is an intensive need to move towards a universal health data ecosystem by breaking down data silos. Faced with a wealth of dispersed health data, there are still critical open issues and unmet needs to make this feasible, varying from secure data sharing to data quality and heterogeneity. Considering these challenges, we propose a novel federated platform to unlock the full potential of data from health data intermediaries through the secure sharing, curation, and Natural Language Processing (NLP)-based harmonization of dispersed and complex clinical data structures. The platform was deployed to establish a first Pan-European data hub on rare autoimmune and chronic diseases with 7551 harmonized patient records across 21 European countries with a 90% terminology overlap. An advanced data driven imputer was built to predict missing records in the real patient data based on high-quality synthetic data profiles (with Kullback-Leibler divergence less than 0.01). with reduced fault detection rate (less than 2%) compared to conventional imputers, such as, the kNN imputer. Customized and explainable federated AI algorithms were trained on top of the established data hub for lymphomagenesis modeling with 0.87 sensitivity and 0.74 specificity along with a set of validated biomarkers for disease onset and progression.\",\"PeriodicalId\":347210,\"journal\":{\"name\":\"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BHI56158.2022.9926957\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BHI56158.2022.9926957","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

如今,迫切需要通过打破数据孤岛来实现普遍的卫生数据生态系统。面对大量分散的卫生数据,从安全的数据共享到数据质量和异质性,实现这一目标仍然存在重大的开放性问题和未满足的需求。考虑到这些挑战,我们提出了一个新的联合平台,通过安全共享、管理和基于自然语言处理(NLP)的分散和复杂临床数据结构的协调,释放来自健康数据中介的数据的全部潜力。部署该平台的目的是建立首个泛欧罕见自身免疫性疾病和慢性病数据中心,拥有21个欧洲国家的7551个统一患者记录,术语重叠90%。基于高质量的合成数据谱(Kullback-Leibler散度小于0.01),构建了先进的数据驱动输入器来预测真实患者数据中的缺失记录。与传统的输入器(如kNN输入器)相比,降低了故障检测率(小于2%)。定制的和可解释的联合人工智能算法在已建立的淋巴瘤生成模型数据中心的基础上进行训练,灵敏度为0.87,特异性为0.74,以及一组有效的疾病发生和进展的生物标志物。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A federated AI-empowered platform for disease management across a Pan-European data driven hub
Nowadays there is an intensive need to move towards a universal health data ecosystem by breaking down data silos. Faced with a wealth of dispersed health data, there are still critical open issues and unmet needs to make this feasible, varying from secure data sharing to data quality and heterogeneity. Considering these challenges, we propose a novel federated platform to unlock the full potential of data from health data intermediaries through the secure sharing, curation, and Natural Language Processing (NLP)-based harmonization of dispersed and complex clinical data structures. The platform was deployed to establish a first Pan-European data hub on rare autoimmune and chronic diseases with 7551 harmonized patient records across 21 European countries with a 90% terminology overlap. An advanced data driven imputer was built to predict missing records in the real patient data based on high-quality synthetic data profiles (with Kullback-Leibler divergence less than 0.01). with reduced fault detection rate (less than 2%) compared to conventional imputers, such as, the kNN imputer. Customized and explainable federated AI algorithms were trained on top of the established data hub for lymphomagenesis modeling with 0.87 sensitivity and 0.74 specificity along with a set of validated biomarkers for disease onset and progression.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信