V. Pezoulas, F. Kalatzis, T. Exarchos, Antreas Goules, A. Tzioufas, D. Fotiadis
{"title":"泛欧洲数据驱动中心的疾病管理联合人工智能平台","authors":"V. Pezoulas, F. Kalatzis, T. Exarchos, Antreas Goules, A. Tzioufas, D. Fotiadis","doi":"10.1109/BHI56158.2022.9926957","DOIUrl":null,"url":null,"abstract":"Nowadays there is an intensive need to move towards a universal health data ecosystem by breaking down data silos. Faced with a wealth of dispersed health data, there are still critical open issues and unmet needs to make this feasible, varying from secure data sharing to data quality and heterogeneity. Considering these challenges, we propose a novel federated platform to unlock the full potential of data from health data intermediaries through the secure sharing, curation, and Natural Language Processing (NLP)-based harmonization of dispersed and complex clinical data structures. The platform was deployed to establish a first Pan-European data hub on rare autoimmune and chronic diseases with 7551 harmonized patient records across 21 European countries with a 90% terminology overlap. An advanced data driven imputer was built to predict missing records in the real patient data based on high-quality synthetic data profiles (with Kullback-Leibler divergence less than 0.01). with reduced fault detection rate (less than 2%) compared to conventional imputers, such as, the kNN imputer. Customized and explainable federated AI algorithms were trained on top of the established data hub for lymphomagenesis modeling with 0.87 sensitivity and 0.74 specificity along with a set of validated biomarkers for disease onset and progression.","PeriodicalId":347210,"journal":{"name":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-09-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"A federated AI-empowered platform for disease management across a Pan-European data driven hub\",\"authors\":\"V. Pezoulas, F. Kalatzis, T. Exarchos, Antreas Goules, A. Tzioufas, D. Fotiadis\",\"doi\":\"10.1109/BHI56158.2022.9926957\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Nowadays there is an intensive need to move towards a universal health data ecosystem by breaking down data silos. Faced with a wealth of dispersed health data, there are still critical open issues and unmet needs to make this feasible, varying from secure data sharing to data quality and heterogeneity. Considering these challenges, we propose a novel federated platform to unlock the full potential of data from health data intermediaries through the secure sharing, curation, and Natural Language Processing (NLP)-based harmonization of dispersed and complex clinical data structures. The platform was deployed to establish a first Pan-European data hub on rare autoimmune and chronic diseases with 7551 harmonized patient records across 21 European countries with a 90% terminology overlap. An advanced data driven imputer was built to predict missing records in the real patient data based on high-quality synthetic data profiles (with Kullback-Leibler divergence less than 0.01). with reduced fault detection rate (less than 2%) compared to conventional imputers, such as, the kNN imputer. Customized and explainable federated AI algorithms were trained on top of the established data hub for lymphomagenesis modeling with 0.87 sensitivity and 0.74 specificity along with a set of validated biomarkers for disease onset and progression.\",\"PeriodicalId\":347210,\"journal\":{\"name\":\"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-09-27\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/BHI56158.2022.9926957\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/BHI56158.2022.9926957","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A federated AI-empowered platform for disease management across a Pan-European data driven hub
Nowadays there is an intensive need to move towards a universal health data ecosystem by breaking down data silos. Faced with a wealth of dispersed health data, there are still critical open issues and unmet needs to make this feasible, varying from secure data sharing to data quality and heterogeneity. Considering these challenges, we propose a novel federated platform to unlock the full potential of data from health data intermediaries through the secure sharing, curation, and Natural Language Processing (NLP)-based harmonization of dispersed and complex clinical data structures. The platform was deployed to establish a first Pan-European data hub on rare autoimmune and chronic diseases with 7551 harmonized patient records across 21 European countries with a 90% terminology overlap. An advanced data driven imputer was built to predict missing records in the real patient data based on high-quality synthetic data profiles (with Kullback-Leibler divergence less than 0.01). with reduced fault detection rate (less than 2%) compared to conventional imputers, such as, the kNN imputer. Customized and explainable federated AI algorithms were trained on top of the established data hub for lymphomagenesis modeling with 0.87 sensitivity and 0.74 specificity along with a set of validated biomarkers for disease onset and progression.