João Rafael Almeida, Luís Bastião Silva, A. Pazos, J. L. Oliveira
{"title":"将异质患者级数据整合到tranSMART以支持多中心研究","authors":"João Rafael Almeida, Luís Bastião Silva, A. Pazos, J. L. Oliveira","doi":"10.1109/CBMS55023.2022.00018","DOIUrl":null,"url":null,"abstract":"Many medical studies have been conducted aiming for better understanding of the causes of diseases and to assist in treatments and protective factors. In some cases, these studies do not produce impactful findings due to the small number of participants. Some initiatives already invested efforts in conducting multicentre studies, which raises other technical challenges due to the heterogeneity of datasets. The analysis of such data sources implies dealing with different data structures, terminologies, concepts, languages, and most importantly, the knowledge behind the data. In this paper, we present a methodology to centralise different datasets into the tranSMART application, using a harmonising strategy based on standard data schema. This methodology can help researchers to generate evidence from a wider variety of data sources. This proposal was validated using Alzheimer's Disease cohorts from several countries, combining at the end 6,669 subjects and 172 clinical concepts. The harmonised datasets can provide multi-cohort queries and analysis. The software package is available, under the MIT license, at https://github.com/bioinformatics-ua/tranSMART-migrator.","PeriodicalId":218475,"journal":{"name":"2022 IEEE 35th International Symposium on Computer-Based Medical Systems (CBMS)","volume":"59 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Combining heterogeneous patient-level data into tranSMART to support multicentre studies\",\"authors\":\"João Rafael Almeida, Luís Bastião Silva, A. Pazos, J. L. Oliveira\",\"doi\":\"10.1109/CBMS55023.2022.00018\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Many medical studies have been conducted aiming for better understanding of the causes of diseases and to assist in treatments and protective factors. In some cases, these studies do not produce impactful findings due to the small number of participants. Some initiatives already invested efforts in conducting multicentre studies, which raises other technical challenges due to the heterogeneity of datasets. The analysis of such data sources implies dealing with different data structures, terminologies, concepts, languages, and most importantly, the knowledge behind the data. In this paper, we present a methodology to centralise different datasets into the tranSMART application, using a harmonising strategy based on standard data schema. This methodology can help researchers to generate evidence from a wider variety of data sources. This proposal was validated using Alzheimer's Disease cohorts from several countries, combining at the end 6,669 subjects and 172 clinical concepts. The harmonised datasets can provide multi-cohort queries and analysis. The software package is available, under the MIT license, at https://github.com/bioinformatics-ua/tranSMART-migrator.\",\"PeriodicalId\":218475,\"journal\":{\"name\":\"2022 IEEE 35th International Symposium on Computer-Based Medical Systems (CBMS)\",\"volume\":\"59 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-07-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 IEEE 35th International Symposium on Computer-Based Medical Systems (CBMS)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CBMS55023.2022.00018\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE 35th International Symposium on Computer-Based Medical Systems (CBMS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CBMS55023.2022.00018","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Combining heterogeneous patient-level data into tranSMART to support multicentre studies
Many medical studies have been conducted aiming for better understanding of the causes of diseases and to assist in treatments and protective factors. In some cases, these studies do not produce impactful findings due to the small number of participants. Some initiatives already invested efforts in conducting multicentre studies, which raises other technical challenges due to the heterogeneity of datasets. The analysis of such data sources implies dealing with different data structures, terminologies, concepts, languages, and most importantly, the knowledge behind the data. In this paper, we present a methodology to centralise different datasets into the tranSMART application, using a harmonising strategy based on standard data schema. This methodology can help researchers to generate evidence from a wider variety of data sources. This proposal was validated using Alzheimer's Disease cohorts from several countries, combining at the end 6,669 subjects and 172 clinical concepts. The harmonised datasets can provide multi-cohort queries and analysis. The software package is available, under the MIT license, at https://github.com/bioinformatics-ua/tranSMART-migrator.