Daniela Di Carlo , Ruth Ladenstein , Norbert Graf , Johannes Hans Merks , Gustavo Hernández-Peñaloza , Pamela Kearns , Gianni Bisogno
{"title":"儿童癌症数据整合的共同核心变量","authors":"Daniela Di Carlo , Ruth Ladenstein , Norbert Graf , Johannes Hans Merks , Gustavo Hernández-Peñaloza , Pamela Kearns , Gianni Bisogno","doi":"10.1016/j.ejcped.2024.100186","DOIUrl":null,"url":null,"abstract":"<div><h3>Introduction</h3><p>Data-driven research has improved paediatric cancer outcomes for children. However, challenges in sharing data between institutions prevent the use of artificial intelligence (AI) to address substantial unmet needs in children diagnosed with cancer. Harmonising collected data can enable the application of AI for a greater understanding of paediatric cancers. The main goal of the paper was to analyse the currently used childhood cancer databases to identify a core of variables able to capture the most relevant data on the diagnosis and treatment of children and adolescents with cancer.</p></div><div><h3>Methods</h3><p>We arbitrarily identified different types of existing databases dedicated to collecting data of patients with solid tumours, Umbrella, FAR-RMS; PARTNER; ERN PAEDCAN Registry; INSTRUCT and INRG; the common data elements for Rare Disease by Joint Research Centre. The different elements of the CRFs were analysed and ranked “essential” and “good to have”. Domains that included a group of variables structurally connected were identified. Each variable was defined by name, data type, description, and permissible values.</p></div><div><h3>Results</h3><p>We identified six structural domains: Patient registration, Personal information, Disease History, Diagnosis, Treatment, and Follow-up and Events. For each of them, “essential” and “good to have” variables were defined.</p></div><div><h3>Discussion</h3><p>Data harmonisation is essential for enhancing integration and comparability in research. By standardizing data formats and variables, researchers can facilitate data sharing, collaboration, and analysis across multiple studies and datasets. Embracing data harmonization practices will advance application of AI, scientific knowledge, improve research reproducibility, and contribute to evidence-based decision-making in various fields.</p></div>","PeriodicalId":94314,"journal":{"name":"EJC paediatric oncology","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-08-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.sciencedirect.com/science/article/pii/S2772610X24000461/pdfft?md5=fe6d34dd8fea9f6dd2eb50fc1c17e460&pid=1-s2.0-S2772610X24000461-main.pdf","citationCount":"0","resultStr":"{\"title\":\"Common core variables for childhood cancer data integration\",\"authors\":\"Daniela Di Carlo , Ruth Ladenstein , Norbert Graf , Johannes Hans Merks , Gustavo Hernández-Peñaloza , Pamela Kearns , Gianni Bisogno\",\"doi\":\"10.1016/j.ejcped.2024.100186\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div><h3>Introduction</h3><p>Data-driven research has improved paediatric cancer outcomes for children. However, challenges in sharing data between institutions prevent the use of artificial intelligence (AI) to address substantial unmet needs in children diagnosed with cancer. Harmonising collected data can enable the application of AI for a greater understanding of paediatric cancers. The main goal of the paper was to analyse the currently used childhood cancer databases to identify a core of variables able to capture the most relevant data on the diagnosis and treatment of children and adolescents with cancer.</p></div><div><h3>Methods</h3><p>We arbitrarily identified different types of existing databases dedicated to collecting data of patients with solid tumours, Umbrella, FAR-RMS; PARTNER; ERN PAEDCAN Registry; INSTRUCT and INRG; the common data elements for Rare Disease by Joint Research Centre. The different elements of the CRFs were analysed and ranked “essential” and “good to have”. Domains that included a group of variables structurally connected were identified. Each variable was defined by name, data type, description, and permissible values.</p></div><div><h3>Results</h3><p>We identified six structural domains: Patient registration, Personal information, Disease History, Diagnosis, Treatment, and Follow-up and Events. For each of them, “essential” and “good to have” variables were defined.</p></div><div><h3>Discussion</h3><p>Data harmonisation is essential for enhancing integration and comparability in research. By standardizing data formats and variables, researchers can facilitate data sharing, collaboration, and analysis across multiple studies and datasets. Embracing data harmonization practices will advance application of AI, scientific knowledge, improve research reproducibility, and contribute to evidence-based decision-making in various fields.</p></div>\",\"PeriodicalId\":94314,\"journal\":{\"name\":\"EJC paediatric oncology\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-08-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.sciencedirect.com/science/article/pii/S2772610X24000461/pdfft?md5=fe6d34dd8fea9f6dd2eb50fc1c17e460&pid=1-s2.0-S2772610X24000461-main.pdf\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"EJC paediatric oncology\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://www.sciencedirect.com/science/article/pii/S2772610X24000461\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"EJC paediatric oncology","FirstCategoryId":"1085","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2772610X24000461","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Common core variables for childhood cancer data integration
Introduction
Data-driven research has improved paediatric cancer outcomes for children. However, challenges in sharing data between institutions prevent the use of artificial intelligence (AI) to address substantial unmet needs in children diagnosed with cancer. Harmonising collected data can enable the application of AI for a greater understanding of paediatric cancers. The main goal of the paper was to analyse the currently used childhood cancer databases to identify a core of variables able to capture the most relevant data on the diagnosis and treatment of children and adolescents with cancer.
Methods
We arbitrarily identified different types of existing databases dedicated to collecting data of patients with solid tumours, Umbrella, FAR-RMS; PARTNER; ERN PAEDCAN Registry; INSTRUCT and INRG; the common data elements for Rare Disease by Joint Research Centre. The different elements of the CRFs were analysed and ranked “essential” and “good to have”. Domains that included a group of variables structurally connected were identified. Each variable was defined by name, data type, description, and permissible values.
Results
We identified six structural domains: Patient registration, Personal information, Disease History, Diagnosis, Treatment, and Follow-up and Events. For each of them, “essential” and “good to have” variables were defined.
Discussion
Data harmonisation is essential for enhancing integration and comparability in research. By standardizing data formats and variables, researchers can facilitate data sharing, collaboration, and analysis across multiple studies and datasets. Embracing data harmonization practices will advance application of AI, scientific knowledge, improve research reproducibility, and contribute to evidence-based decision-making in various fields.