Tuğçe Çöllüoğlu, Anıl Şahin, Ahmet Çelik, Emine Arzu Kanik
Approaching a nationwide registry: analyzing big data in patients with heart failure
Turkish Journal of Medical Sciences, volume 54, issue 7, pages 1455-1460. Published 2024-05-07. DOI: 10.55730/1300-0144.5931
Full text (PDF): https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11673634/pdf/
Citations: 0
Abstract
Background/aim: Randomized controlled trials usually lack generalizability to real-world settings. Real-world data, made usable through big data analysis, connect trial results with the implementation of findings in clinical practice. Nevertheless, using big data in healthcare poses difficulties, such as ensuring data quality and consistency. This article examines the challenges of accessing and utilizing healthcare big data for heart failure (HF) research, drawing on experience from creating a nationwide HF registry in Türkiye.
Materials and methods: We established a team including cardiologists, HF specialists, biostatistics experts, and data analysts. We searched for keywords related to HF, including heart failure, nationwide study, epidemiology, incidence, prevalence, outcomes, comorbidities, medical therapy, and device therapy. We followed each step of the STROBE guidelines in preparing the nationwide study. We obtained big data for the TRends-HF trial from the National Healthcare Data System. To obtain these data, we screened 85,279,553 healthcare records of Turkish citizens between January 1, 2016 and December 31, 2022.
Results: We created the study cohort (n = 2,722,151) using ICD-10 codes, cross-checked against HF medication. Concurrent comorbid conditions were determined using ICD-10 codes. Medications and procedures were screened according to ATC codes and SUT codes, respectively. Variables were placed in separate columns. We used SPSS 29.0, MedCalc, and E-PICOS for statistical analysis. Python-based code was written to analyze data that were unsuitable for interpretation by conventional statistical programs. There were no missing data for categorical variables, but data were missing for certain continuous variables. Propensity score matching was employed to establish similarity among the studied groups, particularly when investigating treatment effects.
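The propensity score matching step mentioned in the Results can be illustrated with a minimal sketch. The abstract does not describe the matching algorithm, so the greedy 1:1 nearest-neighbour scheme, the caliper of 0.05, the function name greedy_match, and the toy scores below are all assumptions made for illustration; the propensity scores are taken as already estimated (for example, by logistic regression).

```python
from typing import Dict, List, Tuple


def greedy_match(
    treated: Dict[str, float],
    control: Dict[str, float],
    caliper: float = 0.05,
) -> List[Tuple[str, str]]:
    """Greedy 1:1 nearest-neighbour matching without replacement.

    `treated` and `control` map a patient id to an already-estimated
    propensity score (hypothetical inputs). A pair is kept only if the
    two scores differ by at most `caliper`.
    """
    available = dict(control)  # controls that have not been matched yet
    pairs: List[Tuple[str, str]] = []
    # Iterate treated patients in sorted id order so results are reproducible.
    for t_id, t_score in sorted(treated.items()):
        if not available:
            break
        # Closest remaining control by absolute score difference.
        c_id = min(available, key=lambda cid: abs(available[cid] - t_score))
        if abs(available[c_id] - t_score) <= caliper:
            pairs.append((t_id, c_id))
            del available[c_id]  # each control is used at most once
    return pairs


# Toy propensity scores (illustrative values, not study data).
treated = {"T1": 0.30, "T2": 0.62, "T3": 0.90}
control = {"C1": 0.28, "C2": 0.60, "C3": 0.33, "C4": 0.10}
pairs = greedy_match(treated, control)
print(pairs)
```

In this toy example T3 stays unmatched because no remaining control falls within the caliper. Discarding such patients trades sample size for closer similarity between the compared groups.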
Conclusion: Accurately identifying patients with HF from big data using ICD-10 codes, and providing precise information, requires establishing additional specific criteria for HF and having experts use different statistical programs to analyze the data correctly.
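As one illustration of applying an additional criterion on top of ICD-10 codes, the sketch below keeps only patients who have both an HF diagnosis code and an HF-related medication code, mirroring the cross-check described in the Results. The code prefixes (ICD-10 I50 for heart failure; ATC C03, C07, and C09 for diuretics, beta blockers, and renin-angiotensin system agents), the record layout, and the function names are assumptions chosen for the example; the abstract does not list the study's actual criteria.

```python
from typing import Dict, Iterable, List

# Illustrative prefixes only; not the study's actual code lists.
HF_ICD10_PREFIXES = ("I50",)              # ICD-10 I50.x: heart failure
HF_ATC_PREFIXES = ("C03", "C07", "C09")   # diuretics, beta blockers, RAS agents


def has_prefix(codes: Iterable[str], prefixes: tuple) -> bool:
    """True if any code starts with one of the prefixes (case-insensitive)."""
    return any(code.strip().upper().startswith(prefixes) for code in codes)


def select_hf_cohort(patients: List[Dict]) -> List[str]:
    """Return ids of patients with both an HF diagnosis code and an HF drug code."""
    return [
        p["id"]
        for p in patients
        if has_prefix(p["icd10"], HF_ICD10_PREFIXES)
        and has_prefix(p["atc"], HF_ATC_PREFIXES)
    ]


# Hypothetical records: one row per patient with lists of codes.
patients = [
    {"id": "P1", "icd10": ["I50.9", "E11.9"], "atc": ["C07AB07"]},  # diagnosis + drug
    {"id": "P2", "icd10": ["I50.1"], "atc": ["A10BA02"]},           # diagnosis only
    {"id": "P3", "icd10": ["I10"], "atc": ["C09AA05"]},             # drug only
]
cohort = select_hf_cohort(patients)
print(cohort)
```

Requiring both conditions excludes patients who carry only a diagnosis code or only a drug that is also prescribed for other conditions such as hypertension.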
About the journal:
Turkish Journal of Medical Sciences is a peer-reviewed, comprehensive resource that provides critical, up-to-date information on the broad spectrum of general medical sciences. The Journal intends to publish original medical scientific papers, with priority based on the prominence, significance, and timeliness of the findings. However, since the Journal's audience is not limited to any subspecialty within the wide variety of medical disciplines, papers focusing on the technical details of a given medical subspecialty may not be evaluated for publication.