Approaching a nationwide registry: analyzing big data in patients with heart failure.

IF 1.2 4区医学 Q2 MEDICINE, GENERAL & INTERNAL

Turkish Journal of Medical Sciences Pub Date : 2024-05-07 eCollection Date: 2024-01-01 DOI:10.55730/1300-0144.5931

Tuğçe Çöllüoğlu, Anıl Şahin, Ahmet Çelik, Emine Arzu Kanik

{"title":"Approaching a nationwide registry: analyzing big data in patients with heart failure.","authors":"Tuğçe Çöllüoğlu, Anıl Şahin, Ahmet Çelik, Emine Arzu Kanik","doi":"10.55730/1300-0144.5931","DOIUrl":null,"url":null,"abstract":"Background/aim: Randomized controlled trials usually lack generabilizity to real-world context. Real-world data, enabled by the use of big data analysis, serve as a connection between the results of trials and the implementation of findings in clinical practice. Nevertheless, using big data in the healthcare has difficulties such as ensuring data quality and consistency. This article aimed to examine the challenges in accessing and utilizing healthcare big data for heart failure (HF) research, drawing from experiences in creating a nationwide HF registry in Türkiye.Materials and methods: We established a team including cardiologists, HF specialists, biostatistics experts, and data analysts. We searched certain key words related to HF, including heart failure, nationwide study, epidemiology, incidence, prevalence, outcomes, comorbidities, medical therapy, and device therapy. We followed each step of the STROBE guidelines for the preparation of a nationwide study. We obtained big data for the TRends-HF trial from the National Healthcare Data System. For the purpose of obtaining big data, we screened 85,279,553 healthcare records of Turkish citizens between January 1, 2016 and December 31, 2022.Results: We created a study cohort with the use of ICD-10 codes by cross-checking HF medication (n = 2,722,151). Concurrent comorbid conditions were determined using ICD-10 codes. All medications and procedures were screened according to ATC codes and SUT codes, respectively. Variables were placed in different columns. We employed SPSS 29.0, MedCalc, and E-PICOS statistical programs for statistical analysis. Phyton-based codes were created to analyze data that was unsuitable for interpretation by conventional statistical programs. We have no missing data for categorical variables. There was missing data for certain continuous variables. Propensity score matching analysis was employed to establish similarity among the studied groups, particularly when investigating treatment effects.Conclusion: To accurately identify patients with HF using ICD-10 codes from big data and provide precise information, it is necessary to establish additional specific criteria for HF and use different statistical programs by experts for correctly analyzing big data.","PeriodicalId":23361,"journal":{"name":"Turkish Journal of Medical Sciences","volume":"54 7","pages":"1455-1460"},"PeriodicalIF":1.2000,"publicationDate":"2024-05-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11673634/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Turkish Journal of Medical Sciences","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.55730/1300-0144.5931","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/1/1 0:00:00","PubModel":"eCollection","JCR":"Q2","JCRName":"MEDICINE, GENERAL & INTERNAL","Score":null,"Total":0}

引用次数: 0

Abstract

Background/aim: Randomized controlled trials usually lack generabilizity to real-world context. Real-world data, enabled by the use of big data analysis, serve as a connection between the results of trials and the implementation of findings in clinical practice. Nevertheless, using big data in the healthcare has difficulties such as ensuring data quality and consistency. This article aimed to examine the challenges in accessing and utilizing healthcare big data for heart failure (HF) research, drawing from experiences in creating a nationwide HF registry in Türkiye.

Materials and methods: We established a team including cardiologists, HF specialists, biostatistics experts, and data analysts. We searched certain key words related to HF, including heart failure, nationwide study, epidemiology, incidence, prevalence, outcomes, comorbidities, medical therapy, and device therapy. We followed each step of the STROBE guidelines for the preparation of a nationwide study. We obtained big data for the TRends-HF trial from the National Healthcare Data System. For the purpose of obtaining big data, we screened 85,279,553 healthcare records of Turkish citizens between January 1, 2016 and December 31, 2022.

Results: We created a study cohort with the use of ICD-10 codes by cross-checking HF medication (n = 2,722,151). Concurrent comorbid conditions were determined using ICD-10 codes. All medications and procedures were screened according to ATC codes and SUT codes, respectively. Variables were placed in different columns. We employed SPSS 29.0, MedCalc, and E-PICOS statistical programs for statistical analysis. Phyton-based codes were created to analyze data that was unsuitable for interpretation by conventional statistical programs. We have no missing data for categorical variables. There was missing data for certain continuous variables. Propensity score matching analysis was employed to establish similarity among the studied groups, particularly when investigating treatment effects.

Conclusion: To accurately identify patients with HF using ICD-10 codes from big data and provide precise information, it is necessary to establish additional specific criteria for HF and use different statistical programs by experts for correctly analyzing big data.

查看原文本刊更多论文

接近全国登记：分析心力衰竭患者的大数据。

背景/目的：随机对照试验通常缺乏对现实环境的通用性。通过使用大数据分析，现实世界的数据可以作为试验结果和临床实践中发现实施之间的联系。然而，在医疗保健中使用大数据存在一些困难，例如确保数据质量和一致性。本文旨在研究获取和利用心力衰竭（HF）研究的医疗保健大数据所面临的挑战，并从 rkiye创建全国心力衰竭登记处的经验中汲取经验。材料和方法：我们建立了一个由心脏病专家、心衰专家、生物统计学专家和数据分析师组成的团队。我们检索了与心衰相关的关键词，包括心力衰竭、全国性研究、流行病学、发病率、患病率、结局、合并症、药物治疗和器械治疗。我们遵循STROBE指南的每一步来准备一项全国性的研究。我们从国家医疗保健数据系统获得了TRends-HF试验的大数据。为了获得大数据，我们筛选了2016年1月1日至2022年12月31日期间土耳其公民的85,279,553份医疗记录。结果：通过交叉核对心衰用药，我们创建了一个使用ICD-10代码的研究队列（n = 2,722,151）。使用ICD-10代码确定并发合并症。所有药物和程序分别根据ATC代码和SUT代码进行筛选。变量被放在不同的列中。采用SPSS 29.0、MedCalc、E-PICOS统计程序进行统计分析。基于植物的代码是用来分析传统统计程序无法解释的数据的。对于分类变量，我们没有丢失数据。某些连续变量的数据缺失。倾向评分匹配分析用于建立研究组之间的相似性，特别是在调查治疗效果时。结论：要想利用ICD-10代码从大数据中准确识别HF患者并提供准确的信息，需要建立额外的HF具体标准，并由专家使用不同的统计程序来正确分析大数据。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Turkish Journal of Medical Sciences 医学-医学：内科

CiteScore

4.60

自引率

4.30%

发文量

143

审稿时长

3-8 weeks

期刊介绍： Turkish Journal of Medical sciences is a peer-reviewed comprehensive resource that provides critical up-to-date information on the broad spectrum of general medical sciences. The Journal intended to publish original medical scientific papers regarding the priority based on the prominence, significance, and timeliness of the findings. However since the audience of the Journal is not limited to any subspeciality in a wide variety of medical disciplines, the papers focusing on the technical details of a given medical subspeciality may not be evaluated for publication.