NHIRD and TriNetX in Rheumatology: Opportunities and Challenges

IF 2.4 4区 医学 Q2 RHEUMATOLOGY
Chao Chieh Cheng, Po-Cheng Shih, Su Boon Yong, Edward Chia-Cheng Lai
{"title":"NHIRD and TriNetX in Rheumatology: Opportunities and Challenges","authors":"Chao Chieh Cheng,&nbsp;Po-Cheng Shih,&nbsp;Su Boon Yong,&nbsp;Edward Chia-Cheng Lai","doi":"10.1111/1756-185X.70203","DOIUrl":null,"url":null,"abstract":"<p>Rheumatic diseases are prevalent worldwide, with a considerable impact on patients' quality of life and a tendency to require long-term management. These conditions are complicated by common comorbidities, including cardiovascular disease, endocrine disorder, and mood disorder, further increasing both disease burden and treatment complexity. While biological therapies have revolutionized the management of various rheumatic diseases, high costs, potential adverse effects, and the heterogeneity of patient populations remain significant barriers to achieving optimal, personalized care.</p><p>In the history of the development of clinical research, randomized controlled trials (RCTs) have been the gold standard for evaluating efficacy and safety. However, their strict inclusion and exclusion criteria, limited sample sizes, and relatively short follow-up periods may fail to capture the full spectrum of disease heterogeneity and long-term real-world positive and negative outcomes. However, the use of real-world evidence (RWE) in research, as demonstrated in the original article published in the <i>International Journal of Rheumatic Diseases</i> in early 2019 (Su-Boon Yong et al., 2019) [<span>1</span>], has brought new hope for addressing such predicaments. Real-world evidence (RWE) derived from large-scale data sources, such as Taiwan's National Health Insurance Research Database (NHIRD) and the recent build global networks like TriNetX, offers the potential to overcome many of these limitations.</p><p>This editorial intends to explore the growing influence of RWE on rheumatologic research, discussing both its potential benefits and its limitations. We also highlight future directions for leveraging big data—particularly from Taiwan's NHIRD, TriNetX, and other global repositories, to optimize treatment strategies, refine risk prediction models, and guide real-world clinical decision-making in rheumatology.</p><p>The large heterogeneity of diseases leading to the challenge of research, such as systemic lupus erythematosus (SLE), rheumatoid arthritis (RA), and idiopathic inflammatory myositis (IIM), each exhibiting considerable variability in clinical manifestations, progression, and therapeutic responses. Diagnostic accuracy is often compromised by nonspecific presentations and the limited reliability of current biomarkers, incomplete imaging data sets, and then elevating the risk of misdiagnosis. These limitations are especially magnified in rare rheumatic diseases, where small patient populations restrict the feasibility of clinical trials and inflate the costs of multinational studies.</p><p>With the advances of biological and targeted synthetic medications emerging, prolonged follow-up studies are frequently hampered by high financial demands and significant patient attrition. Insufficient resources, particularly in the realm of rare diseases and innovative drug development, further restrict progress [<span>2</span>].</p><p>To overcome these challenges, global collaboration and the adoption of novel methodologies are essential. The large-scale, real-world data sources, such as Taiwan's NHIRD or multi-institutional platforms like TriNetX, can enhance disease surveillance, treatment evaluation, and patient stratification beyond what traditional RCTs can capture.</p><p>With the rapid emergence of biological therapies, randomized controlled trials (RCTs) remain a cornerstone for evaluating efficacy and safety in rheumatology. However, the strict inclusion and exclusion criteria in RCTs often limit their external validity, as real-world patient populations typically present with greater heterogeneity and multiple comorbidities [<span>3</span>]. In contrast, real-world evidence (RWE), derived from patient registries, national health insurance databases (e.g., Taiwan's NHIRD), and electronic healthcare records (EHRs) enables the assessment of treatment outcomes in diverse, large-scale, and long-term clinical settings, more closely mirroring routine practice [<span>4</span>].</p><p>Randomized controlled trials (RCTs) provide high-level causal evidence primarily because randomization effectively mitigates confounding variables within a carefully defined cohort. At the same time, real-world evidence (RWE) offers a broader perspective by capturing larger and more diverse patient populations, including those with rare or complex conditions. This expansive scope facilitates long-term monitoring of treatment effectiveness and adverse events that may be missed in the relatively short duration of many RCTs. Furthermore, RWE studies often have lower costs, allowing rapid responses to pressing clinical or policy-related questions [<span>4</span>].</p><p>Despite their robust internal validity, RCTs, commonly rely on smaller sample sizes, have shorter follow-up periods and higher operational costs, all of which reduce their ability to detect long-term outcomes or rare adverse events. Strict protocols may also exclude patients with multiple comorbidities or atypical disease presentations, thereby limiting the generalizability of RCT findings [<span>3</span>]. By contrast, RWE can suffer from variable data quality across different sources, raising the risks of misclassification, incomplete information, and selection bias. Consequently, rigorous patient matching and sophisticated statistical techniques (e.g., propensity score analysis) are crucial for ensuring that RWE produces reliable and accurate results [<span>4</span>].</p><p>The National Health Insurance Research Database (NHIRD), derived from Taiwan's National Health Insurance system, encompasses claims data for over 99.99% of the population, spanning outpatient and inpatient visits since 2000 [<span>5</span>]. Unlike many EHR-based systems that capture data primarily from specific hospital networks or healthcare systems, NHIRD offers a population-based data set, enhancing its representativeness. Since its inception, NHIRD has amassed extensive information on outpatient and inpatient services, prescription medications, and procedural claims, making it a comprehensive resource for large-scale epidemiological and health services research. By linking to other robust data sources—such as cancer registries and death records—the NHIRD enables longitudinal analyses and facilitates the study of both common and rare diseases in real-world clinical settings [<span>5</span>].</p><p>Several countries have developed comparable population-level data sets, such as the Clinical Practice Research Datalink (CPRD) in the United Kingdom, various national healthcare registries in Scandinavian countries (e.g., Denmark, Sweden), and the National Health Information Database (NHID) in South Korea. What sets the NHIRD apart is its nearly universal enrollment, minimal loss to follow up, and the capacity to investigate a vast array of topics ranging from pharmacoepidemiology to cost-effectiveness analyses. Additionally, NHIRD's scalability allows linkage with other databases, enriching research dimensions and improving data accuracy.</p><p>However, limitations remain. The database's reliance on administrative coding means data accuracy can be affected by coding practices, including potential issues like “upcoding” or coding errors [<span>5</span>]. Important unmeasured confounders, such as disease severity and lifestyle behaviors (e.g., smoking, alcohol use), are often absent, posing challenges for adjusting bias in research. NHIRD also excludes self-paid services, like cosmetic procedures, limiting its scope in certain areas [<span>6</span>]. Privacy and regulatory restrictions further complicate data access, as researchers must conduct onsite analysis at designated centers, with applications subjected to expert review. Despite these challenges, NHIRD remains a unique and invaluable population-based data set, serving as a cornerstone for advancing rheumatology and broader medical research through its comprehensive and representative data.</p><p>TriNetX is a global network that connects EHR data from over 130 healthcare organizations with biopharmaceutical companies to facilitate clinical trials while ensuring patient privacy through the use of aggregated data for feasibility assessments and recruitment.</p><p>Its key advantages include robust privacy safeguards, opportunities for collaboration between hospitals and pharmaceutical companies, and extensive global coverage, making it well suited for multi-institutional studies. Easy access and a friendly operating interface are also advantages of the database. After the adjustment in the recent platform, there are several collaborative networks with different purposes in the database to match the needs. However, limitations include the lack of standardized trial identifiers, which complicate the evaluation of system impact, and a focus on data queries with limited clarity regarding their influence on trial progression [<span>7</span>].</p><p>The National Health Insurance Research Database (NHIRD) and TriNetX exhibit distinct features and applications, as summarized in Table 1. NHIRD is population-based, providing comprehensive claims data, including diagnoses and prescriptions. It is primarily utilized for epidemiological research and health policy analysis, with robust data linkage to other government data sets. However, access to NHIRD is restricted to Taiwanese researchers or collaborators, and limited validation of certain diagnostic data highlights the need for improved accuracy [<span>5, 6</span>].</p><p>In contrast, TriNetX operates as a global federated network, connecting more than 130 healthcare organizations and offering aggregated patient data, such as counts of diagnoses and procedures, to ensure privacy. Its initial setting's primary focus is on supporting sponsor-initiated clinical trials, particularly for feasibility assessments and patient recruitment, though it does not offer data linkage capabilities. TriNetX has fewer access restrictions, varying by participating institutions, and no major data validation concerns have been reported. While both platforms serve valuable but distinct purposes, they differ in data coverage, level of detail, and access policies [<span>7</span>].</p><p>The NHIRD and TriNetX offer complementary strengths for rheumatologic research, each designed to address different investigative goals. NHIRD, with its nearly universal population coverage, excels in epidemiological and health services research, including analyses of disease prevalence, healthcare utilization, and policy impacts. In contrast, TriNetX leverages detailed EHR data, making it particularly suited for examining treatment effectiveness, disease progression, and patient stratification within diverse clinical settings.</p><p>For instance, NHIRD can capture nationwide trends in the incidence and management of rheumatoid arthritis, while TriNetX enables granular assessments of treatment adherence and clinical outcomes in well-defined patient subgroups. By integrating these two data sources, researchers can combine NHIRD's population-level breadth with TriNetX's clinical depth, employing cross-validation to corroborate key findings or using hybrid modeling approaches to explore disparities and outcomes across different cohorts. However, there is still a limitation in the combination of the studies between different claims or databases due to the validity differences and different definitions of raw data. Additionally, advanced machine learning or artificial intelligence methods can enhance both data integration and analysis, revealing novel patterns in rheumatologic care, guiding precision medicine, and informing evidence-based clinical and policy decisions.</p><p>Advancing rheumatologic research demands innovative strategies to navigate the complexities of these diseases. The NHIRD excels in population-level analyses, whereas TriNetX offers granular clinical insights, making their integration a powerful tool for comprehensive investigations. Combining these resources enables researchers to gain a deeper understanding of disease mechanisms, treatment outcomes, and healthcare systems. The combination with EHR database and population-based database mitigates the problem in external validity, misclassification, information, and selection biases from the full perspective analysis. This integrative approach holds the potential to transform rheumatologic research, fostering the development of personalized, data-driven care strategies.</p><p>C.C.C. and P.-C.S.: writing – original draft. S.B.Y., E.C.-C.L.: review and editing. S.B.Y., P.-C.S.: writing – review and editing.</p><p>The authors declare no conflicts of interest.</p>","PeriodicalId":14330,"journal":{"name":"International Journal of Rheumatic Diseases","volume":"28 4","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2025-04-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.1111/1756-185X.70203","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Rheumatic Diseases","FirstCategoryId":"3","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.1111/1756-185X.70203","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"RHEUMATOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Rheumatic diseases are prevalent worldwide, with a considerable impact on patients' quality of life and a tendency to require long-term management. These conditions are complicated by common comorbidities, including cardiovascular disease, endocrine disorder, and mood disorder, further increasing both disease burden and treatment complexity. While biological therapies have revolutionized the management of various rheumatic diseases, high costs, potential adverse effects, and the heterogeneity of patient populations remain significant barriers to achieving optimal, personalized care.

In the history of the development of clinical research, randomized controlled trials (RCTs) have been the gold standard for evaluating efficacy and safety. However, their strict inclusion and exclusion criteria, limited sample sizes, and relatively short follow-up periods may fail to capture the full spectrum of disease heterogeneity and long-term real-world positive and negative outcomes. However, the use of real-world evidence (RWE) in research, as demonstrated in the original article published in the International Journal of Rheumatic Diseases in early 2019 (Su-Boon Yong et al., 2019) [1], has brought new hope for addressing such predicaments. Real-world evidence (RWE) derived from large-scale data sources, such as Taiwan's National Health Insurance Research Database (NHIRD) and the recent build global networks like TriNetX, offers the potential to overcome many of these limitations.

This editorial intends to explore the growing influence of RWE on rheumatologic research, discussing both its potential benefits and its limitations. We also highlight future directions for leveraging big data—particularly from Taiwan's NHIRD, TriNetX, and other global repositories, to optimize treatment strategies, refine risk prediction models, and guide real-world clinical decision-making in rheumatology.

The large heterogeneity of diseases leading to the challenge of research, such as systemic lupus erythematosus (SLE), rheumatoid arthritis (RA), and idiopathic inflammatory myositis (IIM), each exhibiting considerable variability in clinical manifestations, progression, and therapeutic responses. Diagnostic accuracy is often compromised by nonspecific presentations and the limited reliability of current biomarkers, incomplete imaging data sets, and then elevating the risk of misdiagnosis. These limitations are especially magnified in rare rheumatic diseases, where small patient populations restrict the feasibility of clinical trials and inflate the costs of multinational studies.

With the advances of biological and targeted synthetic medications emerging, prolonged follow-up studies are frequently hampered by high financial demands and significant patient attrition. Insufficient resources, particularly in the realm of rare diseases and innovative drug development, further restrict progress [2].

To overcome these challenges, global collaboration and the adoption of novel methodologies are essential. The large-scale, real-world data sources, such as Taiwan's NHIRD or multi-institutional platforms like TriNetX, can enhance disease surveillance, treatment evaluation, and patient stratification beyond what traditional RCTs can capture.

With the rapid emergence of biological therapies, randomized controlled trials (RCTs) remain a cornerstone for evaluating efficacy and safety in rheumatology. However, the strict inclusion and exclusion criteria in RCTs often limit their external validity, as real-world patient populations typically present with greater heterogeneity and multiple comorbidities [3]. In contrast, real-world evidence (RWE), derived from patient registries, national health insurance databases (e.g., Taiwan's NHIRD), and electronic healthcare records (EHRs) enables the assessment of treatment outcomes in diverse, large-scale, and long-term clinical settings, more closely mirroring routine practice [4].

Randomized controlled trials (RCTs) provide high-level causal evidence primarily because randomization effectively mitigates confounding variables within a carefully defined cohort. At the same time, real-world evidence (RWE) offers a broader perspective by capturing larger and more diverse patient populations, including those with rare or complex conditions. This expansive scope facilitates long-term monitoring of treatment effectiveness and adverse events that may be missed in the relatively short duration of many RCTs. Furthermore, RWE studies often have lower costs, allowing rapid responses to pressing clinical or policy-related questions [4].

Despite their robust internal validity, RCTs, commonly rely on smaller sample sizes, have shorter follow-up periods and higher operational costs, all of which reduce their ability to detect long-term outcomes or rare adverse events. Strict protocols may also exclude patients with multiple comorbidities or atypical disease presentations, thereby limiting the generalizability of RCT findings [3]. By contrast, RWE can suffer from variable data quality across different sources, raising the risks of misclassification, incomplete information, and selection bias. Consequently, rigorous patient matching and sophisticated statistical techniques (e.g., propensity score analysis) are crucial for ensuring that RWE produces reliable and accurate results [4].

The National Health Insurance Research Database (NHIRD), derived from Taiwan's National Health Insurance system, encompasses claims data for over 99.99% of the population, spanning outpatient and inpatient visits since 2000 [5]. Unlike many EHR-based systems that capture data primarily from specific hospital networks or healthcare systems, NHIRD offers a population-based data set, enhancing its representativeness. Since its inception, NHIRD has amassed extensive information on outpatient and inpatient services, prescription medications, and procedural claims, making it a comprehensive resource for large-scale epidemiological and health services research. By linking to other robust data sources—such as cancer registries and death records—the NHIRD enables longitudinal analyses and facilitates the study of both common and rare diseases in real-world clinical settings [5].

Several countries have developed comparable population-level data sets, such as the Clinical Practice Research Datalink (CPRD) in the United Kingdom, various national healthcare registries in Scandinavian countries (e.g., Denmark, Sweden), and the National Health Information Database (NHID) in South Korea. What sets the NHIRD apart is its nearly universal enrollment, minimal loss to follow up, and the capacity to investigate a vast array of topics ranging from pharmacoepidemiology to cost-effectiveness analyses. Additionally, NHIRD's scalability allows linkage with other databases, enriching research dimensions and improving data accuracy.

However, limitations remain. The database's reliance on administrative coding means data accuracy can be affected by coding practices, including potential issues like “upcoding” or coding errors [5]. Important unmeasured confounders, such as disease severity and lifestyle behaviors (e.g., smoking, alcohol use), are often absent, posing challenges for adjusting bias in research. NHIRD also excludes self-paid services, like cosmetic procedures, limiting its scope in certain areas [6]. Privacy and regulatory restrictions further complicate data access, as researchers must conduct onsite analysis at designated centers, with applications subjected to expert review. Despite these challenges, NHIRD remains a unique and invaluable population-based data set, serving as a cornerstone for advancing rheumatology and broader medical research through its comprehensive and representative data.

TriNetX is a global network that connects EHR data from over 130 healthcare organizations with biopharmaceutical companies to facilitate clinical trials while ensuring patient privacy through the use of aggregated data for feasibility assessments and recruitment.

Its key advantages include robust privacy safeguards, opportunities for collaboration between hospitals and pharmaceutical companies, and extensive global coverage, making it well suited for multi-institutional studies. Easy access and a friendly operating interface are also advantages of the database. After the adjustment in the recent platform, there are several collaborative networks with different purposes in the database to match the needs. However, limitations include the lack of standardized trial identifiers, which complicate the evaluation of system impact, and a focus on data queries with limited clarity regarding their influence on trial progression [7].

The National Health Insurance Research Database (NHIRD) and TriNetX exhibit distinct features and applications, as summarized in Table 1. NHIRD is population-based, providing comprehensive claims data, including diagnoses and prescriptions. It is primarily utilized for epidemiological research and health policy analysis, with robust data linkage to other government data sets. However, access to NHIRD is restricted to Taiwanese researchers or collaborators, and limited validation of certain diagnostic data highlights the need for improved accuracy [5, 6].

In contrast, TriNetX operates as a global federated network, connecting more than 130 healthcare organizations and offering aggregated patient data, such as counts of diagnoses and procedures, to ensure privacy. Its initial setting's primary focus is on supporting sponsor-initiated clinical trials, particularly for feasibility assessments and patient recruitment, though it does not offer data linkage capabilities. TriNetX has fewer access restrictions, varying by participating institutions, and no major data validation concerns have been reported. While both platforms serve valuable but distinct purposes, they differ in data coverage, level of detail, and access policies [7].

The NHIRD and TriNetX offer complementary strengths for rheumatologic research, each designed to address different investigative goals. NHIRD, with its nearly universal population coverage, excels in epidemiological and health services research, including analyses of disease prevalence, healthcare utilization, and policy impacts. In contrast, TriNetX leverages detailed EHR data, making it particularly suited for examining treatment effectiveness, disease progression, and patient stratification within diverse clinical settings.

For instance, NHIRD can capture nationwide trends in the incidence and management of rheumatoid arthritis, while TriNetX enables granular assessments of treatment adherence and clinical outcomes in well-defined patient subgroups. By integrating these two data sources, researchers can combine NHIRD's population-level breadth with TriNetX's clinical depth, employing cross-validation to corroborate key findings or using hybrid modeling approaches to explore disparities and outcomes across different cohorts. However, there is still a limitation in the combination of the studies between different claims or databases due to the validity differences and different definitions of raw data. Additionally, advanced machine learning or artificial intelligence methods can enhance both data integration and analysis, revealing novel patterns in rheumatologic care, guiding precision medicine, and informing evidence-based clinical and policy decisions.

Advancing rheumatologic research demands innovative strategies to navigate the complexities of these diseases. The NHIRD excels in population-level analyses, whereas TriNetX offers granular clinical insights, making their integration a powerful tool for comprehensive investigations. Combining these resources enables researchers to gain a deeper understanding of disease mechanisms, treatment outcomes, and healthcare systems. The combination with EHR database and population-based database mitigates the problem in external validity, misclassification, information, and selection biases from the full perspective analysis. This integrative approach holds the potential to transform rheumatologic research, fostering the development of personalized, data-driven care strategies.

C.C.C. and P.-C.S.: writing – original draft. S.B.Y., E.C.-C.L.: review and editing. S.B.Y., P.-C.S.: writing – review and editing.

The authors declare no conflicts of interest.

风湿病学中的NHIRD和TriNetX:机遇与挑战
风湿病在世界范围内普遍存在,对患者的生活质量有相当大的影响,并且往往需要长期治疗。这些疾病还伴有常见的合并症,包括心血管疾病、内分泌紊乱和情绪紊乱,进一步增加了疾病负担和治疗复杂性。虽然生物疗法已经彻底改变了各种风湿性疾病的管理,但高昂的费用、潜在的不良反应和患者群体的异质性仍然是实现最佳个性化护理的重大障碍。在临床研究发展史上,随机对照试验(rct)一直是评价疗效和安全性的金标准。然而,它们严格的纳入和排除标准、有限的样本量和相对较短的随访期可能无法捕捉到疾病异质性的全部范围和长期现实世界的阳性和阴性结果。然而,正如2019年初发表在《国际风湿病杂志》(Su-Boon Yong et al., 2019)[1]上的原始文章所展示的那样,在研究中使用真实世界证据(RWE)为解决这些困境带来了新的希望。来自大规模数据源的真实世界证据(RWE),如台湾的国家健康保险研究数据库(NHIRD)和最近建立的全球网络,如TriNetX,提供了克服许多这些限制的潜力。这篇社论旨在探讨RWE在风湿病研究中日益增长的影响,讨论其潜在的益处和局限性。我们还强调了利用大数据的未来方向,特别是来自台湾NHIRD, TriNetX和其他全球存储库的大数据,以优化治疗策略,完善风险预测模型,并指导风湿病的现实临床决策。系统性红斑狼疮(SLE)、类风湿性关节炎(RA)和特发性炎症性肌炎(IIM)等疾病的巨大异质性给研究带来了挑战,每种疾病在临床表现、进展和治疗反应方面都表现出相当大的差异。诊断的准确性经常受到非特异性表现和当前生物标志物的有限可靠性,不完整的成像数据集的影响,然后增加了误诊的风险。在罕见的风湿病中,这些限制尤其被放大,在这些疾病中,患者人数少限制了临床试验的可行性,并增加了跨国研究的成本。随着生物和靶向合成药物的出现,长期的随访研究经常受到高财政需求和显著的患者流失的阻碍。资源不足,特别是在罕见病和创新药物开发领域,进一步限制了进展。为了克服这些挑战,全球合作和采用新方法至关重要。大规模、真实的数据来源,如台湾的NHIRD或TriNetX等多机构平台,可以加强疾病监测、治疗评估和患者分层,这是传统随机对照试验无法捕捉的。随着生物疗法的迅速出现,随机对照试验(rct)仍然是评估风湿病疗效和安全性的基石。然而,rct中严格的纳入和排除标准往往限制了其外部有效性,因为现实世界的患者群体通常存在更大的异质性和多种合共病[10]。相比之下,来自患者登记、国家健康保险数据库(例如,台湾的NHIRD)和电子医疗记录(EHRs)的真实证据(RWE)能够在不同的、大规模的和长期的临床环境中评估治疗结果,更紧密地反映常规实践bb0。随机对照试验(RCTs)提供了高水平的因果证据,主要是因为随机化在一个精心定义的队列中有效地减轻了混杂变量。与此同时,现实世界证据(RWE)通过捕获更大、更多样化的患者群体,包括那些患有罕见或复杂疾病的患者,提供了更广阔的视角。这种广泛的范围有助于长期监测治疗效果和不良事件,而这些不良事件可能在许多随机对照试验相对较短的持续时间内被遗漏。此外,RWE研究通常成本较低,可以对紧迫的临床或政策相关问题做出快速反应。尽管随机对照试验具有强大的内部有效性,但通常依赖较小的样本量,随访时间较短,操作成本较高,所有这些都降低了其检测长期结果或罕见不良事件的能力。严格的方案也可能排除有多种合并症或非典型疾病表现的患者,从而限制了RCT结果的普遍性[10]。 相比之下,RWE可能会受到不同来源的数据质量的影响,从而增加了错误分类、信息不完整和选择偏差的风险。因此,严格的患者匹配和复杂的统计技术(例如,倾向评分分析)对于确保RWE产生可靠和准确的结果至关重要。国民健康保险研究数据库(NHIRD)来源于台湾的国民健康保险系统,包含了自2000年以来超过99.99%的人口的索赔数据,涵盖门诊和住院就诊。与许多主要从特定医院网络或医疗保健系统获取数据的基于电子病历的系统不同,NHIRD提供了基于人群的数据集,增强了其代表性。NHIRD自成立以来积累了大量关于门诊和住院服务、处方药物和程序性索赔的信息,使其成为大规模流行病学和卫生服务研究的综合资源。通过与其他可靠的数据来源(如癌症登记和死亡记录)的联系,NHIRD能够进行纵向分析,并促进对现实世界临床环境中常见和罕见疾病的研究[10]。一些国家已经开发了可比较的人口水平数据集,如英国的临床实践研究数据链(CPRD)、斯堪的纳维亚国家(如丹麦、瑞典)的各种国家卫生保健登记处和韩国的国家卫生信息数据库(NHID)。NHIRD的与众不同之处在于其几乎普遍的登记,最小的后续损失,以及调查从药物流行病学到成本效益分析等广泛主题的能力。此外,NHIRD的可扩展性允许与其他数据库连接,丰富研究维度并提高数据准确性。然而,局限性依然存在。数据库对管理编码的依赖意味着数据的准确性可能会受到编码实践的影响,包括“上编码”或编码错误[5]等潜在问题。重要的未测量混杂因素,如疾病严重程度和生活方式行为(如吸烟、饮酒)往往不存在,这给调整研究偏差带来了挑战。NHIRD还不包括自费服务,如整容手术,限制了其在某些领域的范围。隐私和监管限制进一步使数据访问复杂化,因为研究人员必须在指定的中心进行现场分析,应用程序要经过专家审查。尽管存在这些挑战,NHIRD仍然是一个独特而宝贵的基于人群的数据集,通过其全面和代表性的数据,作为推进风湿病学和更广泛的医学研究的基石。TriNetX是一个全球网络,将来自130多个医疗保健组织的EHR数据与生物制药公司连接起来,以促进临床试验,同时通过使用聚合数据进行可行性评估和招聘,确保患者隐私。它的主要优势包括强大的隐私保护、医院和制药公司之间的合作机会以及广泛的全球覆盖范围,使其非常适合多机构研究。易于访问和友好的操作界面也是数据库的优点。在最近的平台调整后,数据库中有几个不同用途的协作网络来匹配需求。然而,局限性包括缺乏标准化的试验标识符,这使系统影响的评估复杂化,并且关注数据查询,其对试验进展的影响有限。国家健康保险研究数据库(NHIRD)和TriNetX显示出不同的特点和应用,如表1所示。NHIRD以人口为基础,提供全面的索赔数据,包括诊断和处方。它主要用于流行病学研究和卫生政策分析,与其他政府数据集有强有力的数据联系。然而,NHIRD的访问仅限于台湾的研究人员或合作者,并且某些诊断数据的有限验证突出了提高准确性的必要性[5,6]。相比之下,TriNetX作为一个全球联合网络运行,连接了130多个医疗保健组织,并提供汇总的患者数据,如诊断和手术计数,以确保隐私。其初始设置的主要重点是支持赞助商发起的临床试验,特别是可行性评估和患者招募,尽管它不提供数据链接功能。TriNetX的访问限制较少,因参与机构而异,并且没有报告主要的数据验证问题。虽然这两个平台都有各自不同的用途,但它们在数据覆盖范围、详细程度和访问策略[7]方面有所不同。 NHIRD和TriNetX为风湿病研究提供了互补的优势,每个都旨在解决不同的调查目标。全国人口研究中心几乎覆盖所有人口,擅长流行病学和保健服务研究,包括对疾病流行、保健利用和政策影响的分析。相比之下,TriNetX利用详细的电子病历数据,使其特别适合于检查不同临床环境中的治疗效果、疾病进展和患者分层。例如,NHIRD可以捕获全国范围内类风湿关节炎发病率和管理的趋势,而TriNetX可以对明确定义的患者亚组的治疗依从性和临床结果进行细致的评估。通过整合这两个数据源,研究人员可以将NHIRD的人群水平广度与TriNetX的临床深度结合起来,采用交叉验证来证实关键发现,或使用混合建模方法来探索不同队列之间的差异和结果。然而,由于原始数据的有效性差异和不同定义,不同索赔要求或数据库之间的研究组合仍然存在局限性。此外,先进的机器学习或人工智能方法可以增强数据集成和分析,揭示风湿病治疗的新模式,指导精准医学,并为循证临床和政策决策提供信息。推进风湿病研究需要创新的策略来应对这些疾病的复杂性。NHIRD擅长于人口水平的分析,而TriNetX提供细致的临床见解,使他们的整合成为全面调查的有力工具。结合这些资源,使研究人员能够更深入地了解疾病机制、治疗结果和医疗保健系统。EHR数据库与基于人群的数据库结合,从全视角分析,缓解了外部效度、误分类、信息、选择偏差等问题。这种综合方法有可能改变风湿病研究,促进个性化、数据驱动的护理策略的发展。和p.c.s.:写作-原稿。S.B.Y, e.c.c - c.l:审查和编辑。s.b.y., p.c.s.:写作-评论和编辑。作者声明无利益冲突。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
CiteScore
3.70
自引率
4.00%
发文量
362
审稿时长
1 months
期刊介绍: The International Journal of Rheumatic Diseases (formerly APLAR Journal of Rheumatology) is the official journal of the Asia Pacific League of Associations for Rheumatology. The Journal accepts original articles on clinical or experimental research pertinent to the rheumatic diseases, work on connective tissue diseases and other immune and allergic disorders. The acceptance criteria for all papers are the quality and originality of the research and its significance to our readership. Except where otherwise stated, manuscripts are peer reviewed by two anonymous reviewers and the Editor.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信