CohortCharacteristics: an R package for population characterisation in observational studies using the OMOP common data model.

IF 5.9 1区 医学 Q1 PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH
Mike Du, Albert Prats-Uribe, Núria Mercadé-Besora, Kim Lopez-Guell, Yuchen Guo, Marta Alcalde-Herraiz, Xihang Chen, Antonella Delmestri, Wai Yi Man, Talita Duarte-Salles, Anna Palomar, Agustina Giuliodori, Emanuel Brađašević, Antea Jezidžić, Elvira Bräuner, Susanne Bruun, Katia Verhamme, Mees Mosseveld, James T Brash, Dina Vojinovic, Isabella Kaczmarczyk, Akram Mendez, Peter Rijnbeek, Daniel Prieto-Alhambra, Edward Burn, Martí Català
{"title":"CohortCharacteristics: an R package for population characterisation in observational studies using the OMOP common data model.","authors":"Mike Du, Albert Prats-Uribe, Núria Mercadé-Besora, Kim Lopez-Guell, Yuchen Guo, Marta Alcalde-Herraiz, Xihang Chen, Antonella Delmestri, Wai Yi Man, Talita Duarte-Salles, Anna Palomar, Agustina Giuliodori, Emanuel Brađašević, Antea Jezidžić, Elvira Bräuner, Susanne Bruun, Katia Verhamme, Mees Mosseveld, James T Brash, Dina Vojinovic, Isabella Kaczmarczyk, Akram Mendez, Peter Rijnbeek, Daniel Prieto-Alhambra, Edward Burn, Martí Català","doi":"10.1007/s10654-025-01352-4","DOIUrl":null,"url":null,"abstract":"<p><p>Describing cohort characterisation ensures comparability and reproducibility in multi-database observational studies. To address this need, we developed CohortCharacteristics, an open-source R package that facilitates standardised cohort characterisation in datasets mapped to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). This study aims to explain the development of the package and demonstrate its core functionality. We developed CohortCharacteristics, an open-source R package that can perform cohort characterisation for various types of databases. To demonstrate its functionality, we then used CohortCharacteristics to generate descriptive statistics on demographics, comorbidities, medication exposures, cohort overlap, and timing of cohort entries. The study included data from CPRD GOLD (UK), DK-DHR (Denmark), IPCI (Netherlands), IQVIA Longitudinal Patient Database Belgium (IQVIA LPD Belgium), IQVIA DA Germany, NAJS (Croatia), and SIDIAP (Spain), all mapped to the OMOP CDM. The CohortCharacteristics R package is freely available on CRAN with detailed vignettes and documentation on its functionality. Cohort characteristics were generally consistent across databases, with similar age distributions and female representation. CPRD GOLD, NAJS, and SIDIAP exhibited higher prescribing rates for respiratory, cardiovascular, and nervous system medications, while IQVIA databases and DK-DHR reported lower rates. Timing analysis showed that dementia diagnoses typically followed insomnia diagnoses in several databases, supporting existing literature. Antipsychotic prescriptions often occurred after dementia diagnosis, reflecting prescribing practices aligned with clinical guidelines. CohortCharacteristics enables consistent cohort characterisation across a network of data mapped to the OMOP CDM, thereby improving transparency in multi-database research. The package's functionality, demonstrated in this study, illustrates its applicability in observational studies with OMOP CDM data.</p>","PeriodicalId":11907,"journal":{"name":"European Journal of Epidemiology","volume":" ","pages":""},"PeriodicalIF":5.9000,"publicationDate":"2026-04-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"European Journal of Epidemiology","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1007/s10654-025-01352-4","RegionNum":1,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"PUBLIC, ENVIRONMENTAL & OCCUPATIONAL HEALTH","Score":null,"Total":0}
引用次数: 0

Abstract

Describing cohort characterisation ensures comparability and reproducibility in multi-database observational studies. To address this need, we developed CohortCharacteristics, an open-source R package that facilitates standardised cohort characterisation in datasets mapped to the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM). This study aims to explain the development of the package and demonstrate its core functionality. We developed CohortCharacteristics, an open-source R package that can perform cohort characterisation for various types of databases. To demonstrate its functionality, we then used CohortCharacteristics to generate descriptive statistics on demographics, comorbidities, medication exposures, cohort overlap, and timing of cohort entries. The study included data from CPRD GOLD (UK), DK-DHR (Denmark), IPCI (Netherlands), IQVIA Longitudinal Patient Database Belgium (IQVIA LPD Belgium), IQVIA DA Germany, NAJS (Croatia), and SIDIAP (Spain), all mapped to the OMOP CDM. The CohortCharacteristics R package is freely available on CRAN with detailed vignettes and documentation on its functionality. Cohort characteristics were generally consistent across databases, with similar age distributions and female representation. CPRD GOLD, NAJS, and SIDIAP exhibited higher prescribing rates for respiratory, cardiovascular, and nervous system medications, while IQVIA databases and DK-DHR reported lower rates. Timing analysis showed that dementia diagnoses typically followed insomnia diagnoses in several databases, supporting existing literature. Antipsychotic prescriptions often occurred after dementia diagnosis, reflecting prescribing practices aligned with clinical guidelines. CohortCharacteristics enables consistent cohort characterisation across a network of data mapped to the OMOP CDM, thereby improving transparency in multi-database research. The package's functionality, demonstrated in this study, illustrates its applicability in observational studies with OMOP CDM data.

CohortCharacteristics:一个R软件包,用于使用OMOP通用数据模型的观察性研究中的人群特征。
描述队列特征确保了多数据库观察性研究的可比性和可重复性。为了满足这一需求,我们开发了CohortCharacteristics,这是一个开源的R软件包,可以促进映射到观察性医疗结果合作伙伴关系(OMOP)公共数据模型(CDM)的数据集中的标准化队列特征。本研究旨在解释该软件包的发展,并展示其核心功能。我们开发了CohortCharacteristics,这是一个开源的R包,可以为各种类型的数据库执行队列特征。为了证明其功能,我们使用队列特征来生成人口统计学、合并症、药物暴露、队列重叠和队列进入时间的描述性统计数据。该研究包括来自CPRD GOLD(英国)、DK-DHR(丹麦)、IPCI(荷兰)、IQVIA比利时纵向患者数据库(IQVIA LPD比利时)、IQVIA DA德国、NAJS(克罗地亚)和SIDIAP(西班牙)的数据,所有数据都映射到OMOP CDM。CohortCharacteristics R包在CRAN上免费提供,并提供了详细的功能说明和文档。各数据库的队列特征基本一致,年龄分布和女性代表性相似。CPRD GOLD、NAJS和SIDIAP显示呼吸、心血管和神经系统药物的处方率较高,而IQVIA数据库和DK-DHR报告的处方率较低。时间分析显示,在几个数据库中,痴呆诊断通常紧随失眠诊断,支持现有文献。抗精神病药物的处方通常发生在痴呆诊断后,这反映了与临床指南一致的处方实践。CohortCharacteristics能够在映射到OMOP CDM的数据网络中实现一致的队列特征,从而提高多数据库研究的透明度。该包的功能,在本研究中证明,说明了它在OMOP CDM数据的观察性研究中的适用性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
European Journal of Epidemiology
European Journal of Epidemiology 医学-公共卫生、环境卫生与职业卫生
CiteScore
21.40
自引率
1.50%
发文量
109
审稿时长
6-12 weeks
期刊介绍: The European Journal of Epidemiology, established in 1985, is a peer-reviewed publication that provides a platform for discussions on epidemiology in its broadest sense. It covers various aspects of epidemiologic research and statistical methods. The journal facilitates communication between researchers, educators, and practitioners in epidemiology, including those in clinical and community medicine. Contributions from diverse fields such as public health, preventive medicine, clinical medicine, health economics, and computational biology and data science, in relation to health and disease, are encouraged. While accepting submissions from all over the world, the journal particularly emphasizes European topics relevant to epidemiology. The published articles consist of empirical research findings, developments in methodology, and opinion pieces.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信
小红书