Development and evaluation of codelists for identifying marginalised groups in primary care

medRxiv - Primary Care Research Pub Date : 2024-09-13 DOI:10.1101/2024.09.11.24313391

Tetyana Perchyk, Isabella de Vere Hunt, Brian D Nicholson, Luke Mounce, Kate Sykes, Yoryos Lyratzopoulos, Agnieszka Lemanska, Katriina L Whitaker, Robert S Kerrison

{"title":"Development and evaluation of codelists for identifying marginalised groups in primary care","authors":"Tetyana Perchyk, Isabella de Vere Hunt, Brian D Nicholson, Luke Mounce, Kate Sykes, Yoryos Lyratzopoulos, Agnieszka Lemanska, Katriina L Whitaker, Robert S Kerrison","doi":"10.1101/2024.09.11.24313391","DOIUrl":null,"url":null,"abstract":"Background. Primary care electronic health records provide a rich source of information for inequalities research. However, the reliability and validity of the research derived from these records depends on the completeness and resolution of the codelists used to identify marginalised populations. Aim. The aim of this project was to develop comprehensive codelists for identifying ethnic minorities, people with learning disabilities (LD), people with severe mental illness (SMI) and people who are transgender. Design and setting. This study was a codelist development project, conducted using primary care data from the United Kingdom. Method. Groups of interest were defined a priori. Relevant clinical codes were identified by searching Clinical Practice Research Datalink (CPRD) publications, codelist repositories and the CPRD code browser. Relevant codelists were downloaded and merged according to marginalised group. Duplicates were removed and remaining codes reviewed by two general practitioners. Comprehensiveness was assessed in a representative CPRD population of 10,966,759 people, by comparing the frequencies of individuals identified when using the curated codelists, compared to commonly used alternatives. Results. A total of 52 codelists were identified. 1,420 unique codes were selected after removal of duplicates and GP review. Compared with comparator codelists, an additional 48,017 (76.6%), 52,953 (68.9%) and 508 (36.9%) people with a LD, SMI or transgender code were identified. The frequencies identified for ethnicity were consistent with expectations for the UK population. Conclusion. The codelists curated through this project will improve inequalities research by improving standards of identifying marginalised groups in primary care data.","PeriodicalId":501023,"journal":{"name":"medRxiv - Primary Care Research","volume":"33 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"medRxiv - Primary Care Research","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1101/2024.09.11.24313391","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Background. Primary care electronic health records provide a rich source of information for inequalities research. However, the reliability and validity of the research derived from these records depends on the completeness and resolution of the codelists used to identify marginalised populations. Aim. The aim of this project was to develop comprehensive codelists for identifying ethnic minorities, people with learning disabilities (LD), people with severe mental illness (SMI) and people who are transgender. Design and setting. This study was a codelist development project, conducted using primary care data from the United Kingdom. Method. Groups of interest were defined a priori. Relevant clinical codes were identified by searching Clinical Practice Research Datalink (CPRD) publications, codelist repositories and the CPRD code browser. Relevant codelists were downloaded and merged according to marginalised group. Duplicates were removed and remaining codes reviewed by two general practitioners. Comprehensiveness was assessed in a representative CPRD population of 10,966,759 people, by comparing the frequencies of individuals identified when using the curated codelists, compared to commonly used alternatives. Results. A total of 52 codelists were identified. 1,420 unique codes were selected after removal of duplicates and GP review. Compared with comparator codelists, an additional 48,017 (76.6%), 52,953 (68.9%) and 508 (36.9%) people with a LD, SMI or transgender code were identified. The frequencies identified for ethnicity were consistent with expectations for the UK population. Conclusion. The codelists curated through this project will improve inequalities research by improving standards of identifying marginalised groups in primary care data.

查看原文本刊更多论文

开发和评估用于识别初级保健中边缘化群体的代码表

背景。初级保健电子健康记录为不平等现象研究提供了丰富的信息来源。然而，从这些记录中得出的研究结果的可靠性和有效性取决于用于识别边缘化人群的代码表的完整性和分辨率。目标本项目旨在开发用于识别少数民族、学习障碍（LD）患者、严重精神疾病（SMI）患者和变性人的综合代码表。设计与环境。本研究是一个代码表开发项目，利用英国的初级保健数据进行。方法。预先定义感兴趣的群体。通过搜索临床实践研究数据链 (CPRD) 出版物、代码库和 CPRD 代码浏览器确定相关临床代码。下载相关代码表，并根据边缘化群体进行合并。删除重复代码，剩余代码由两名全科医生审核。在具有代表性的 CPRD 10,966,759 人中，通过比较使用经整理的代码表和常用替代代码表所识别的个体频率，对全面性进行评估。结果。共确定了 52 个代码表。在去除重复代码和 GP 审查后，选出了 1,420 个唯一代码。与参照代码表相比，分别增加了 48,017 人（76.6%）、52,953 人（68.9%）和 508 人（36.9%）具有 LD、SMI 或变性代码。所确定的种族频率与对英国人口的预期一致。结论通过该项目编制的代码表将提高初级保健数据中边缘群体的识别标准，从而改进不平等现象的研究。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

medRxiv - Primary Care Research

自引率

0.00%

发文量