Defining phenotypes of disease severity for long-term cardiovascular, renal, metabolic, and mental health conditions in primary care electronic health records: A mixed-methods study using the nominal group technique
IF 4 2区 医学Q2 COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS
Jennifer Cooper , Thomas Jackson , Shamil Haroon , Francesca L. Crowe , Eleanor Hathaway , Leah Fitzsimmons , Krishnarajah Nirantharakumar
{"title":"Defining phenotypes of disease severity for long-term cardiovascular, renal, metabolic, and mental health conditions in primary care electronic health records: A mixed-methods study using the nominal group technique","authors":"Jennifer Cooper , Thomas Jackson , Shamil Haroon , Francesca L. Crowe , Eleanor Hathaway , Leah Fitzsimmons , Krishnarajah Nirantharakumar","doi":"10.1016/j.jbi.2025.104831","DOIUrl":null,"url":null,"abstract":"<div><h3>Objective</h3><div>Inclusion of severity measures for long-term conditions (LTC) could improve prediction models for multiple long-term conditions (MLTC) but some severity measures have limited availability in electronic health records (EHR). We aimed to develop consensus on feasible severity phenotypes for nine cardio-renal-metabolic and mental health conditions.</div></div><div><h3>Methods</h3><div>This was a mixed-methods study using novel methodology. From existing literature, we identified potential severity phenotypes and explored feasibility of their use in EHR through analysis of data from 31 randomly selected general practices in the Clinical Practice Research Datalink (CPRD) Aurum database, a large UK-based primary care EHR database. We recruited clinical academic experts to participate in a survey and nominal group technique workshop. Participants used a Likert scale to rate clinical importance and feasibility for each severity phenotype independently (informed by the exploratory analysis). For the optimal severity phenotype (highest combined score) for each condition, adjusted hazard ratios (aHR) of five-year mortality were calculated using Cox regression on the full CPRD database.</div></div><div><h3>Results</h3><div>Fifteen existing severity indexes for nine conditions informed the survey. Eighteen clinical academics participated in the survey, twelve also participated in the workshops. Combined mean scores for clinical importance and feasibility were highest for estimated glomerular filtration rate (eGFR) for chronic kidney disease (CKD) (9.42/10) and for microvascular complications of diabetes (9.08/10). Mortality was higher for each reduction in eGFR stage; Stage 3b aHR 1.42, 95 %CI 1.41–1.44 versus Stage 3a CKD and for each additional microvascular complication of diabetes; one complication aHR 1.44, 95 %CI 1.32–1.57 versus none. Some phenotypes (e.g., aneurysm diameter) were not well recorded within the database and could not feasibly be applied.</div></div><div><h3>Conclusion</h3><div>We developed a methodology for identifying severity phenotypes in EHRs. Severity phenotypes were identified for diabetes (type 1 and 2), ischaemic heart disease, CKD and peripheral vascular disease. Data quality in EHR should be improved for under-recorded severity measures.</div></div>","PeriodicalId":15263,"journal":{"name":"Journal of Biomedical Informatics","volume":"166 ","pages":"Article 104831"},"PeriodicalIF":4.0000,"publicationDate":"2025-04-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Biomedical Informatics","FirstCategoryId":"3","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S1532046425000607","RegionNum":2,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"COMPUTER SCIENCE, INTERDISCIPLINARY APPLICATIONS","Score":null,"Total":0}
引用次数: 0
Abstract
Objective
Inclusion of severity measures for long-term conditions (LTC) could improve prediction models for multiple long-term conditions (MLTC) but some severity measures have limited availability in electronic health records (EHR). We aimed to develop consensus on feasible severity phenotypes for nine cardio-renal-metabolic and mental health conditions.
Methods
This was a mixed-methods study using novel methodology. From existing literature, we identified potential severity phenotypes and explored feasibility of their use in EHR through analysis of data from 31 randomly selected general practices in the Clinical Practice Research Datalink (CPRD) Aurum database, a large UK-based primary care EHR database. We recruited clinical academic experts to participate in a survey and nominal group technique workshop. Participants used a Likert scale to rate clinical importance and feasibility for each severity phenotype independently (informed by the exploratory analysis). For the optimal severity phenotype (highest combined score) for each condition, adjusted hazard ratios (aHR) of five-year mortality were calculated using Cox regression on the full CPRD database.
Results
Fifteen existing severity indexes for nine conditions informed the survey. Eighteen clinical academics participated in the survey, twelve also participated in the workshops. Combined mean scores for clinical importance and feasibility were highest for estimated glomerular filtration rate (eGFR) for chronic kidney disease (CKD) (9.42/10) and for microvascular complications of diabetes (9.08/10). Mortality was higher for each reduction in eGFR stage; Stage 3b aHR 1.42, 95 %CI 1.41–1.44 versus Stage 3a CKD and for each additional microvascular complication of diabetes; one complication aHR 1.44, 95 %CI 1.32–1.57 versus none. Some phenotypes (e.g., aneurysm diameter) were not well recorded within the database and could not feasibly be applied.
Conclusion
We developed a methodology for identifying severity phenotypes in EHRs. Severity phenotypes were identified for diabetes (type 1 and 2), ischaemic heart disease, CKD and peripheral vascular disease. Data quality in EHR should be improved for under-recorded severity measures.
期刊介绍:
The Journal of Biomedical Informatics reflects a commitment to high-quality original research papers, reviews, and commentaries in the area of biomedical informatics methodology. Although we publish articles motivated by applications in the biomedical sciences (for example, clinical medicine, health care, population health, and translational bioinformatics), the journal emphasizes reports of new methodologies and techniques that have general applicability and that form the basis for the evolving science of biomedical informatics. Articles on medical devices; evaluations of implemented systems (including clinical trials of information technologies); or papers that provide insight into a biological process, a specific disease, or treatment options would generally be more suitable for publication in other venues. Papers on applications of signal processing and image analysis are often more suitable for biomedical engineering journals or other informatics journals, although we do publish papers that emphasize the information management and knowledge representation/modeling issues that arise in the storage and use of biological signals and images. System descriptions are welcome if they illustrate and substantiate the underlying methodology that is the principal focus of the report and an effort is made to address the generalizability and/or range of application of that methodology. Note also that, given the international nature of JBI, papers that deal with specific languages other than English, or with country-specific health systems or approaches, are acceptable for JBI only if they offer generalizable lessons that are relevant to the broad JBI readership, regardless of their country, language, culture, or health system.