A data quality assurance process to improve the precision of analysis of routinely collected administrative data for the NHS (National Health Service) UK.

IF 2.2 3区 医学 Q2 HEALTH CARE SCIENCES & SERVICES
Health Informatics Journal Pub Date : 2025-04-01 Epub Date: 2025-05-21 DOI:10.1177/14604582251334338
Robert M Cook, Alisen Dube, Md Asaduzzaman, Tim Beales, Ross Pearce, Luke Blackwell, Claire Whitehouse, Joshua Miller, Malcolm Gough, Mark Radford, Alison Leary, Sarahjane Jones
{"title":"A data quality assurance process to improve the precision of analysis of routinely collected administrative data for the NHS (National Health Service) UK.","authors":"Robert M Cook, Alisen Dube, Md Asaduzzaman, Tim Beales, Ross Pearce, Luke Blackwell, Claire Whitehouse, Joshua Miller, Malcolm Gough, Mark Radford, Alison Leary, Sarahjane Jones","doi":"10.1177/14604582251334338","DOIUrl":null,"url":null,"abstract":"<p><p><b>Objective:</b> This paper demonstrates a data quality assurance (DQA) process as a means to identify and handle flaws in data, and hence improve the accuracy of an investigation into the prevalence of harmful versus non-harmful/near-miss incident reports in a single NHS acute provider.<b>Methods:</b> The three-step DQA process consists of an initial univariate data quality analysis, followed by a bivariate missingness analysis, and concluding with the design of appropriate multiple imputation techniques. With data quality established, the acuity and incident data were aggregated and aligned to the Ward-Month level for the period August 2015 to December 2020 inclusive. The final analysis was performed using binary regression, pooling results via Reuben's Rule.<b>Results:</b> The application of our three-step quality assurance process was able to detect and correct for common data quality issues. The resulting analysis identified a Ward dependency for the effect of Covid-19 lockdown measures on incident reporting culture which would have been missed without the applied imputation strategy.<b>Conclusions:</b> Our approach outlines a replicable methodology for understanding and fixing data quality issues in operational data. As daily operational decisions are being guided by data, it is important to leverage appropriate imputation techniques and ensure an optimal decision is reached.</p>","PeriodicalId":55069,"journal":{"name":"Health Informatics Journal","volume":"31 2","pages":"14604582251334338"},"PeriodicalIF":2.2000,"publicationDate":"2025-04-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Health Informatics Journal","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1177/14604582251334338","RegionNum":3,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2025/5/21 0:00:00","PubModel":"Epub","JCR":"Q2","JCRName":"HEALTH CARE SCIENCES & SERVICES","Score":null,"Total":0}
引用次数: 0

Abstract

Objective: This paper demonstrates a data quality assurance (DQA) process as a means to identify and handle flaws in data, and hence improve the accuracy of an investigation into the prevalence of harmful versus non-harmful/near-miss incident reports in a single NHS acute provider.Methods: The three-step DQA process consists of an initial univariate data quality analysis, followed by a bivariate missingness analysis, and concluding with the design of appropriate multiple imputation techniques. With data quality established, the acuity and incident data were aggregated and aligned to the Ward-Month level for the period August 2015 to December 2020 inclusive. The final analysis was performed using binary regression, pooling results via Reuben's Rule.Results: The application of our three-step quality assurance process was able to detect and correct for common data quality issues. The resulting analysis identified a Ward dependency for the effect of Covid-19 lockdown measures on incident reporting culture which would have been missed without the applied imputation strategy.Conclusions: Our approach outlines a replicable methodology for understanding and fixing data quality issues in operational data. As daily operational decisions are being guided by data, it is important to leverage appropriate imputation techniques and ensure an optimal decision is reached.

数据质量保证程序,以提高对英国国民保健服务(NHS)例行收集的行政数据的分析精度。
目的:本文展示了数据质量保证(DQA)过程作为识别和处理数据缺陷的一种手段,从而提高了对单一NHS急性提供者中有害与无害/未遂事件报告患病率调查的准确性。方法:三步DQA过程包括初始的单变量数据质量分析,然后是双变量缺失分析,最后设计适当的多重插入技术。在确定数据质量后,对2015年8月至2020年12月期间的锐度和事故数据进行汇总,并与Ward-Month水平保持一致。最后的分析采用二元回归,通过鲁本规则汇总结果。结果:应用我们的三步质量保证流程能够检测和纠正常见的数据质量问题。由此产生的分析确定了Covid-19封锁措施对事件报告文化影响的病房依赖性,如果没有应用归咎策略,这将被忽略。结论:我们的方法概述了一种可复制的方法,用于理解和修复操作数据中的数据质量问题。由于日常运营决策是由数据指导的,因此利用适当的归算技术并确保达成最佳决策非常重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Health Informatics Journal
Health Informatics Journal HEALTH CARE SCIENCES & SERVICES-MEDICAL INFORMATICS
CiteScore
7.80
自引率
6.70%
发文量
80
审稿时长
6 months
期刊介绍: Health Informatics Journal is an international peer-reviewed journal. All papers submitted to Health Informatics Journal are subject to peer review by members of a carefully appointed editorial board. The journal operates a conventional single-blind reviewing policy in which the reviewer’s name is always concealed from the submitting author.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信