Estimating Disease Prevalence in Administrative Data.

IF 1.2 4区 医学 Q4 MEDICINE, RESEARCH & EXPERIMENTAL
Jacek A Kopec
{"title":"Estimating Disease Prevalence in Administrative Data.","authors":"Jacek A Kopec","doi":"10.25011/cim.v45i2.38100","DOIUrl":null,"url":null,"abstract":"<p><strong>Purpose: </strong>Disease prevalence estimates from population-based administrative databases are often biased due to measurement (misclassification) errors. The purpose of this article is to review the methodology for estimating disease prevalence in administrative data, with a focus on bias correction.</p><p><strong>Source: </strong>Several approaches to bias correction in administrative data were reviewed and application of these methods was demonstrated using an example from the literature: physician claims and hospitalization data were employed to estimate diabetes prevalence in Ontario, Canada.</p><p><strong>Findings: </strong>Misclassification bias in prevalence estimates from administrative data can be reduced by developing and selecting an optimal algorithm for case identification, applying a bias correction formula, or using statistical modelling. An algorithm for which sensitivity equals positive predictive value provides an unbiased estimate of prevalence. Bias reduction methods generally require information about the measurement properties of the algorithm, such as sensitivity, specificity, or predictive value. These properties depend on disease type, prevalence, algorithm definition (including the observation window), and may vary by population and time. Prevalence estimates can be improved by applying multivariable disease prediction models.</p><p><strong>Conclusion: </strong>Frequency of a positive case identification algorithm in administrative data is generally not equivalent to disease prevalence. Although prevalence estimates can be corrected for bias using known measurement properties of the algorithm, these properties may be difficult to estimate accurately; therefore, disease prevalence estimates based on administrative data must be treated with caution.</p>","PeriodicalId":50683,"journal":{"name":"Clinical and Investigative Medicine","volume":null,"pages":null},"PeriodicalIF":1.2000,"publicationDate":"2022-06-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Clinical and Investigative Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.25011/cim.v45i2.38100","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q4","JCRName":"MEDICINE, RESEARCH & EXPERIMENTAL","Score":null,"Total":0}
引用次数: 1

Abstract

Purpose: Disease prevalence estimates from population-based administrative databases are often biased due to measurement (misclassification) errors. The purpose of this article is to review the methodology for estimating disease prevalence in administrative data, with a focus on bias correction.

Source: Several approaches to bias correction in administrative data were reviewed and application of these methods was demonstrated using an example from the literature: physician claims and hospitalization data were employed to estimate diabetes prevalence in Ontario, Canada.

Findings: Misclassification bias in prevalence estimates from administrative data can be reduced by developing and selecting an optimal algorithm for case identification, applying a bias correction formula, or using statistical modelling. An algorithm for which sensitivity equals positive predictive value provides an unbiased estimate of prevalence. Bias reduction methods generally require information about the measurement properties of the algorithm, such as sensitivity, specificity, or predictive value. These properties depend on disease type, prevalence, algorithm definition (including the observation window), and may vary by population and time. Prevalence estimates can be improved by applying multivariable disease prediction models.

Conclusion: Frequency of a positive case identification algorithm in administrative data is generally not equivalent to disease prevalence. Although prevalence estimates can be corrected for bias using known measurement properties of the algorithm, these properties may be difficult to estimate accurately; therefore, disease prevalence estimates based on administrative data must be treated with caution.

估算行政数据中的疾病患病率。
目的:基于人群的行政数据库的疾病患病率估计常常由于测量(误分类)错误而有偏差。本文的目的是回顾估算行政数据中疾病患病率的方法,重点是偏差校正。资料来源:本文回顾了几种对行政数据进行偏倚校正的方法,并以文献中的一个例子说明了这些方法的应用:采用医生索赔和住院数据来估计加拿大安大略省的糖尿病患病率。研究结果:通过开发和选择病例识别的最佳算法、应用偏差校正公式或使用统计建模,可以减少行政数据中患病率估计的错误分类偏差。灵敏度等于阳性预测值的算法提供了对患病率的无偏估计。偏倚减少方法通常需要有关算法测量特性的信息,例如灵敏度、特异性或预测值。这些属性取决于疾病类型、患病率、算法定义(包括观察窗口),并可能随人群和时间而变化。应用多变量疾病预测模型可以改进患病率估计。结论:行政数据中阳性病例识别算法的频率通常不等同于疾病患病率。虽然患病率估计可以使用算法的已知测量属性来纠正偏差,但这些属性可能难以准确估计;因此,必须谨慎对待基于行政数据的疾病患病率估计。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Clinical and Investigative Medicine
Clinical and Investigative Medicine 医学-医学:研究与实验
CiteScore
1.50
自引率
12.50%
发文量
18
审稿时长
>12 weeks
期刊介绍: Clinical and Investigative Medicine (CIM), publishes original work in the field of Clinical Investigation. Original work includes clinical or laboratory investigations and clinical reports. Reviews include information for Continuing Medical Education (CME), narrative review articles, systematic reviews, and meta-analyses.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信