Representative pure risk estimation by using data from epidemiologic studies, surveys, and registries: estimating risks for minority subgroups

Lingxiao Wang, Yan Li, Barry I Graubard, Hormuzd A Katki
{"title":"Representative pure risk estimation by using data from epidemiologic studies, surveys, and registries: estimating risks for minority subgroups","authors":"Lingxiao Wang, Yan Li, Barry I Graubard, Hormuzd A Katki","doi":"10.1093/jrsssa/qnad124","DOIUrl":null,"url":null,"abstract":"Abstract Representative risk estimation is fundamental to clinical decision-making. However, risks are often estimated from non-representative epidemiologic studies, which usually under-represent minorities. Model-based methods use population registries to improve external validity of risk estimation but assume hazard ratios are generalisable from samples to the target finite population. ‘Pseudoweighting’ methods improve representativeness of studies by using an external probability-based survey as the reference, but the resulting estimators can be biased due to propensity model misspecification and inefficient due to highly variable pseudoweights or small sample sizes of minorities in the cohort and/or survey. We propose a two-step pseudoweighting procedure that post-stratifies the event rates among age/race/sex strata in the pseudoweighted cohort to the population rates, to produce efficient and robust pure risk estimation (i.e. a cause-specific absolute risk in the absence of competing events). For developing an all-cause mortality risk model representative for the USA, our findings suggest that hazard ratios for minorities are not generalisable, and that surveys can have inadequate numbers of events for minorities. Post-stratification on event rates is crucial for obtaining reliable risk estimation for minority subgroups.","PeriodicalId":49985,"journal":{"name":"Journal of the Royal Statistical Society","volume":" 1203","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of the Royal Statistical Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1093/jrsssa/qnad124","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Abstract Representative risk estimation is fundamental to clinical decision-making. However, risks are often estimated from non-representative epidemiologic studies, which usually under-represent minorities. Model-based methods use population registries to improve external validity of risk estimation but assume hazard ratios are generalisable from samples to the target finite population. ‘Pseudoweighting’ methods improve representativeness of studies by using an external probability-based survey as the reference, but the resulting estimators can be biased due to propensity model misspecification and inefficient due to highly variable pseudoweights or small sample sizes of minorities in the cohort and/or survey. We propose a two-step pseudoweighting procedure that post-stratifies the event rates among age/race/sex strata in the pseudoweighted cohort to the population rates, to produce efficient and robust pure risk estimation (i.e. a cause-specific absolute risk in the absence of competing events). For developing an all-cause mortality risk model representative for the USA, our findings suggest that hazard ratios for minorities are not generalisable, and that surveys can have inadequate numbers of events for minorities. Post-stratification on event rates is crucial for obtaining reliable risk estimation for minority subgroups.
通过使用流行病学研究、调查和登记的数据进行具有代表性的纯风险估计:估计少数亚群的风险
代表性风险评估是临床决策的基础。然而,风险往往是根据非代表性的流行病学研究来估计的,这些研究通常不能充分代表少数群体。基于模型的方法使用总体注册表来提高风险估计的外部有效性,但假设风险比从样本到目标有限总体是可推广的。“伪加权”方法通过使用外部基于概率的调查作为参考来提高研究的代表性,但由于倾向模型的错误规范,以及由于伪权重的高度可变或队列和/或调查中少数群体的小样本量,由此产生的估计器可能存在偏差,效率低下。我们提出了一个两步伪加权程序,将伪加权队列中年龄/种族/性别阶层的事件发生率分层后到人口发生率,以产生有效和稳健的纯风险估计(即在没有竞争事件的情况下,特定原因的绝对风险)。为了建立一个代表美国的全因死亡率风险模型,我们的研究结果表明,少数族裔的风险比不具有普遍性,而且调查中少数族裔的事件数量可能不足。事件发生率的后分层对于获得少数亚群的可靠风险估计至关重要。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信