Pooling Biospecimens for Efficient Exposure Assessment When Using Case-Cohort Analysis in Cohort Studies.

IF 10.1 1区 环境科学与生态学 Q1 ENVIRONMENTAL SCIENCES
Environmental Health Perspectives Pub Date : 2024-12-01 Epub Date: 2024-12-24 DOI:10.1289/EHP14476
Min Shi, David M Umbach, Clarice R Weinberg
{"title":"Pooling Biospecimens for Efficient Exposure Assessment When Using Case-Cohort Analysis in Cohort Studies.","authors":"Min Shi, David M Umbach, Clarice R Weinberg","doi":"10.1289/EHP14476","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Large prospective cohort studies have been fruitful for identifying exposure-disease associations. In a cohort where biospecimens (e.g., blood, urine) were collected at enrollment, analysts can exploit a case-cohort approach: Biospecimens from a random sample of cohort participants, called the \"subcohort,\" plus a sample of incident cases that were not part of the subcohort are assayed. Reusing subcohort data for multiple disease outcomes can reduce costs and conserve specimen archives. Pooling biospecimen samples before assay could both save money and reduce depletion of the archive but has not been studied for cohort studies.</p><p><strong>Objectives: </strong>We develop and evaluate a biospecimen pooling strategy for case-cohort analyses that relate an exposure to risk of a rare disease.</p><p><strong>Methods: </strong>Our approach involves constructing pooling sets for cases not in the subcohort after grouping them according to time of diagnosis (e.g., age). In contrast, members of the subcohort are grouped by age at entry before constructing pooling sets. The analyst then fits a logistic regression model that jointly stratifies by age at risk and pooling set size and adjusts for confounders. We used simulations (288 sampling scenarios with 1,000 simulated datasets each) to evaluate the performance of this approach for several sizes of pooling sets and illustrated its application to environmental epidemiologic studies by reanalyzing Sister Study data.</p><p><strong>Results: </strong>Parameter estimates were nearly unbiased, and 95% confidence intervals constructed using a bootstrap estimate of the standard error performed well. In statistical tests also based on the bootstrap standard error, pooling up to 8 specimens per pool caused only modest loss of power. Assigning more cohort members to the subcohort and commensurately increasing the number of specimens per pool improved power and precision substantially while reducing the number of assays.</p><p><strong>Discussion: </strong>When using case-cohort analysis to study disease outcomes in relation to exposures assessed using biospecimens in a cohort study, epidemiologists should consider biospecimen pooling as a way to improve statistical power, conserve irreplaceable archives, and save money. https://doi.org/10.1289/EHP14476.</p>","PeriodicalId":11862,"journal":{"name":"Environmental Health Perspectives","volume":"132 12","pages":"127004"},"PeriodicalIF":10.1000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11668240/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Health Perspectives","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1289/EHP14476","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/24 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0

Abstract

Background: Large prospective cohort studies have been fruitful for identifying exposure-disease associations. In a cohort where biospecimens (e.g., blood, urine) were collected at enrollment, analysts can exploit a case-cohort approach: Biospecimens from a random sample of cohort participants, called the "subcohort," plus a sample of incident cases that were not part of the subcohort are assayed. Reusing subcohort data for multiple disease outcomes can reduce costs and conserve specimen archives. Pooling biospecimen samples before assay could both save money and reduce depletion of the archive but has not been studied for cohort studies.

Objectives: We develop and evaluate a biospecimen pooling strategy for case-cohort analyses that relate an exposure to risk of a rare disease.

Methods: Our approach involves constructing pooling sets for cases not in the subcohort after grouping them according to time of diagnosis (e.g., age). In contrast, members of the subcohort are grouped by age at entry before constructing pooling sets. The analyst then fits a logistic regression model that jointly stratifies by age at risk and pooling set size and adjusts for confounders. We used simulations (288 sampling scenarios with 1,000 simulated datasets each) to evaluate the performance of this approach for several sizes of pooling sets and illustrated its application to environmental epidemiologic studies by reanalyzing Sister Study data.

Results: Parameter estimates were nearly unbiased, and 95% confidence intervals constructed using a bootstrap estimate of the standard error performed well. In statistical tests also based on the bootstrap standard error, pooling up to 8 specimens per pool caused only modest loss of power. Assigning more cohort members to the subcohort and commensurately increasing the number of specimens per pool improved power and precision substantially while reducing the number of assays.

Discussion: When using case-cohort analysis to study disease outcomes in relation to exposures assessed using biospecimens in a cohort study, epidemiologists should consider biospecimen pooling as a way to improve statistical power, conserve irreplaceable archives, and save money. https://doi.org/10.1289/EHP14476.

在队列研究中使用病例队列分析时,汇集生物标本进行有效的暴露评估。
背景:大型前瞻性队列研究在确定暴露-疾病关联方面取得了丰硕成果。在入组时收集生物标本(如血液、尿液)的队列中,分析人员可以采用病例队列方法:从队列参与者的随机样本(称为“亚队列”)中提取生物标本,再加上不属于亚队列的事件病例样本进行分析。对多种疾病结果重复使用亚队列数据可以降低成本并保存标本档案。在分析前汇集生物标本既可以节省资金,又可以减少档案的消耗,但尚未对队列研究进行研究。目的:我们开发和评估与罕见疾病暴露风险相关的病例队列分析的生物标本汇集策略。方法:我们的方法包括根据诊断时间(如年龄)分组后,为不在亚队列中的病例构建池集。相比之下,子队列的成员在构建池集之前按年龄分组。然后,分析师拟合一个逻辑回归模型,该模型根据风险年龄和池集大小共同分层,并根据混杂因素进行调整。我们使用模拟(288个采样场景,每个模拟数据集为1000个)来评估该方法在不同规模池集中的性能,并通过重新分析姊妹研究数据来说明其在环境流行病学研究中的应用。结果:参数估计几乎是无偏的,使用标准误差的自举估计构建的95%置信区间表现良好。在同样基于自举标准误差的统计测试中,每池池最多8个样本只会造成适度的功率损失。将更多的队列成员分配到亚队列,并相应地增加每个池的标本数量,大大提高了功率和精度,同时减少了检测次数。讨论:当使用病例队列分析来研究与队列研究中使用生物标本评估的暴露相关的疾病结果时,流行病学家应该考虑将生物标本汇集作为提高统计能力、保存不可替代的档案和节省资金的一种方法。https://doi.org/10.1289/EHP14476。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Environmental Health Perspectives
Environmental Health Perspectives 环境科学-公共卫生、环境卫生与职业卫生
CiteScore
14.40
自引率
2.90%
发文量
388
审稿时长
6 months
期刊介绍: Environmental Health Perspectives (EHP) is a monthly peer-reviewed journal supported by the National Institute of Environmental Health Sciences, part of the National Institutes of Health under the U.S. Department of Health and Human Services. Its mission is to facilitate discussions on the connections between the environment and human health by publishing top-notch research and news. EHP ranks third in Public, Environmental, and Occupational Health, fourth in Toxicology, and fifth in Environmental Sciences.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信