{"title":"Pooling Biospecimens for Efficient Exposure Assessment When Using Case-Cohort Analysis in Cohort Studies.","authors":"Min Shi, David M Umbach, Clarice R Weinberg","doi":"10.1289/EHP14476","DOIUrl":null,"url":null,"abstract":"<p><strong>Background: </strong>Large prospective cohort studies have been fruitful for identifying exposure-disease associations. In a cohort where biospecimens (e.g., blood, urine) were collected at enrollment, analysts can exploit a case-cohort approach: Biospecimens from a random sample of cohort participants, called the \"subcohort,\" plus a sample of incident cases that were not part of the subcohort are assayed. Reusing subcohort data for multiple disease outcomes can reduce costs and conserve specimen archives. Pooling biospecimen samples before assay could both save money and reduce depletion of the archive but has not been studied for cohort studies.</p><p><strong>Objectives: </strong>We develop and evaluate a biospecimen pooling strategy for case-cohort analyses that relate an exposure to risk of a rare disease.</p><p><strong>Methods: </strong>Our approach involves constructing pooling sets for cases not in the subcohort after grouping them according to time of diagnosis (e.g., age). In contrast, members of the subcohort are grouped by age at entry before constructing pooling sets. The analyst then fits a logistic regression model that jointly stratifies by age at risk and pooling set size and adjusts for confounders. We used simulations (288 sampling scenarios with 1,000 simulated datasets each) to evaluate the performance of this approach for several sizes of pooling sets and illustrated its application to environmental epidemiologic studies by reanalyzing Sister Study data.</p><p><strong>Results: </strong>Parameter estimates were nearly unbiased, and 95% confidence intervals constructed using a bootstrap estimate of the standard error performed well. In statistical tests also based on the bootstrap standard error, pooling up to 8 specimens per pool caused only modest loss of power. Assigning more cohort members to the subcohort and commensurately increasing the number of specimens per pool improved power and precision substantially while reducing the number of assays.</p><p><strong>Discussion: </strong>When using case-cohort analysis to study disease outcomes in relation to exposures assessed using biospecimens in a cohort study, epidemiologists should consider biospecimen pooling as a way to improve statistical power, conserve irreplaceable archives, and save money. https://doi.org/10.1289/EHP14476.</p>","PeriodicalId":11862,"journal":{"name":"Environmental Health Perspectives","volume":"132 12","pages":"127004"},"PeriodicalIF":10.1000,"publicationDate":"2024-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11668240/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Environmental Health Perspectives","FirstCategoryId":"93","ListUrlMain":"https://doi.org/10.1289/EHP14476","RegionNum":1,"RegionCategory":"环境科学与生态学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"2024/12/24 0:00:00","PubModel":"Epub","JCR":"Q1","JCRName":"ENVIRONMENTAL SCIENCES","Score":null,"Total":0}
引用次数: 0
Abstract
Background: Large prospective cohort studies have been fruitful for identifying exposure-disease associations. In a cohort where biospecimens (e.g., blood, urine) were collected at enrollment, analysts can exploit a case-cohort approach: Biospecimens from a random sample of cohort participants, called the "subcohort," plus a sample of incident cases that were not part of the subcohort are assayed. Reusing subcohort data for multiple disease outcomes can reduce costs and conserve specimen archives. Pooling biospecimen samples before assay could both save money and reduce depletion of the archive but has not been studied for cohort studies.
Objectives: We develop and evaluate a biospecimen pooling strategy for case-cohort analyses that relate an exposure to risk of a rare disease.
Methods: Our approach involves constructing pooling sets for cases not in the subcohort after grouping them according to time of diagnosis (e.g., age). In contrast, members of the subcohort are grouped by age at entry before constructing pooling sets. The analyst then fits a logistic regression model that jointly stratifies by age at risk and pooling set size and adjusts for confounders. We used simulations (288 sampling scenarios with 1,000 simulated datasets each) to evaluate the performance of this approach for several sizes of pooling sets and illustrated its application to environmental epidemiologic studies by reanalyzing Sister Study data.
Results: Parameter estimates were nearly unbiased, and 95% confidence intervals constructed using a bootstrap estimate of the standard error performed well. In statistical tests also based on the bootstrap standard error, pooling up to 8 specimens per pool caused only modest loss of power. Assigning more cohort members to the subcohort and commensurately increasing the number of specimens per pool improved power and precision substantially while reducing the number of assays.
Discussion: When using case-cohort analysis to study disease outcomes in relation to exposures assessed using biospecimens in a cohort study, epidemiologists should consider biospecimen pooling as a way to improve statistical power, conserve irreplaceable archives, and save money. https://doi.org/10.1289/EHP14476.
期刊介绍:
Environmental Health Perspectives (EHP) is a monthly peer-reviewed journal supported by the National Institute of Environmental Health Sciences, part of the National Institutes of Health under the U.S. Department of Health and Human Services. Its mission is to facilitate discussions on the connections between the environment and human health by publishing top-notch research and news. EHP ranks third in Public, Environmental, and Occupational Health, fourth in Toxicology, and fifth in Environmental Sciences.