Regression analysis of group-tested current status data

IF 2.4 2区 数学 Q2 BIOLOGY
Biometrika Pub Date : 2024-02-08 DOI:10.1093/biomet/asae006
Shuwei Li, Tao Hu, Lianming Wang, Christopher S McMahan, Joshua M Tebbs
{"title":"Regression analysis of group-tested current status data","authors":"Shuwei Li, Tao Hu, Lianming Wang, Christopher S McMahan, Joshua M Tebbs","doi":"10.1093/biomet/asae006","DOIUrl":null,"url":null,"abstract":"Summary Group testing is an effective way to reduce the time and cost associated with conducting large-scale screening for infectious diseases. Benefits are realized through testing pools formed by combining specimens, such as blood or urine, from different individuals. In some studies, individuals are assessed only once and a time-to-event endpoint is recorded, for example, the time until infection. Combining group testing with this type of endpoint results in group-tested current status data (?). To analyse these complex data, we propose methods which estimate a proportional hazards regression model based on test outcomes from measuring the pools. A sieve maximum likelihood estimation approach is developed that approximates the cumulative baseline hazard function with a piecewise constant function. To identify the sieve estimator, a computationally efficient expectation-maximization algorithm is derived by using data augmentation. Asymptotic properties of both the parametric and nonparametric components of the sieve estimator are then established by applying modern empirical process theory. Numerical results from simulation studies show that our proposed method performs nominally and has advantages over the corresponding estimation method based on individual testing results. We illustrate our work by analysing a chlamydia dataset collected by the State Hygienic Laboratory at the University of Iowa.","PeriodicalId":9001,"journal":{"name":"Biometrika","volume":"25 1","pages":""},"PeriodicalIF":2.4000,"publicationDate":"2024-02-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biometrika","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1093/biomet/asae006","RegionNum":2,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Summary Group testing is an effective way to reduce the time and cost associated with conducting large-scale screening for infectious diseases. Benefits are realized through testing pools formed by combining specimens, such as blood or urine, from different individuals. In some studies, individuals are assessed only once and a time-to-event endpoint is recorded, for example, the time until infection. Combining group testing with this type of endpoint results in group-tested current status data (?). To analyse these complex data, we propose methods which estimate a proportional hazards regression model based on test outcomes from measuring the pools. A sieve maximum likelihood estimation approach is developed that approximates the cumulative baseline hazard function with a piecewise constant function. To identify the sieve estimator, a computationally efficient expectation-maximization algorithm is derived by using data augmentation. Asymptotic properties of both the parametric and nonparametric components of the sieve estimator are then established by applying modern empirical process theory. Numerical results from simulation studies show that our proposed method performs nominally and has advantages over the corresponding estimation method based on individual testing results. We illustrate our work by analysing a chlamydia dataset collected by the State Hygienic Laboratory at the University of Iowa.
对分组测试的现状数据进行回归分析
摘要 群体检测是减少大规模传染病筛查所需的时间和成本的有效方法。通过将不同个体的血液或尿液等标本组合成检测池,可实现效益。在某些研究中,只对个体进行一次评估,并记录时间终点,如感染前的时间。将分组检测与这类终点相结合,就会得到分组检测的当前状态数据(?)为了分析这些复杂的数据,我们提出了一些方法,这些方法基于测量池的测试结果来估计比例危险回归模型。我们开发了一种筛网最大似然估计方法,用一个片断常数函数来近似累积基线危害函数。为了确定筛网估计器,利用数据扩增推导出了一种计算高效的期望最大化算法。然后,通过应用现代经验过程理论,建立了筛网估计器的参数和非参数部分的渐近特性。模拟研究的数值结果表明,与基于单个测试结果的相应估算方法相比,我们提出的方法具有显著的性能和优势。我们通过分析爱荷华大学国家卫生实验室收集的衣原体数据集来说明我们的工作。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Biometrika
Biometrika 生物-生物学
CiteScore
5.50
自引率
3.70%
发文量
56
审稿时长
6-12 weeks
期刊介绍: Biometrika is primarily a journal of statistics in which emphasis is placed on papers containing original theoretical contributions of direct or potential value in applications. From time to time, papers in bordering fields are also published.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信