Bayesian Additive Regression Trees for Group Testing Data.

IF 1.8 4区医学 Q3 MATHEMATICAL & COMPUTATIONAL BIOLOGY

Statistics in Medicine Pub Date : 2025-03-15 DOI:10.1002/sim.70052

Madeleine E St Ville, Christopher S McMahan, Joe D Bible, Joshua M Tebbs, Christopher R Bilder

{"title":"Bayesian Additive Regression Trees for Group Testing Data.","authors":"Madeleine E St Ville, Christopher S McMahan, Joe D Bible, Joshua M Tebbs, Christopher R Bilder","doi":"10.1002/sim.70052","DOIUrl":null,"url":null,"abstract":"<p><p>When screening for low-prevalence diseases, pooling specimens (e.g., blood, urine, swabs, etc.) through group testing has the potential to substantially reduce costs when compared to testing specimens individually. A common goal in group testing applications is to estimate the relationship between an individual's true disease status and their individual-level covariate information. However, estimating such a relationship is a non-trivial problem because true individual disease statuses are unknown due to the group testing protocol and the possibility of imperfect testing. While several regression methods have been developed in recent years to accommodate the complexity of group testing data, the functional form of covariate effects is typically assumed to be known. To avoid model misspecification and to provide a more flexible approach, we propose a Bayesian additive regression trees framework to model the individual-level probability of disease with potentially misclassified group testing data. Our methods can be used to analyze data arising from any group testing protocol with the goal of estimating unknown functions of covariates and assay classification accuracy probabilities.</p>","PeriodicalId":21879,"journal":{"name":"Statistics in Medicine","volume":"44 6","pages":"e70052"},"PeriodicalIF":1.8000,"publicationDate":"2025-03-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11907685/pdf/","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistics in Medicine","FirstCategoryId":"3","ListUrlMain":"https://doi.org/10.1002/sim.70052","RegionNum":4,"RegionCategory":"医学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"MATHEMATICAL & COMPUTATIONAL BIOLOGY","Score":null,"Total":0}

引用次数: 0

Abstract

When screening for low-prevalence diseases, pooling specimens (e.g., blood, urine, swabs, etc.) through group testing has the potential to substantially reduce costs when compared to testing specimens individually. A common goal in group testing applications is to estimate the relationship between an individual's true disease status and their individual-level covariate information. However, estimating such a relationship is a non-trivial problem because true individual disease statuses are unknown due to the group testing protocol and the possibility of imperfect testing. While several regression methods have been developed in recent years to accommodate the complexity of group testing data, the functional form of covariate effects is typically assumed to be known. To avoid model misspecification and to provide a more flexible approach, we propose a Bayesian additive regression trees framework to model the individual-level probability of disease with potentially misclassified group testing data. Our methods can be used to analyze data arising from any group testing protocol with the goal of estimating unknown functions of covariates and assay classification accuracy probabilities.

查看原文本刊更多论文

组测试数据的贝叶斯加性回归树。

在筛查低流行率疾病时，通过群体检测将标本（如血液、尿液、拭子等）集中起来，有可能比单独检测标本大大降低成本。群体检测应用的一个共同目标是估算个体真实疾病状态与其个体水平协变量信息之间的关系。然而，估算这种关系并不是一个简单的问题，因为由于集体检测方案和不完善检测的可能性，真实的个体疾病状态是未知的。虽然近年来已开发出几种回归方法来适应分组测试数据的复杂性，但一般都假定协变量效应的函数形式是已知的。为了避免模型的错误规范并提供一种更灵活的方法，我们提出了一种贝叶斯加性回归树框架，用于对可能存在分类错误的群体测试数据进行个体层面的疾病概率建模。我们的方法可用于分析任何群体测试方案产生的数据，目的是估计协变量的未知函数和检测分类准确概率。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Statistics in Medicine 医学-公共卫生、环境卫生与职业卫生

CiteScore

3.40

自引率

10.00%

发文量

334

审稿时长

2-4 weeks

期刊介绍： The journal aims to influence practice in medicine and its associated sciences through the publication of papers on statistical and other quantitative methods. Papers will explain new methods and demonstrate their application, preferably through a substantive, real, motivating example or a comprehensive evaluation based on an illustrative example. Alternatively, papers will report on case-studies where creative use or technical generalizations of established methodology is directed towards a substantive application. Reviews of, and tutorials on, general topics relevant to the application of statistics to medicine will also be published. The main criteria for publication are appropriateness of the statistical methods to a particular medical problem and clarity of exposition. Papers with primarily mathematical content will be excluded. The journal aims to enhance communication between statisticians, clinicians and medical researchers.