Double robust conditional independence test for novel biomarkers given established risk factors with survival data.

IF 1.7 4区 数学 Q3 BIOLOGY
Biometrics Pub Date : 2025-10-08 DOI:10.1093/biomtc/ujaf133
Baoying Yang, Jing Qin, Jing Ning, Yukun Liu
{"title":"Double robust conditional independence test for novel biomarkers given established risk factors with survival data.","authors":"Baoying Yang, Jing Qin, Jing Ning, Yukun Liu","doi":"10.1093/biomtc/ujaf133","DOIUrl":null,"url":null,"abstract":"<p><p>Conditional independence is a foundational concept for understanding probabilistic relationships among variables, with broad applications in fields such as causal inference and machine learning. This study focuses on testing conditional independence, $T\\perp X|Z$, where T represents survival data possibly subject to right censoring, Z represents established risk factors for T, and X represents potential novel biomarkers. The goal is to identify novel biomarkers that offer additional merits for further risk assessment and prediction. This can be achieved by using either the partial or parametric likelihood ratio statistic to evaluate whether the coefficient vector of X in the conditional model of T given $(X^{ \\mathrm{\\scriptscriptstyle \\top } }, Z^{ \\mathrm{\\scriptscriptstyle \\top } })^{ \\mathrm{\\scriptscriptstyle \\top } }$ is equal to zero. Traditional tests such as directly comparing likelihood ratios to chi-squared distributions may produce erroneous type-I error rates under model misspecification. As an alternative, we propose a resampling-based method to approximate the distribution of the likelihood ratios. A key advantage of the proposed test is its double robustness: it achieves approximately correct type-I error rates when either the conditional outcome model or the working model of ${\\rm pr} (X|Z)$ is correctly specified. Additionally, machine learning techniques can be incorporated to improve test performance. Simulation studies and the application to the Alzheimer's Disease Neuroimaging Initiative (ADNI) data demonstrate the finite-sample performance of the proposed tests.</p>","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biometrics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1093/biomtc/ujaf133","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOLOGY","Score":null,"Total":0}
引用次数: 0

Abstract

Conditional independence is a foundational concept for understanding probabilistic relationships among variables, with broad applications in fields such as causal inference and machine learning. This study focuses on testing conditional independence, $T\perp X|Z$, where T represents survival data possibly subject to right censoring, Z represents established risk factors for T, and X represents potential novel biomarkers. The goal is to identify novel biomarkers that offer additional merits for further risk assessment and prediction. This can be achieved by using either the partial or parametric likelihood ratio statistic to evaluate whether the coefficient vector of X in the conditional model of T given $(X^{ \mathrm{\scriptscriptstyle \top } }, Z^{ \mathrm{\scriptscriptstyle \top } })^{ \mathrm{\scriptscriptstyle \top } }$ is equal to zero. Traditional tests such as directly comparing likelihood ratios to chi-squared distributions may produce erroneous type-I error rates under model misspecification. As an alternative, we propose a resampling-based method to approximate the distribution of the likelihood ratios. A key advantage of the proposed test is its double robustness: it achieves approximately correct type-I error rates when either the conditional outcome model or the working model of ${\rm pr} (X|Z)$ is correctly specified. Additionally, machine learning techniques can be incorporated to improve test performance. Simulation studies and the application to the Alzheimer's Disease Neuroimaging Initiative (ADNI) data demonstrate the finite-sample performance of the proposed tests.

双鲁棒条件独立测试新的生物标志物给定的风险因素与生存数据。
条件独立是理解变量间概率关系的基本概念,在因果推理和机器学习等领域有着广泛的应用。这项研究的重点是测试条件独立性,$T\perp X|Z$,其中T代表可能受到正确审查的生存数据,Z代表T的既定风险因素,X代表潜在的新生物标志物。目标是确定新的生物标志物,为进一步的风险评估和预测提供额外的优点。这可以通过使用偏似然比或参数似然比统计量来评估给定$(X^{\mathrm{\scriptscriptstyle \top}}, Z^{\mathrm{\scriptscriptstyle \top}})^{\mathrm{\scriptscriptstyle \top}}$的条件模型中X的系数向量是否等于零来实现。传统的检验,如直接将似然比与卡方分布进行比较,可能会在模型错误规范下产生错误的i型错误率。作为替代方案,我们提出了一种基于重采样的方法来近似似然比的分布。所提出的测试的一个关键优势是它的双重鲁棒性:当条件结果模型或${\rm pr} (X|Z)$的工作模型被正确指定时,它实现了近似正确的i型错误率。此外,可以结合机器学习技术来提高测试性能。模拟研究和对阿尔茨海默病神经成像倡议(ADNI)数据的应用证明了所提出的测试的有限样本性能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Biometrics
Biometrics 生物-生物学
CiteScore
2.70
自引率
5.30%
发文量
178
审稿时长
4-8 weeks
期刊介绍: The International Biometric Society is an international society promoting the development and application of statistical and mathematical theory and methods in the biosciences, including agriculture, biomedical science and public health, ecology, environmental sciences, forestry, and allied disciplines. The Society welcomes as members statisticians, mathematicians, biological scientists, and others devoted to interdisciplinary efforts in advancing the collection and interpretation of information in the biosciences. The Society sponsors the biennial International Biometric Conference, held in sites throughout the world; through its National Groups and Regions, it also Society sponsors regional and local meetings.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信