双鲁棒条件独立测试新的生物标志物给定的风险因素与生存数据。

IF 1.7 4区数学 Q3 BIOLOGY

Biometrics Pub Date : 2025-10-08 DOI:10.1093/biomtc/ujaf133

Baoying Yang, Jing Qin, Jing Ning, Yukun Liu

{"title":"双鲁棒条件独立测试新的生物标志物给定的风险因素与生存数据。","authors":"Baoying Yang, Jing Qin, Jing Ning, Yukun Liu","doi":"10.1093/biomtc/ujaf133","DOIUrl":null,"url":null,"abstract":"Conditional independence is a foundational concept for understanding probabilistic relationships among variables, with broad applications in fields such as causal inference and machine learning. This study focuses on testing conditional independence, $T\\perp X|Z$, where T represents survival data possibly subject to right censoring, Z represents established risk factors for T, and X represents potential novel biomarkers. The goal is to identify novel biomarkers that offer additional merits for further risk assessment and prediction. This can be achieved by using either the partial or parametric likelihood ratio statistic to evaluate whether the coefficient vector of X in the conditional model of T given $(X^{ \\mathrm{\\scriptscriptstyle \\top } }, Z^{ \\mathrm{\\scriptscriptstyle \\top } })^{ \\mathrm{\\scriptscriptstyle \\top } }$ is equal to zero. Traditional tests such as directly comparing likelihood ratios to chi-squared distributions may produce erroneous type-I error rates under model misspecification. As an alternative, we propose a resampling-based method to approximate the distribution of the likelihood ratios. A key advantage of the proposed test is its double robustness: it achieves approximately correct type-I error rates when either the conditional outcome model or the working model of ${\\rm pr} (X|Z)$ is correctly specified. Additionally, machine learning techniques can be incorporated to improve test performance. Simulation studies and the application to the Alzheimer's Disease Neuroimaging Initiative (ADNI) data demonstrate the finite-sample performance of the proposed tests.","PeriodicalId":8930,"journal":{"name":"Biometrics","volume":"81 4","pages":""},"PeriodicalIF":1.7000,"publicationDate":"2025-10-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Double robust conditional independence test for novel biomarkers given established risk factors with survival data.\",\"authors\":\"Baoying Yang, Jing Qin, Jing Ning, Yukun Liu\",\"doi\":\"10.1093/biomtc/ujaf133\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Conditional independence is a foundational concept for understanding probabilistic relationships among variables, with broad applications in fields such as causal inference and machine learning. This study focuses on testing conditional independence, $T\\\\perp X|Z$, where T represents survival data possibly subject to right censoring, Z represents established risk factors for T, and X represents potential novel biomarkers. The goal is to identify novel biomarkers that offer additional merits for further risk assessment and prediction. This can be achieved by using either the partial or parametric likelihood ratio statistic to evaluate whether the coefficient vector of X in the conditional model of T given $(X^{ \\\\mathrm{\\\\scriptscriptstyle \\\\top } }, Z^{ \\\\mathrm{\\\\scriptscriptstyle \\\\top } })^{ \\\\mathrm{\\\\scriptscriptstyle \\\\top } }$ is equal to zero. Traditional tests such as directly comparing likelihood ratios to chi-squared distributions may produce erroneous type-I error rates under model misspecification. As an alternative, we propose a resampling-based method to approximate the distribution of the likelihood ratios. A key advantage of the proposed test is its double robustness: it achieves approximately correct type-I error rates when either the conditional outcome model or the working model of ${\\\\rm pr} (X|Z)$ is correctly specified. Additionally, machine learning techniques can be incorporated to improve test performance. Simulation studies and the application to the Alzheimer's Disease Neuroimaging Initiative (ADNI) data demonstrate the finite-sample performance of the proposed tests.\",\"PeriodicalId\":8930,\"journal\":{\"name\":\"Biometrics\",\"volume\":\"81 4\",\"pages\":\"\"},\"PeriodicalIF\":1.7000,\"publicationDate\":\"2025-10-08\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Biometrics\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1093/biomtc/ujaf133\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"BIOLOGY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Biometrics","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1093/biomtc/ujaf133","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"BIOLOGY","Score":null,"Total":0}

引用次数: 0

摘要

条件独立是理解变量间概率关系的基本概念，在因果推理和机器学习等领域有着广泛的应用。这项研究的重点是测试条件独立性，$T\perp X|Z$，其中T代表可能受到正确审查的生存数据，Z代表T的既定风险因素，X代表潜在的新生物标志物。目标是确定新的生物标志物，为进一步的风险评估和预测提供额外的优点。这可以通过使用偏似然比或参数似然比统计量来评估给定$(X^{\mathrm{\scriptscriptstyle \top}}, Z^{\mathrm{\scriptscriptstyle \top}})^{\mathrm{\scriptscriptstyle \top}}$的条件模型中X的系数向量是否等于零来实现。传统的检验，如直接将似然比与卡方分布进行比较，可能会在模型错误规范下产生错误的i型错误率。作为替代方案，我们提出了一种基于重采样的方法来近似似然比的分布。所提出的测试的一个关键优势是它的双重鲁棒性：当条件结果模型或${\rm pr} (X|Z)$的工作模型被正确指定时，它实现了近似正确的i型错误率。此外，可以结合机器学习技术来提高测试性能。模拟研究和对阿尔茨海默病神经成像倡议（ADNI）数据的应用证明了所提出的测试的有限样本性能。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Double robust conditional independence test for novel biomarkers given established risk factors with survival data.

Conditional independence is a foundational concept for understanding probabilistic relationships among variables, with broad applications in fields such as causal inference and machine learning. This study focuses on testing conditional independence, $T\perp X|Z$, where T represents survival data possibly subject to right censoring, Z represents established risk factors for T, and X represents potential novel biomarkers. The goal is to identify novel biomarkers that offer additional merits for further risk assessment and prediction. This can be achieved by using either the partial or parametric likelihood ratio statistic to evaluate whether the coefficient vector of X in the conditional model of T given $(X^{ \mathrm{\scriptscriptstyle \top } }, Z^{ \mathrm{\scriptscriptstyle \top } })^{ \mathrm{\scriptscriptstyle \top } }$ is equal to zero. Traditional tests such as directly comparing likelihood ratios to chi-squared distributions may produce erroneous type-I error rates under model misspecification. As an alternative, we propose a resampling-based method to approximate the distribution of the likelihood ratios. A key advantage of the proposed test is its double robustness: it achieves approximately correct type-I error rates when either the conditional outcome model or the working model of ${\rm pr} (X|Z)$ is correctly specified. Additionally, machine learning techniques can be incorporated to improve test performance. Simulation studies and the application to the Alzheimer's Disease Neuroimaging Initiative (ADNI) data demonstrate the finite-sample performance of the proposed tests.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Biometrics 生物-生物学

CiteScore

2.70

自引率

5.30%

发文量

178

审稿时长

4-8 weeks

期刊介绍： The International Biometric Society is an international society promoting the development and application of statistical and mathematical theory and methods in the biosciences, including agriculture, biomedical science and public health, ecology, environmental sciences, forestry, and allied disciplines. The Society welcomes as members statisticians, mathematicians, biological scientists, and others devoted to interdisciplinary efforts in advancing the collection and interpretation of information in the biosciences. The Society sponsors the biennial International Biometric Conference, held in sites throughout the world; through its National Groups and Regions, it also Society sponsors regional and local meetings.