Biostatistics: latest articles, page 4

Causal functional mediation analysis with an application to functional magnetic resonance imaging data.

IF 1.8, Zone 3 (Mathematics)

Biostatistics Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf019

Yi Zhao, Xi Luo, Michael E Sobel, Martin A Lindquist, Brian S Caffo

Citations: 0

Model-based multifacet clustering with high-dimensional omics applications.

IF 1.8, Zone 3 (Mathematics)

Biostatistics Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae020

Wei Zong, Danyang Li, Marianne L Seney, Colleen A Mcclung, George C Tseng

Citations: 0

Speeding up interval estimation for R2-based mediation effect of high-dimensional mediators via cross-fitting.

IF 2, Zone 3 (Mathematics)

Biostatistics Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae037

Zhichao Xu, Chunlin Li, Sunyi Chi, Tianzhong Yang, Peng Wei

Abstract: Mediation analysis is a useful tool in investigating how molecular phenotypes such as gene expression mediate the effect of exposure on health outcomes. However, commonly used mean-based total mediation effect measures may suffer from cancellation of component-wise mediation effects in opposite directions in the presence of high-dimensional omics mediators. To overcome this limitation, we recently proposed a variance-based R-squared total mediation effect measure that relies on the computationally intensive nonparametric bootstrap for confidence interval estimation. In the work described herein, we formulated a more efficient two-stage, cross-fitted estimation procedure for the R2 measure. To avoid potential bias, we performed iterative Sure Independence Screening (iSIS) in two subsamples to exclude the non-mediators, followed by ordinary least squares regressions for the variance estimation. We then constructed confidence intervals based on the newly derived closed-form asymptotic distribution of the R2 measure. Extensive simulation studies demonstrated that this proposed procedure is much more computationally efficient than the resampling-based method, with comparable coverage probability. Furthermore, when applied to the Framingham Heart Study, the proposed method replicated the established finding of gene expression mediating age-related variation in systolic blood pressure and identified the role of gene expression profiles in the relationship between sex and high-density lipoprotein cholesterol level. The proposed estimation procedure is implemented in R package CFR2M.

Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823199/pdf/

Citations: 0

The impact of coarsening an exposure on partial identifiability in instrumental variable settings.

IF 2, Zone 3 (Mathematics)

Biostatistics Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae042

Erin E Gabriel, Michael C Sachs, Arvid Sjölander

Citations: 0

Fast standard error estimation for joint models of longitudinal and time-to-event data based on stochastic EM algorithms.

IF 2, Zone 3 (Mathematics)

Biostatistics Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae043

Tingting Yu, Lang Wu, Ronald J Bosch, Davey M Smith, Rui Wang

Abstract: Maximum likelihood inference can often become computationally intensive when performing joint modeling of longitudinal and time-to-event data, due to the intractable integrals in the joint likelihood function. The computational challenges escalate further when modeling HIV-1 viral load data, owing to the nonlinear trajectories and the presence of left-censored data resulting from the assay's lower limit of quantification. In this paper, for a joint model comprising a nonlinear mixed-effect model and a Cox Proportional Hazards model, we develop a computationally efficient Stochastic EM (StEM) algorithm for parameter estimation. Furthermore, we propose a novel technique for fast standard error estimation, which directly estimates standard errors from the results of StEM iterations and is broadly applicable to various joint modeling settings, such as those containing generalized linear mixed-effect models, parametric survival models, or joint models with more than two submodels. We evaluate the performance of the proposed methods through simulation studies and apply them to HIV-1 viral load data from six AIDS Clinical Trials Group studies to characterize viral rebound trajectories following the interruption of antiretroviral therapy (ART), accounting for the informative duration of off-ART periods.

Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC11823262/pdf/
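The abstract above combines two ideas, a stochastic EM (StEM) algorithm and left-censoring at a lower limit of quantification. The sketch below illustrates both on a deliberately tiny problem: estimating the mean of normally distributed measurements when values below the limit are only known to be censored. It is not the paper's joint model (no mixed effects, no Cox submodel) and it does not reproduce the paper's standard error technique. The function name `stem_censored_mean`, the known standard deviation, and the simulated data are all assumptions made for illustration.

```python
import random
from statistics import NormalDist, fmean


def stem_censored_mean(obs, n_cens, loq, sigma=1.0, iters=300, burn=100, seed=0):
    """Toy stochastic EM for the mean of N(mu, sigma^2) data.

    obs    : values observed at or above the limit of quantification (loq)
    n_cens : number of values known only to lie below loq
    sigma  : standard deviation, assumed known to keep the sketch short
    """
    rng = random.Random(seed)
    mu = fmean(obs)  # starting value ignores censoring, so it is biased upward
    draws = []
    for it in range(iters):
        dist = NormalDist(mu, sigma)
        p_loq = dist.cdf(loq)
        # S-step: impute each censored value by inverse-CDF sampling from
        # N(mu, sigma^2) truncated to the region below loq.
        imputed = [
            dist.inv_cdf(rng.uniform(1e-12, 1.0) * p_loq) for _ in range(n_cens)
        ]
        # M-step: complete-data maximum likelihood estimate of mu.
        mu = (sum(obs) + sum(imputed)) / (len(obs) + n_cens)
        if it >= burn:
            draws.append(mu)
    # StEM iterates fluctuate around the MLE; average them after burn-in.
    return fmean(draws)


# Simulated example (hypothetical data): true mean 1.0, limit 0.5.
_rng = random.Random(42)
_data = [_rng.gauss(1.0, 1.0) for _ in range(2000)]
_obs = [x for x in _data if x >= 0.5]
_est = stem_censored_mean(_obs, len(_data) - len(_obs), loq=0.5)
print(f"naive mean of observed values: {fmean(_obs):.2f}")
print(f"StEM estimate of the mean:     {_est:.2f}")
```

The naive mean of the uncensored values overstates the true mean because the smallest measurements are missing; the StEM average corrects for this by repeatedly imputing them under the current parameter value.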

Citations: 0

Adaptive Gaussian Markov random fields for child mortality estimation.

IF 1.8, Zone 3 (Mathematics)

Biostatistics Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae030

Serge Aleshin-Guendel, Jon Wakefield

Citations: 0

Incorporating historic information to further improve power when conducting Bayesian information borrowing in basket trials.

IF 1.8, Zone 3 (Mathematics)

Biostatistics Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf016

Libby Daniells, Pavel Mozgunov, Helen Barnett, Alun Bedding, Thomas Jaki

Abstract: In basket trials a single therapeutic treatment is tested on several patient populations simultaneously, each of which forms a basket, where patients across all baskets on the trial share a common genetic aberration. These trials allow testing of treatments on small groups of patients; however, limited basket sample sizes can result in inadequate precision and power of estimates. It is well known that Bayesian information borrowing models such as the exchangeability-nonexchangeability (EXNEX) model can be implemented to tackle such a problem, drawing on information from one basket when making inference in another. An alternative approach to improve power of estimates is to incorporate any historical or external information available. This paper considers models that amalgamate both forms of information borrowing, allowing borrowing between baskets in the ongoing trial whilst also drawing on response data from historical sources, with the aim to further improve treatment effect estimates. We propose several Bayesian information borrowing approaches that incorporate historical information into the model. These methods are data-driven, updating the degree of borrowing based on the level of homogeneity between information sources. A thorough simulation study is presented to draw comparisons between the proposed approaches, whilst also comparing to the standard EXNEX model in which no historical information is utilized. The models are also applied to a real-life trial example to demonstrate their performance in practice. We show that the incorporation of historic data under the novel approaches can lead to a substantial improvement in precision and power of treatment effect estimates when such data is homogeneous to the responses in the ongoing trial. Under some approaches, this came alongside an inflation in type I error rate in cases of heterogeneity. However, the use of a power prior in the EXNEX model is shown to increase power and precision, whilst maintaining similar error rates to the standard EXNEX model.

Volume 26, issue 1. Open-access PDF: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12204204/pdf/
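To make the power prior mentioned in the abstract concrete, here is a minimal sketch for a single basket with binary responses. It assumes a Beta(a, b) initial prior and a fixed discount weight `a0` in [0, 1] applied to the historical likelihood, which gives a conjugate Beta posterior. This is only the textbook fixed-weight power prior: it is not the EXNEX model, and it does not reproduce the data-driven borrowing the paper proposes. The function names and the trial counts are hypothetical.

```python
def power_prior_posterior(y, n, y0, n0, a0, a=1.0, b=1.0):
    """Beta posterior for a response rate under a fixed-weight power prior.

    y, n   : responders and patients in the current basket
    y0, n0 : responders and patients in the historical data
    a0     : discount weight (0 = ignore history, 1 = pool it fully)
    a, b   : parameters of the initial Beta prior (assumed, default uniform)
    """
    if not 0.0 <= a0 <= 1.0:
        raise ValueError("a0 must lie in [0, 1]")
    alpha = a + a0 * y0 + y
    beta = b + a0 * (n0 - y0) + (n - y)
    return alpha, beta


def beta_mean_var(alpha, beta):
    """Mean and variance of a Beta(alpha, beta) distribution."""
    total = alpha + beta
    mean = alpha / total
    var = alpha * beta / (total ** 2 * (total + 1.0))
    return mean, var


# Hypothetical counts: 4/10 responders now, 12/30 in a historical source.
for a0 in (0.0, 0.5, 1.0):
    alpha, beta = power_prior_posterior(y=4, n=10, y0=12, n0=30, a0=a0)
    mean, var = beta_mean_var(alpha, beta)
    print(f"a0={a0:.1f}  posterior Beta({alpha:.0f}, {beta:.0f})  "
          f"mean={mean:.3f}  sd={var ** 0.5:.3f}")
```

With homogeneous historical data, as in this made-up example, a larger `a0` shrinks the posterior standard deviation, which is the precision gain the abstract describes. When the historical response rate differs from the current one, the same borrowing pulls the estimate toward the historical rate, which is the mechanism behind the type I error inflation noted for some approaches.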

Citations: 0

Predicting distributions of physical activity profiles in the National Health and Nutrition Examination Survey database using a partially linear Fréchet single index model.

IF 1.8, Zone 3 (Mathematics)

Biostatistics Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf013

Marcos Matabuena, Aritra Ghosal, Wendy Meiring, Alexander Petersen

Abstract: Object-oriented data analysis is a fascinating and evolving field in modern statistical science, with the potential to make significant contributions to biomedical applications. This statistical framework facilitates the development of new methods to analyze complex data objects that capture more information than traditional clinical biomarkers. This paper applies the object-oriented framework to analyze physical activity levels, measured by accelerometers, as response objects in a regression model. Unlike traditional summary metrics, we utilize a recently proposed representation of physical activity data as a distributional object, providing a more nuanced and complete profile of individual energy expenditure across all ranges of monitoring intensity. A novel hybrid Fréchet regression model is proposed and applied to US population accelerometer data from National Health and Nutrition Examination Survey (NHANES) 2011 to 2014. The semi-parametric nature of the model allows for the inclusion of nonlinear effects for critical variables, such as age, which are biologically known to have subtle impacts on physical activity. Simultaneously, the inclusion of linear effects preserves interpretability for other variables, particularly categorical covariates such as ethnicity and sex. The results obtained are valuable from a public health perspective and could lead to new strategies for optimizing physical activity interventions in specific American subpopulations.

Volume 26, issue 1.

Citations: 0

Stochastic EM algorithm for partially observed stochastic epidemics with individual heterogeneity.

IF 1.8, Zone 3 (Mathematics)

Biostatistics Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxae018

Fan Bu, Allison E Aiello, Alexander Volfovsky, Jason Xu

Citations: 0

The winner's curse under dependence: repairing empirical Bayes using convoluted densities.

IF 2, Zone 3 (Mathematics)

Biostatistics Pub Date : 2024-12-31 DOI: 10.1093/biostatistics/kxaf025

Stijn Hawinkel, Olivier Thas, Steven Maere

Abstract: The winner's curse is a form of selection bias that arises when estimates are obtained for a large number of features, but only a subset of most extreme estimates is reported. It occurs in large scale significance testing as well as in rank-based selection, and imperils reproducibility of findings and follow-up study design. Several methods correcting for this selection bias have been proposed, but questions remain on their susceptibility to dependence between features since theoretical analyses and comparative studies are few. We prove that estimation through Tweedie's formula is biased in presence of strong dependence, and propose a convolution of its density estimator to restore its competitive performance, which also aids other empirical Bayes methods. Furthermore, we perform a comprehensive simulation study comparing different classes of winner's curse correction methods for point estimates as well as confidence intervals under dependence. We find a bootstrap method and empirical Bayes methods with density convolution to perform best at correcting the selection bias, although this correction generally does not improve the feature ranking. Finally, we apply the methods to a comparison of single-feature versus multi-feature prediction models in predicting Brassica napus phenotypes from gene expression data, demonstrating that the superiority of the best single-feature model may be illusory.

Volume 26, issue 1.
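The winner's curse and Tweedie's formula can be illustrated with a small simulation. For an estimate z drawn from N(theta, sigma^2) with marginal density f, Tweedie's formula gives E[theta | z] = z + sigma^2 * d/dz log f(z). The sketch below makes strong simplifying assumptions that the paper does not: features are independent, the effects follow a N(0, tau^2) prior, and the marginal density is treated as known rather than estimated from the data. It therefore shows only the basic selection bias and its correction, not the dependence problem or the density convolution the paper proposes. All names and parameter values are illustrative.

```python
import random
import statistics


def tweedie_normal(z, sigma2, tau2):
    """Tweedie's formula E[theta | z] = z + sigma2 * d/dz log f(z),
    with the marginal f assumed known to be N(0, tau2 + sigma2)."""
    score = -z / (tau2 + sigma2)  # derivative of the log marginal density
    return z + sigma2 * score


def simulate(n_features=5000, top=50, sigma2=1.0, tau2=1.0, seed=1):
    """Select the `top` largest estimates and compare three averages."""
    rng = random.Random(seed)
    theta = [rng.gauss(0.0, tau2 ** 0.5) for _ in range(n_features)]
    z = [t + rng.gauss(0.0, sigma2 ** 0.5) for t in theta]
    winners = sorted(range(n_features), key=lambda i: z[i])[-top:]
    raw = statistics.fmean(z[i] for i in winners)
    truth = statistics.fmean(theta[i] for i in winners)
    corrected = statistics.fmean(
        tweedie_normal(z[i], sigma2, tau2) for i in winners
    )
    return raw, truth, corrected


raw, truth, corrected = simulate()
print(f"mean raw estimate of winners: {raw:.2f}")
print(f"mean true effect of winners:  {truth:.2f}")
print(f"mean Tweedie-corrected value: {corrected:.2f}")
```

The raw estimates of the selected features overstate their true effects, while the corrected values land much closer. Because the correction here is a monotone function of z, it leaves the ordering of features unchanged, which is consistent with the abstract's remark that correcting the bias generally does not improve the feature ranking.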

Citations: 0