Daniel P Beavers, Yutong Li, James D Stamey, Stephanie Powers, Walter T Ambrosius
{"title":"歧义错分类二元协变量逻辑回归的贝叶斯变量选择。","authors":"Daniel P Beavers, Yutong Li, James D Stamey, Stephanie Powers, Walter T Ambrosius","doi":"10.1080/03610918.2025.2496305","DOIUrl":null,"url":null,"abstract":"<p><p>A Bayesian approach for variable selection is developed for use in models with a misclassified binary predictor variable. We define the main outcome model containing the latent predictor, the measurement model associated with the prevalence of the predictor, and the sensitivity and specificity models of the fallible classifier conditioned on the true value of the predictor. We use binary indicator variables to execute the Gibbs sampler-based variable selection process, and we identify the highest posterior probability model given the data. We demonstrate the performance of the procedure in several simulation studies, and we utilize the selection method to optimize model performance in two datasets.</p>","PeriodicalId":55240,"journal":{"name":"Communications in Statistics-Simulation and Computation","volume":" ","pages":""},"PeriodicalIF":0.8000,"publicationDate":"2025-05-05","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12371521/pdf/","citationCount":"0","resultStr":"{\"title\":\"Bayesian variable selection for logistic regression with a differentially misclassified binary covariate.\",\"authors\":\"Daniel P Beavers, Yutong Li, James D Stamey, Stephanie Powers, Walter T Ambrosius\",\"doi\":\"10.1080/03610918.2025.2496305\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<p><p>A Bayesian approach for variable selection is developed for use in models with a misclassified binary predictor variable. We define the main outcome model containing the latent predictor, the measurement model associated with the prevalence of the predictor, and the sensitivity and specificity models of the fallible classifier conditioned on the true value of the predictor. We use binary indicator variables to execute the Gibbs sampler-based variable selection process, and we identify the highest posterior probability model given the data. We demonstrate the performance of the procedure in several simulation studies, and we utilize the selection method to optimize model performance in two datasets.</p>\",\"PeriodicalId\":55240,\"journal\":{\"name\":\"Communications in Statistics-Simulation and Computation\",\"volume\":\" \",\"pages\":\"\"},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2025-05-05\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://www.ncbi.nlm.nih.gov/pmc/articles/PMC12371521/pdf/\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Communications in Statistics-Simulation and Computation\",\"FirstCategoryId\":\"100\",\"ListUrlMain\":\"https://doi.org/10.1080/03610918.2025.2496305\",\"RegionNum\":4,\"RegionCategory\":\"数学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Communications in Statistics-Simulation and Computation","FirstCategoryId":"100","ListUrlMain":"https://doi.org/10.1080/03610918.2025.2496305","RegionNum":4,"RegionCategory":"数学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
Bayesian variable selection for logistic regression with a differentially misclassified binary covariate.
A Bayesian approach for variable selection is developed for use in models with a misclassified binary predictor variable. We define the main outcome model containing the latent predictor, the measurement model associated with the prevalence of the predictor, and the sensitivity and specificity models of the fallible classifier conditioned on the true value of the predictor. We use binary indicator variables to execute the Gibbs sampler-based variable selection process, and we identify the highest posterior probability model given the data. We demonstrate the performance of the procedure in several simulation studies, and we utilize the selection method to optimize model performance in two datasets.
期刊介绍:
The Simulation and Computation series intends to publish papers that make theoretical and methodological advances relating to computational aspects of Probability and Statistics. Simulational assessment and comparison of the performance of statistical and probabilistic methods will also be considered for publication. Papers stressing graphical methods, resampling and other computationally intensive methods will be particularly relevant. In addition, special issues dedicated to a specific topic of current interest will also be published in this series periodically, providing an exhaustive and up-to-date review of that topic to the readership.