在生成逻辑回归模型样本时控制分离度

IF 0.8 Q3 STATISTICS & PROBABILITY
Huong T. T. Pham, Hoa Pham
{"title":"在生成逻辑回归模型样本时控制分离度","authors":"Huong T. T. Pham, Hoa Pham","doi":"10.3103/s1066530724700017","DOIUrl":null,"url":null,"abstract":"<h3 data-test=\"abstract-sub-heading\">Abstract</h3><p>Separation has a significant impact on parameter estimates for logistic regression models in frequentist approach and in Bayesian approach. When separation presents in a sample, the maximum likelihood estimation (MLE) does not exist through standard estimation methods. The existence of posterior means is affected by the presence of separation and also depended on the forms of prior distributions. Therefore, controlling the appearance of separation in generating samples from the logistic regression models has an important role for parameter estimation techniques. In this paper, we propose necessary and sufficient conditions for separation occurring in the logistic regression samples with two dimensional models and multiple dimensional models of independent variables. By using the technique of rotating Castesian coordinates of p dimensions, the characteristic of separation occurring in general cases is presented. Using these results, we propose algorithms to control the probability of separation appearance in generated samples for given sample sizes and multiple dimensional models of independent variables. The simulation studies show that the proposed algorithms can effectively generate the designed random samples with controlling the probability of separation appearance.</p>","PeriodicalId":46039,"journal":{"name":"Mathematical Methods of Statistics","volume":null,"pages":null},"PeriodicalIF":0.8000,"publicationDate":"2024-04-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Controlling Separation in Generating Samples for Logistic Regression Models\",\"authors\":\"Huong T. T. Pham, Hoa Pham\",\"doi\":\"10.3103/s1066530724700017\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<h3 data-test=\\\"abstract-sub-heading\\\">Abstract</h3><p>Separation has a significant impact on parameter estimates for logistic regression models in frequentist approach and in Bayesian approach. When separation presents in a sample, the maximum likelihood estimation (MLE) does not exist through standard estimation methods. The existence of posterior means is affected by the presence of separation and also depended on the forms of prior distributions. Therefore, controlling the appearance of separation in generating samples from the logistic regression models has an important role for parameter estimation techniques. In this paper, we propose necessary and sufficient conditions for separation occurring in the logistic regression samples with two dimensional models and multiple dimensional models of independent variables. By using the technique of rotating Castesian coordinates of p dimensions, the characteristic of separation occurring in general cases is presented. Using these results, we propose algorithms to control the probability of separation appearance in generated samples for given sample sizes and multiple dimensional models of independent variables. The simulation studies show that the proposed algorithms can effectively generate the designed random samples with controlling the probability of separation appearance.</p>\",\"PeriodicalId\":46039,\"journal\":{\"name\":\"Mathematical Methods of Statistics\",\"volume\":null,\"pages\":null},\"PeriodicalIF\":0.8000,\"publicationDate\":\"2024-04-25\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Mathematical Methods of Statistics\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.3103/s1066530724700017\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q3\",\"JCRName\":\"STATISTICS & PROBABILITY\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Mathematical Methods of Statistics","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3103/s1066530724700017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q3","JCRName":"STATISTICS & PROBABILITY","Score":null,"Total":0}
引用次数: 0

摘要

摘要 在频数法和贝叶斯法中,分离对逻辑回归模型的参数估计有重大影响。当样本中出现分离时,最大似然估计(MLE)就无法通过标准估计方法实现。后验均值的存在受到分离现象的影响,同时也取决于先验分布的形式。因此,在生成逻辑回归模型样本时控制分离的出现对参数估计技术具有重要作用。本文提出了自变量二维模型和多维模型的逻辑回归样本出现分离的必要条件和充分条件。通过使用 p 维旋转 Castesian 坐标技术,提出了一般情况下发生分离的特征。利用这些结果,我们提出了在给定样本大小和自变量多维模型的情况下,控制生成样本中出现分离的概率的算法。模拟研究表明,所提出的算法可以有效生成设计的随机样本,并控制分离出现的概率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Controlling Separation in Generating Samples for Logistic Regression Models

Abstract

Separation has a significant impact on parameter estimates for logistic regression models in frequentist approach and in Bayesian approach. When separation presents in a sample, the maximum likelihood estimation (MLE) does not exist through standard estimation methods. The existence of posterior means is affected by the presence of separation and also depended on the forms of prior distributions. Therefore, controlling the appearance of separation in generating samples from the logistic regression models has an important role for parameter estimation techniques. In this paper, we propose necessary and sufficient conditions for separation occurring in the logistic regression samples with two dimensional models and multiple dimensional models of independent variables. By using the technique of rotating Castesian coordinates of p dimensions, the characteristic of separation occurring in general cases is presented. Using these results, we propose algorithms to control the probability of separation appearance in generated samples for given sample sizes and multiple dimensional models of independent variables. The simulation studies show that the proposed algorithms can effectively generate the designed random samples with controlling the probability of separation appearance.

求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
Mathematical Methods of Statistics
Mathematical Methods of Statistics STATISTICS & PROBABILITY-
CiteScore
0.60
自引率
0.00%
发文量
2
期刊介绍: Mathematical Methods of Statistics  is an is an international peer reviewed journal dedicated to the mathematical foundations of statistical theory. It primarily publishes research papers with complete proofs and, occasionally, review papers on particular problems of statistics. Papers dealing with applications of statistics are also published if they contain new theoretical developments to the underlying statistical methods. The journal provides an outlet for research in advanced statistical methodology and for studies where such methodology is effectively used or which stimulate its further development.
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信