{"title":"选择模型内生性和异质性的贝叶斯半参数方法","authors":"Yang Li, Asim Ansari","doi":"10.2139/ssrn.2270993","DOIUrl":null,"url":null,"abstract":"Marketing variables that are included in consumer discrete choice models are often endogenous. Extant treatments using likelihood-based estimators impose parametric distributional assumptions, such as normality, on the source of endogeneity. These assumptions are restrictive because misspecified distributions have an impact on parameter estimates and associated elasticities. The normality assumption for endogeneity can be inconsistent with some marginal cost specifications given a price-setting process, although they are consistent with other specifications. In this paper, we propose a heterogeneous Bayesian semiparametric approach for modeling choice endogeneity that offers a flexible and robust alternative to parametric methods. Specifically, we construct centered Dirichlet process mixtures CDPM to allow uncertainty over the distribution of endogeneity errors. In a similar vein, we also model consumer preference heterogeneity nonparametrically via a CDPM. Results on simulated data show that incorrect distributional assumptions can lead to poor recovery of model parameters and price elasticities, whereas the proposed semiparametric model is able to robustly recover the true parameters in an efficient fashion. In addition, the CDPM offers the benefits of automatically inferring the number of mixture components that are appropriate for a given data set and is able to reconstruct the shape of the underlying distributions for endogeneity and heterogeneity errors. We apply our approach to two scanner panel data sets. Model comparison statistics indicate the superiority of the semiparametric specification and the results show that parameter and elasticity estimates are sensitive to the choice of distributional forms. Moreover, the CDPM specification yields evidence of multimodality, skewness, and outlying observations in these real data sets. \n \nData, as supplemental material, are available at http://dx.doi.org/10.1287/mnsc.2013.1811 . \n \nThis paper was accepted by J. Miguel Villas-Boas, marketing.","PeriodicalId":165362,"journal":{"name":"ERN: Discrete Regression & Qualitative Choice Models (Single) (Topic)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"41","resultStr":"{\"title\":\"A Bayesian Semiparametric Approach for Endogeneity and Heterogeneity in Choice Models\",\"authors\":\"Yang Li, Asim Ansari\",\"doi\":\"10.2139/ssrn.2270993\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Marketing variables that are included in consumer discrete choice models are often endogenous. Extant treatments using likelihood-based estimators impose parametric distributional assumptions, such as normality, on the source of endogeneity. These assumptions are restrictive because misspecified distributions have an impact on parameter estimates and associated elasticities. The normality assumption for endogeneity can be inconsistent with some marginal cost specifications given a price-setting process, although they are consistent with other specifications. In this paper, we propose a heterogeneous Bayesian semiparametric approach for modeling choice endogeneity that offers a flexible and robust alternative to parametric methods. Specifically, we construct centered Dirichlet process mixtures CDPM to allow uncertainty over the distribution of endogeneity errors. In a similar vein, we also model consumer preference heterogeneity nonparametrically via a CDPM. Results on simulated data show that incorrect distributional assumptions can lead to poor recovery of model parameters and price elasticities, whereas the proposed semiparametric model is able to robustly recover the true parameters in an efficient fashion. In addition, the CDPM offers the benefits of automatically inferring the number of mixture components that are appropriate for a given data set and is able to reconstruct the shape of the underlying distributions for endogeneity and heterogeneity errors. We apply our approach to two scanner panel data sets. Model comparison statistics indicate the superiority of the semiparametric specification and the results show that parameter and elasticity estimates are sensitive to the choice of distributional forms. Moreover, the CDPM specification yields evidence of multimodality, skewness, and outlying observations in these real data sets. \\n \\nData, as supplemental material, are available at http://dx.doi.org/10.1287/mnsc.2013.1811 . \\n \\nThis paper was accepted by J. Miguel Villas-Boas, marketing.\",\"PeriodicalId\":165362,\"journal\":{\"name\":\"ERN: Discrete Regression & Qualitative Choice Models (Single) (Topic)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-05-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"41\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ERN: Discrete Regression & Qualitative Choice Models (Single) (Topic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.2270993\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Discrete Regression & Qualitative Choice Models (Single) (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.2270993","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 41
摘要
包含在消费者离散选择模型中的营销变量通常是内生的。现有的基于似然估计的处理方法对内生性的来源施加了参数分布假设,如正态性。这些假设是限制性的,因为错误指定的分布对参数估计和相关的弹性有影响。内生性的正态性假设可能与给定定价过程的某些边际成本规格不一致,尽管它们与其他规格一致。在本文中,我们提出了一种异构贝叶斯半参数方法来建模选择内生性,它提供了一种灵活和鲁棒的替代参数方法。具体来说,我们构建了中心狄利克雷过程混合CDPM,以允许内质性误差分布的不确定性。同样,我们也通过CDPM对消费者偏好异质性进行非参数化建模。模拟数据的结果表明,不正确的分布假设会导致模型参数和价格弹性的较差恢复,而所提出的半参数模型能够有效地鲁棒恢复真实参数。此外,CDPM还提供了自动推断适合给定数据集的混合成分数量的好处,并且能够重建内质性和异质性误差的潜在分布形状。我们将我们的方法应用于两个扫描面板数据集。模型比较统计表明了半参数规范的优越性,结果表明参数估计和弹性估计对分布形式的选择很敏感。此外,CDPM规范在这些真实数据集中产生了多模态、偏态和离群观测的证据。作为补充资料的数据可在http://dx.doi.org/10.1287/mnsc.2013.1811上获得。这篇论文被市场营销学的J. Miguel Villas-Boas接受。
A Bayesian Semiparametric Approach for Endogeneity and Heterogeneity in Choice Models
Marketing variables that are included in consumer discrete choice models are often endogenous. Extant treatments using likelihood-based estimators impose parametric distributional assumptions, such as normality, on the source of endogeneity. These assumptions are restrictive because misspecified distributions have an impact on parameter estimates and associated elasticities. The normality assumption for endogeneity can be inconsistent with some marginal cost specifications given a price-setting process, although they are consistent with other specifications. In this paper, we propose a heterogeneous Bayesian semiparametric approach for modeling choice endogeneity that offers a flexible and robust alternative to parametric methods. Specifically, we construct centered Dirichlet process mixtures CDPM to allow uncertainty over the distribution of endogeneity errors. In a similar vein, we also model consumer preference heterogeneity nonparametrically via a CDPM. Results on simulated data show that incorrect distributional assumptions can lead to poor recovery of model parameters and price elasticities, whereas the proposed semiparametric model is able to robustly recover the true parameters in an efficient fashion. In addition, the CDPM offers the benefits of automatically inferring the number of mixture components that are appropriate for a given data set and is able to reconstruct the shape of the underlying distributions for endogeneity and heterogeneity errors. We apply our approach to two scanner panel data sets. Model comparison statistics indicate the superiority of the semiparametric specification and the results show that parameter and elasticity estimates are sensitive to the choice of distributional forms. Moreover, the CDPM specification yields evidence of multimodality, skewness, and outlying observations in these real data sets.
Data, as supplemental material, are available at http://dx.doi.org/10.1287/mnsc.2013.1811 .
This paper was accepted by J. Miguel Villas-Boas, marketing.