Hierarchical Bayes Conjoint Choice Models - Model Framework, Bayesian Inference, Model Selection, and Interpretation of Estimation Results

Nils Goeken, P. Kurz, Winfried Steiner
{"title":"Hierarchical Bayes Conjoint Choice Models - Model Framework, Bayesian Inference, Model Selection, and Interpretation of Estimation Results","authors":"Nils Goeken, P. Kurz, Winfried Steiner","doi":"10.15358/0344-1369-2021-3-49","DOIUrl":null,"url":null,"abstract":"Choice-based conjoint (CBC) is nowadays the most widely used variant of conjoint analysis, a class of methods for measuring consumer preferences. The primary reason for the increasing dominance of the CBC approach over the last 35 years is that it closely mimics real choice behavior of consumers by asking respondents repeatedly to choose their preferred alternative from a set of several offered alternatives (choice sets). Within the framework of CBC analysis, the multinomial logit (MNL) model is the most frequently used discrete choice model due to the existence of closed form solutions for conditional choice probabilities. The popularity of CBC and the MNL model has grown even more since the introduction of hierarchical Bayesian (HB) estimation techniques that accommodate individual consumer heterogeneity in choice data, and which have now become state-of-the-art in marketing theory and practice. Still, researchers and practitioners have to make further decisions under this framework (CBC, MNL, HB estimation), such as how to represent preference heterogeneity. Here, using a normal distribution (and therefore a unimodal distribution) has become the standard approach in the marketing literature. However, the thin tails of the normal distribution suggest that the standard HB-MNL model should not be the “go-to” approach to approximate multimodal preference distributions, because individual preference patterns lying at the tails of the normal distribution (i.e., that do not fit well with the assumption of a unimodal distribution) tend to be shrunk to the population mean. This shrinkage, especially in multimodal data settings, could mask important information (e.g., new or different structures in the data). A mixture of normal distributions avoids this limited flexibility of the most simple continuous approach of assuming a unimodal prior heterogeneity distribution. There are currently two prominent HB-CBC modeling approaches embedding the mixture-of-normals (MoN) approach: the more widespread MoN-HB-MNL model, and the Dirichlet process mixture (DPM)-HB-MNL model. In this article, we review the prominent HB-MNL model (with its normal prior), the MoN-HB-MNL model, and the DPM-HB-MNL model and apply them to an empirical multi-country CBC data set. We compare the statistical performance of the three models in terms of goodness-of-fit and predictive accuracy, show how to include consumer background characteristics in the upper level of these models, and illustrate how to interpret the estimation results (with a special focus on cross-county heterogeneity). In sum, our article serves as a kind of user guide to the estimation and interpretation of Hierarchical Bayes Conjoint Choice Models. For our data, we observed that all three choice models (both with and without consumer background characteristics) resulted in a one-component solution. The DPM-HB-MNL model nevertheless yielded a higher cross-validated hit rate compared to the MoN-HB-MNL and the HB-MNL models due to its even more flexible prior assumptions. The two latter models tended to slightly overfit our empirical data, which was reflected by higher goodness-of-fit statistics but a lower predictive accuracy compared to the DPM-HB-MNL model. We showed that this result could be attributed to the weaker extent of Bayesian shrinkage of these two models. The DPM-HB-MNL model showed a stronger shrinkage effect and seems therefore somewhat more robust against overfitting. Including consumer background characteristics in terms of country of origin information for the respondents did not improve the statistical model performance (especially not the predictive performance). Still, using the country of origin information for respondents in a post-hoc segmentation analysis helped us to explain some differences in brand preferences between the five countries.","PeriodicalId":446283,"journal":{"name":"Marketing ZFP","volume":"75 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"1900-01-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Marketing ZFP","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.15358/0344-1369-2021-3-49","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Choice-based conjoint (CBC) is nowadays the most widely used variant of conjoint analysis, a class of methods for measuring consumer preferences. The primary reason for the increasing dominance of the CBC approach over the last 35 years is that it closely mimics real choice behavior of consumers by asking respondents repeatedly to choose their preferred alternative from a set of several offered alternatives (choice sets). Within the framework of CBC analysis, the multinomial logit (MNL) model is the most frequently used discrete choice model due to the existence of closed form solutions for conditional choice probabilities. The popularity of CBC and the MNL model has grown even more since the introduction of hierarchical Bayesian (HB) estimation techniques that accommodate individual consumer heterogeneity in choice data, and which have now become state-of-the-art in marketing theory and practice. Still, researchers and practitioners have to make further decisions under this framework (CBC, MNL, HB estimation), such as how to represent preference heterogeneity. Here, using a normal distribution (and therefore a unimodal distribution) has become the standard approach in the marketing literature. However, the thin tails of the normal distribution suggest that the standard HB-MNL model should not be the “go-to” approach to approximate multimodal preference distributions, because individual preference patterns lying at the tails of the normal distribution (i.e., that do not fit well with the assumption of a unimodal distribution) tend to be shrunk to the population mean. This shrinkage, especially in multimodal data settings, could mask important information (e.g., new or different structures in the data). A mixture of normal distributions avoids this limited flexibility of the most simple continuous approach of assuming a unimodal prior heterogeneity distribution. There are currently two prominent HB-CBC modeling approaches embedding the mixture-of-normals (MoN) approach: the more widespread MoN-HB-MNL model, and the Dirichlet process mixture (DPM)-HB-MNL model. In this article, we review the prominent HB-MNL model (with its normal prior), the MoN-HB-MNL model, and the DPM-HB-MNL model and apply them to an empirical multi-country CBC data set. We compare the statistical performance of the three models in terms of goodness-of-fit and predictive accuracy, show how to include consumer background characteristics in the upper level of these models, and illustrate how to interpret the estimation results (with a special focus on cross-county heterogeneity). In sum, our article serves as a kind of user guide to the estimation and interpretation of Hierarchical Bayes Conjoint Choice Models. For our data, we observed that all three choice models (both with and without consumer background characteristics) resulted in a one-component solution. The DPM-HB-MNL model nevertheless yielded a higher cross-validated hit rate compared to the MoN-HB-MNL and the HB-MNL models due to its even more flexible prior assumptions. The two latter models tended to slightly overfit our empirical data, which was reflected by higher goodness-of-fit statistics but a lower predictive accuracy compared to the DPM-HB-MNL model. We showed that this result could be attributed to the weaker extent of Bayesian shrinkage of these two models. The DPM-HB-MNL model showed a stronger shrinkage effect and seems therefore somewhat more robust against overfitting. Including consumer background characteristics in terms of country of origin information for the respondents did not improve the statistical model performance (especially not the predictive performance). Still, using the country of origin information for respondents in a post-hoc segmentation analysis helped us to explain some differences in brand preferences between the five countries.
层次贝叶斯联合选择模型-模型框架,贝叶斯推理,模型选择,和估计结果的解释
基于选择的联合分析(CBC)是目前使用最广泛的联合分析,一类测量消费者偏好的方法。在过去的35年里,CBC方法日益占据主导地位的主要原因是,它通过要求受访者反复从一组提供的备选方案(选择集)中选择他们喜欢的备选方案,密切模仿了消费者的真实选择行为。在CBC分析的框架中,由于条件选择概率的闭形式解的存在,多项logit (MNL)模型是最常用的离散选择模型。自引入层次贝叶斯(HB)估计技术以来,CBC和MNL模型的受欢迎程度进一步提高,该技术适应了个人消费者选择数据的异质性,现在已成为营销理论和实践中最先进的技术。然而,在这个框架下(CBC、MNL、HB估计),研究者和实践者还需要做出进一步的决策,例如如何表示偏好异质性。在这里,使用正态分布(因此是单峰分布)已成为营销文献中的标准方法。然而,正态分布的细尾表明,标准HB-MNL模型不应该是近似多模态偏好分布的“首选”方法,因为位于正态分布尾部的个体偏好模式(即,与单峰分布的假设不太吻合)往往会缩小到总体均值。这种收缩,特别是在多模态数据设置中,可能掩盖重要信息(例如,数据中新的或不同的结构)。正态分布的混合避免了假设单峰先验异质性分布的最简单连续方法的有限灵活性。目前有两种突出的嵌入法向混合(MoN)方法的HB-CBC建模方法:更广泛的MoN-HB-MNL模型和Dirichlet过程混合(DPM)-HB-MNL模型。在本文中,我们回顾了著名的HB-MNL模型(具有正常先验),MoN-HB-MNL模型和DPM-HB-MNL模型,并将它们应用于经验多国CBC数据集。我们比较了三种模型在拟合优度和预测精度方面的统计性能,展示了如何将消费者背景特征纳入这些模型的上层,并说明了如何解释估计结果(特别关注跨县异质性)。总之,本文对层次贝叶斯联合选择模型的估计和解释提供了一种用户指南。对于我们的数据,我们观察到所有三种选择模型(包括有和没有消费者背景特征)都导致了一个单组件解决方案。然而,由于DPM-HB-MNL模型具有更灵活的先验假设,因此与MoN-HB-MNL和HB-MNL模型相比,DPM-HB-MNL模型产生了更高的交叉验证命中率。后两个模型倾向于略微过拟合我们的经验数据,这反映在与DPM-HB-MNL模型相比,较高的拟合优度统计数据,但预测精度较低。结果表明,这一结果可归因于这两个模型的贝叶斯收缩程度较弱。DPM-HB-MNL模型显示出更强的收缩效应,因此似乎对过拟合更具鲁棒性。包括消费者背景特征方面的原产国信息对于被调查者并没有提高统计模型的性能(尤其是没有提高预测性能)。尽管如此,在事后细分分析中使用受访者的原产国信息帮助我们解释了五个国家之间品牌偏好的一些差异。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信