在收入管理数据上估算未观察到的无购买需求

ERN: Statistical Decision Theory; Operations Research (Topic) Pub Date : 2020-01-26 DOI:10.2139/ssrn.3525773

Anran Li, K. Talluri

{"title":"在收入管理数据上估算未观察到的无购买需求","authors":"Anran Li, K. Talluri","doi":"10.2139/ssrn.3525773","DOIUrl":null,"url":null,"abstract":"Discrete-choice models such as the multinomial-logit (MNL) model are increasingly being used to model customer purchase behaviour in hotels, airlines, fashion retail and e-commerce. Their estimation however has proved difficult in practice. One reason for this difficulty is well-known: most firms in retail do not, or unable to, record customers who were interested in purchasing but did not buy the product (``no-purchasers\"). Estimating demand even with the simplest discrete-choice model such as the MNL becomes challenging then as we do not know the fraction that have chosen an outside option (not purchased). Indeed, the parameters of the MNL model may not be identifiable with such data. Some previous approaches have proposed using ``market-share\" to pin down the parameter associated with the outside option. However, in many industries, market-share data is difficult to obtain, and for some, such as fashion products, has little meaning. In this paper we point out an additional difficulty that arises in practice: Many firms constantly monitor sales and optimize their prices and assortments based on partially observed demand. This leads to an optimization-induced endogeneity as the input used for estimation has been processed by optimization that takes both past data as well as future demand trends in setting controls. As we demonstrate, methods that work well on randomly generated assortments may do badly on optimized assortment data. In this paper we propose a robust method when the firm cannot observe no-purchases, has no market-share information, and the optimization-induced endogeneity exists in the data. The method is a two-step GMM (Generalized Method of Moments) procedure for which we show the estimates are consistent, and we give intuition for its robustness. In Monte-Carlo simulations the performance of our method matches existing methods on randomly generated controls, and is superior in accuracy and robustness when optimization-induced endogeneity is present. On a large real-world data set from the fashion industry --- subject to markdowns as well as stock-outs and unknown management controls --- our method provides very reasonable and robust estimates compared to existing methods.","PeriodicalId":200007,"journal":{"name":"ERN: Statistical Decision Theory; Operations Research (Topic)","volume":"26 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-01-26","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Estimating Demand With Unobserved No-Purchases on Revenue-Managed Data\",\"authors\":\"Anran Li, K. Talluri\",\"doi\":\"10.2139/ssrn.3525773\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Discrete-choice models such as the multinomial-logit (MNL) model are increasingly being used to model customer purchase behaviour in hotels, airlines, fashion retail and e-commerce. Their estimation however has proved difficult in practice. One reason for this difficulty is well-known: most firms in retail do not, or unable to, record customers who were interested in purchasing but did not buy the product (``no-purchasers\\\"). Estimating demand even with the simplest discrete-choice model such as the MNL becomes challenging then as we do not know the fraction that have chosen an outside option (not purchased). Indeed, the parameters of the MNL model may not be identifiable with such data. Some previous approaches have proposed using ``market-share\\\" to pin down the parameter associated with the outside option. However, in many industries, market-share data is difficult to obtain, and for some, such as fashion products, has little meaning. In this paper we point out an additional difficulty that arises in practice: Many firms constantly monitor sales and optimize their prices and assortments based on partially observed demand. This leads to an optimization-induced endogeneity as the input used for estimation has been processed by optimization that takes both past data as well as future demand trends in setting controls. As we demonstrate, methods that work well on randomly generated assortments may do badly on optimized assortment data. In this paper we propose a robust method when the firm cannot observe no-purchases, has no market-share information, and the optimization-induced endogeneity exists in the data. The method is a two-step GMM (Generalized Method of Moments) procedure for which we show the estimates are consistent, and we give intuition for its robustness. In Monte-Carlo simulations the performance of our method matches existing methods on randomly generated controls, and is superior in accuracy and robustness when optimization-induced endogeneity is present. On a large real-world data set from the fashion industry --- subject to markdowns as well as stock-outs and unknown management controls --- our method provides very reasonable and robust estimates compared to existing methods.\",\"PeriodicalId\":200007,\"journal\":{\"name\":\"ERN: Statistical Decision Theory; Operations Research (Topic)\",\"volume\":\"26 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2020-01-26\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"ERN: Statistical Decision Theory; Operations Research (Topic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.3525773\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"ERN: Statistical Decision Theory; Operations Research (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3525773","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 5

摘要

离散选择模型，如多项logit (MNL)模型，越来越多地被用于为酒店、航空公司、时尚零售和电子商务中的客户购买行为建模。然而，他们的估计在实践中被证明是困难的。造成这种困难的一个原因是众所周知的:大多数零售公司没有或无法记录有兴趣购买但没有购买产品的客户(“无购买者”)。即使使用最简单的离散选择模型(如MNL)估计需求也变得具有挑战性，因为我们不知道选择外部选项(未购买)的比例。实际上，MNL模型的参数可能无法通过这些数据进行识别。先前的一些方法建议使用“市场份额”来确定与外部期权相关的参数。然而，在许多行业中，市场份额数据很难获得，对于一些行业，如时尚产品，几乎没有意义。在本文中，我们指出了在实践中出现的另一个困难:许多公司不断监控销售，并根据部分观察到的需求优化价格和分类。这导致了一种优化诱导的内生性，因为用于估计的输入已经通过优化处理，该优化既采用了过去的数据，也采用了设置控制中的未来需求趋势。正如我们所展示的，在随机生成分类上工作良好的方法可能在优化分类数据上表现不佳。本文提出了当企业无法观察到无购买行为、没有市场份额信息、数据中存在优化诱导的内生性时的稳健方法。该方法是一个两步GMM(广义矩法)过程，我们证明了估计是一致的，我们对其鲁棒性给出了直观的认识。在蒙特卡罗模拟中，我们的方法的性能与随机生成控制的现有方法相匹配，并且在优化诱导的内生性存在时具有更好的准确性和鲁棒性。对于来自时尚行业的大型真实数据集(受降价、缺货和未知管理控制的影响)，与现有方法相比，我们的方法提供了非常合理和可靠的估计。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Estimating Demand With Unobserved No-Purchases on Revenue-Managed Data

Discrete-choice models such as the multinomial-logit (MNL) model are increasingly being used to model customer purchase behaviour in hotels, airlines, fashion retail and e-commerce. Their estimation however has proved difficult in practice. One reason for this difficulty is well-known: most firms in retail do not, or unable to, record customers who were interested in purchasing but did not buy the product (``no-purchasers"). Estimating demand even with the simplest discrete-choice model such as the MNL becomes challenging then as we do not know the fraction that have chosen an outside option (not purchased). Indeed, the parameters of the MNL model may not be identifiable with such data. Some previous approaches have proposed using ``market-share" to pin down the parameter associated with the outside option. However, in many industries, market-share data is difficult to obtain, and for some, such as fashion products, has little meaning. In this paper we point out an additional difficulty that arises in practice: Many firms constantly monitor sales and optimize their prices and assortments based on partially observed demand. This leads to an optimization-induced endogeneity as the input used for estimation has been processed by optimization that takes both past data as well as future demand trends in setting controls. As we demonstrate, methods that work well on randomly generated assortments may do badly on optimized assortment data. In this paper we propose a robust method when the firm cannot observe no-purchases, has no market-share information, and the optimization-induced endogeneity exists in the data. The method is a two-step GMM (Generalized Method of Moments) procedure for which we show the estimates are consistent, and we give intuition for its robustness. In Monte-Carlo simulations the performance of our method matches existing methods on randomly generated controls, and is superior in accuracy and robustness when optimization-induced endogeneity is present. On a large real-world data set from the fashion industry --- subject to markdowns as well as stock-outs and unknown management controls --- our method provides very reasonable and robust estimates compared to existing methods.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

ERN: Statistical Decision Theory; Operations Research (Topic)

自引率

0.00%

发文量