Online Assortment Optimization with High-Dimensional Data

DecisionSciRN: Other Decision-Making in Operations Research (Topic) Pub Date : 2019-11-15 DOI:10.2139/ssrn.3521843

Xue Wang, Mike Mingcheng Wei, Tao Yao

{"title":"Online Assortment Optimization with High-Dimensional Data","authors":"Xue Wang, Mike Mingcheng Wei, Tao Yao","doi":"10.2139/ssrn.3521843","DOIUrl":null,"url":null,"abstract":"In this research, we consider an online assortment optimization problem, where a decision-maker needs to sequentially offer assortments to users instantaneously upon their arrivals and users select products from offered assortments according to the contextual multinomial logit choice model. We propose a computationally efficient Lasso-RP-MNL algorithm for the online assortment optimization problem under the cardinality constraint in high-dimensional settings. The Lasso-RP-MNL algorithm combines the Lasso and random projection as dimension reduction techniques to alleviate the computational complexity and improve the learning and estimation accuracy under high-dimensional data with limited samples. For each arriving user, the Lasso-RP-MNL algorithm constructs an upper-confidence bound for each individual product's attraction parameter, based on which the optimistic assortment can be identified by solving a reformulated linear programming problem. We demonstrate that for the feature dimension $d$ and the sample size dimension $T$, the expected cumulative regret under the Lasso-RP-MNL algorithm is upper bounded by $\\tilde{\\mathcal{O}}(\\sqrt{T}\\log d)$ asymptotically, where $\\tilde{\\mathcal{O}}$ suppresses the logarithmic dependence on $T$. Furthermore, we show that even when available samples are extremely limited, the Lasso-RP-MNL algorithm continues to perform well with a regret upper bound of $\\tilde{\\mathcal{O}}( T^{\\frac{2}{3}}\\log d)$. Finally, through synthetic-data-based experiments and a high-dimensional XianYu assortment recommendation experiment, we show that the Lasso-RP-MNL algorithm is computationally efficient and outperforms other benchmarks in terms of the expected cumulative regret. <br>","PeriodicalId":236552,"journal":{"name":"DecisionSciRN: Other Decision-Making in Operations Research (Topic)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"DecisionSciRN: Other Decision-Making in Operations Research (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3521843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

In this research, we consider an online assortment optimization problem, where a decision-maker needs to sequentially offer assortments to users instantaneously upon their arrivals and users select products from offered assortments according to the contextual multinomial logit choice model. We propose a computationally efficient Lasso-RP-MNL algorithm for the online assortment optimization problem under the cardinality constraint in high-dimensional settings. The Lasso-RP-MNL algorithm combines the Lasso and random projection as dimension reduction techniques to alleviate the computational complexity and improve the learning and estimation accuracy under high-dimensional data with limited samples. For each arriving user, the Lasso-RP-MNL algorithm constructs an upper-confidence bound for each individual product's attraction parameter, based on which the optimistic assortment can be identified by solving a reformulated linear programming problem. We demonstrate that for the feature dimension $d$ and the sample size dimension $T$, the expected cumulative regret under the Lasso-RP-MNL algorithm is upper bounded by $\tilde{\mathcal{O}}(\sqrt{T}\log d)$ asymptotically, where $\tilde{\mathcal{O}}$ suppresses the logarithmic dependence on $T$. Furthermore, we show that even when available samples are extremely limited, the Lasso-RP-MNL algorithm continues to perform well with a regret upper bound of $\tilde{\mathcal{O}}( T^{\frac{2}{3}}\log d)$. Finally, through synthetic-data-based experiments and a high-dimensional XianYu assortment recommendation experiment, we show that the Lasso-RP-MNL algorithm is computationally efficient and outperforms other benchmarks in terms of the expected cumulative regret.

查看原文本刊更多论文

基于高维数据的在线分类优化

在本研究中，我们考虑了一个在线分类优化问题，其中决策者需要在用户到达时立即顺序地向用户提供分类，用户根据上下文多项逻辑选择模型从提供的分类中选择产品。针对高维环境下基数约束下的在线分类优化问题，提出了一种计算效率高的Lasso-RP-MNL算法。Lasso- rp - mnl算法结合Lasso和随机投影作为降维技术，在有限样本的高维数据下降低了计算复杂度，提高了学习和估计精度。对于每个到达的用户，Lasso-RP-MNL算法为每个单个产品的吸引力参数构建了一个上置信度界，在此基础上，可以通过求解一个重新表述的线性规划问题来识别乐观分类。我们证明了对于特征维$d$和样本量维$T$, Lasso-RP-MNL算法下的期望累积遗憾的上界渐近为$\tilde{\mathcal{O}}(\sqrt{T}\log d)$，其中$\tilde{\mathcal{O}}$抑制了对$T$的对数依赖。此外，我们表明，即使在可用样本非常有限的情况下，Lasso-RP-MNL算法仍然表现良好，遗憾上限为$\tilde{\mathcal{O}}( T^{\frac{2}{3}}\log d)$。最后，通过基于综合数据的实验和高维的XianYu分类推荐实验，我们证明了Lasso-RP-MNL算法的计算效率很高，并且在期望累积遗憾方面优于其他基准。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

DecisionSciRN: Other Decision-Making in Operations Research (Topic)

自引率

0.00%

发文量