{"title":"基于高维数据的在线分类优化","authors":"Xue Wang, Mike Mingcheng Wei, Tao Yao","doi":"10.2139/ssrn.3521843","DOIUrl":null,"url":null,"abstract":"In this research, we consider an online assortment optimization problem, where a decision-maker needs to sequentially offer assortments to users instantaneously upon their arrivals and users select products from offered assortments according to the contextual multinomial logit choice model. We propose a computationally efficient Lasso-RP-MNL algorithm for the online assortment optimization problem under the cardinality constraint in high-dimensional settings. The Lasso-RP-MNL algorithm combines the Lasso and random projection as dimension reduction techniques to alleviate the computational complexity and improve the learning and estimation accuracy under high-dimensional data with limited samples. For each arriving user, the Lasso-RP-MNL algorithm constructs an upper-confidence bound for each individual product's attraction parameter, based on which the optimistic assortment can be identified by solving a reformulated linear programming problem. We demonstrate that for the feature dimension $d$ and the sample size dimension $T$, the expected cumulative regret under the Lasso-RP-MNL algorithm is upper bounded by $\\tilde{\\mathcal{O}}(\\sqrt{T}\\log d)$ asymptotically, where $\\tilde{\\mathcal{O}}$ suppresses the logarithmic dependence on $T$. Furthermore, we show that even when available samples are extremely limited, the Lasso-RP-MNL algorithm continues to perform well with a regret upper bound of $\\tilde{\\mathcal{O}}( T^{\\frac{2}{3}}\\log d)$. Finally, through synthetic-data-based experiments and a high-dimensional XianYu assortment recommendation experiment, we show that the Lasso-RP-MNL algorithm is computationally efficient and outperforms other benchmarks in terms of the expected cumulative regret. <br>","PeriodicalId":236552,"journal":{"name":"DecisionSciRN: Other Decision-Making in Operations Research (Topic)","volume":"53 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Online Assortment Optimization with High-Dimensional Data\",\"authors\":\"Xue Wang, Mike Mingcheng Wei, Tao Yao\",\"doi\":\"10.2139/ssrn.3521843\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In this research, we consider an online assortment optimization problem, where a decision-maker needs to sequentially offer assortments to users instantaneously upon their arrivals and users select products from offered assortments according to the contextual multinomial logit choice model. We propose a computationally efficient Lasso-RP-MNL algorithm for the online assortment optimization problem under the cardinality constraint in high-dimensional settings. The Lasso-RP-MNL algorithm combines the Lasso and random projection as dimension reduction techniques to alleviate the computational complexity and improve the learning and estimation accuracy under high-dimensional data with limited samples. For each arriving user, the Lasso-RP-MNL algorithm constructs an upper-confidence bound for each individual product's attraction parameter, based on which the optimistic assortment can be identified by solving a reformulated linear programming problem. We demonstrate that for the feature dimension $d$ and the sample size dimension $T$, the expected cumulative regret under the Lasso-RP-MNL algorithm is upper bounded by $\\\\tilde{\\\\mathcal{O}}(\\\\sqrt{T}\\\\log d)$ asymptotically, where $\\\\tilde{\\\\mathcal{O}}$ suppresses the logarithmic dependence on $T$. Furthermore, we show that even when available samples are extremely limited, the Lasso-RP-MNL algorithm continues to perform well with a regret upper bound of $\\\\tilde{\\\\mathcal{O}}( T^{\\\\frac{2}{3}}\\\\log d)$. Finally, through synthetic-data-based experiments and a high-dimensional XianYu assortment recommendation experiment, we show that the Lasso-RP-MNL algorithm is computationally efficient and outperforms other benchmarks in terms of the expected cumulative regret. <br>\",\"PeriodicalId\":236552,\"journal\":{\"name\":\"DecisionSciRN: Other Decision-Making in Operations Research (Topic)\",\"volume\":\"53 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"DecisionSciRN: Other Decision-Making in Operations Research (Topic)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.2139/ssrn.3521843\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"DecisionSciRN: Other Decision-Making in Operations Research (Topic)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.2139/ssrn.3521843","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
摘要
在本研究中,我们考虑了一个在线分类优化问题,其中决策者需要在用户到达时立即顺序地向用户提供分类,用户根据上下文多项逻辑选择模型从提供的分类中选择产品。针对高维环境下基数约束下的在线分类优化问题,提出了一种计算效率高的Lasso-RP-MNL算法。Lasso- rp - mnl算法结合Lasso和随机投影作为降维技术,在有限样本的高维数据下降低了计算复杂度,提高了学习和估计精度。对于每个到达的用户,Lasso-RP-MNL算法为每个单个产品的吸引力参数构建了一个上置信度界,在此基础上,可以通过求解一个重新表述的线性规划问题来识别乐观分类。我们证明了对于特征维$d$和样本量维$T$, Lasso-RP-MNL算法下的期望累积遗憾的上界渐近为$\tilde{\mathcal{O}}(\sqrt{T}\log d)$,其中$\tilde{\mathcal{O}}$抑制了对$T$的对数依赖。此外,我们表明,即使在可用样本非常有限的情况下,Lasso-RP-MNL算法仍然表现良好,遗憾上限为$\tilde{\mathcal{O}}( T^{\frac{2}{3}}\log d)$。最后,通过基于综合数据的实验和高维的XianYu分类推荐实验,我们证明了Lasso-RP-MNL算法的计算效率很高,并且在期望累积遗憾方面优于其他基准。
Online Assortment Optimization with High-Dimensional Data
In this research, we consider an online assortment optimization problem, where a decision-maker needs to sequentially offer assortments to users instantaneously upon their arrivals and users select products from offered assortments according to the contextual multinomial logit choice model. We propose a computationally efficient Lasso-RP-MNL algorithm for the online assortment optimization problem under the cardinality constraint in high-dimensional settings. The Lasso-RP-MNL algorithm combines the Lasso and random projection as dimension reduction techniques to alleviate the computational complexity and improve the learning and estimation accuracy under high-dimensional data with limited samples. For each arriving user, the Lasso-RP-MNL algorithm constructs an upper-confidence bound for each individual product's attraction parameter, based on which the optimistic assortment can be identified by solving a reformulated linear programming problem. We demonstrate that for the feature dimension $d$ and the sample size dimension $T$, the expected cumulative regret under the Lasso-RP-MNL algorithm is upper bounded by $\tilde{\mathcal{O}}(\sqrt{T}\log d)$ asymptotically, where $\tilde{\mathcal{O}}$ suppresses the logarithmic dependence on $T$. Furthermore, we show that even when available samples are extremely limited, the Lasso-RP-MNL algorithm continues to perform well with a regret upper bound of $\tilde{\mathcal{O}}( T^{\frac{2}{3}}\log d)$. Finally, through synthetic-data-based experiments and a high-dimensional XianYu assortment recommendation experiment, we show that the Lasso-RP-MNL algorithm is computationally efficient and outperforms other benchmarks in terms of the expected cumulative regret.