Online Personalized Assortment Optimization with High-Dimensional Customer Contextual Data

Sentao Miao, X. Chao
{"title":"Online Personalized Assortment Optimization with High-Dimensional Customer Contextual Data","authors":"Sentao Miao, X. Chao","doi":"10.1287/msom.2022.1128","DOIUrl":null,"url":null,"abstract":"Problem definition: Consider an online personalized assortment optimization problem in which customers arrive sequentially and make their decisions (e.g., click an ad, purchase a product) following the multinomial logit choice model with unknown parameters. Utilizing a customer’s personal information that is high-dimensional, the firm selects an assortment tailored for each individual customer’s preference. Academic/practical relevance: High dimensionality of a customer’s contextual information is prevalent in real applications, and it creates tremendous computational challenge in online personalized optimization. Methodology: In this paper, an efficient learning algorithm is developed to tackle the computational complexity issue while maintaining satisfactory performance. The algorithm first applies a random projection for dimension reduction and incorporates an online convex optimization procedure for parameter estimation, thus overcoming the issue of linearly increasing computational requirement as data accumulates. Then, it integrates the upper confidence bound method to balance the exploration and revenue exploitation. Results: The theoretical performance of the algorithm in terms of regret is derived under some plausible sparsity assumption on personal information that is observed in real data, and numerical experiments using both synthetic data and a real data set from Yahoo! show that the algorithm performs very well, having scalability and significant advantage in computational time compared with benchmark methods. Managerial implications: Our findings suggest that practitioners should process high-dimensional sparse customer data with an appropriate feature engineering technique, such as random projection (instead of abandoning the sparse portion) to maximize the effectiveness of online optimization algorithms.","PeriodicalId":18108,"journal":{"name":"Manuf. Serv. Oper. Manag.","volume":"2676 1","pages":"2741-2760"},"PeriodicalIF":0.0000,"publicationDate":"2022-07-06","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Manuf. Serv. Oper. Manag.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1287/msom.2022.1128","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

Abstract

Problem definition: Consider an online personalized assortment optimization problem in which customers arrive sequentially and make their decisions (e.g., click an ad, purchase a product) following the multinomial logit choice model with unknown parameters. Utilizing a customer’s personal information that is high-dimensional, the firm selects an assortment tailored for each individual customer’s preference. Academic/practical relevance: High dimensionality of a customer’s contextual information is prevalent in real applications, and it creates tremendous computational challenge in online personalized optimization. Methodology: In this paper, an efficient learning algorithm is developed to tackle the computational complexity issue while maintaining satisfactory performance. The algorithm first applies a random projection for dimension reduction and incorporates an online convex optimization procedure for parameter estimation, thus overcoming the issue of linearly increasing computational requirement as data accumulates. Then, it integrates the upper confidence bound method to balance the exploration and revenue exploitation. Results: The theoretical performance of the algorithm in terms of regret is derived under some plausible sparsity assumption on personal information that is observed in real data, and numerical experiments using both synthetic data and a real data set from Yahoo! show that the algorithm performs very well, having scalability and significant advantage in computational time compared with benchmark methods. Managerial implications: Our findings suggest that practitioners should process high-dimensional sparse customer data with an appropriate feature engineering technique, such as random projection (instead of abandoning the sparse portion) to maximize the effectiveness of online optimization algorithms.
基于高维客户上下文数据的在线个性化分类优化
问题定义:考虑一个在线个性化分类优化问题,其中客户依次到达并根据具有未知参数的多项logit选择模型做出决策(例如,点击广告,购买产品)。利用客户的高维个人信息,公司选择适合每个客户偏好的分类。学术/实践相关性:客户上下文信息的高维性在实际应用中很普遍,这给在线个性化优化带来了巨大的计算挑战。方法:本文开发了一种高效的学习算法来解决计算复杂性问题,同时保持令人满意的性能。该算法首先采用随机投影进行降维,并结合在线凸优化过程进行参数估计,从而克服了随着数据积累计算量线性增加的问题。然后,结合上置信度界方法,实现勘探与收益开采的平衡。结果:该算法在后悔方面的理论性能是在真实数据中观察到的个人信息的一些似是而非的稀疏性假设下推导出来的,并使用合成数据和Yahoo!结果表明,该算法性能良好,具有可扩展性和显著的计算时间优势。管理意义:我们的研究结果表明,从业者应该使用适当的特征工程技术处理高维稀疏客户数据,例如随机投影(而不是放弃稀疏部分),以最大限度地提高在线优化算法的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信