Dynamic pricing with waiting and price-anticipating customers

IF 3.7 4区 管理学 Q2 OPERATIONS RESEARCH & MANAGEMENT SCIENCE
Fabian Lange , Rainer Schlosser
{"title":"Dynamic pricing with waiting and price-anticipating customers","authors":"Fabian Lange ,&nbsp;Rainer Schlosser","doi":"10.1016/j.orp.2025.100337","DOIUrl":null,"url":null,"abstract":"<div><div>Over the last decades, dynamic pricing has become increasingly popular. To solve pricing problems, however, is particularly challenging if the customers’ and competitors’ behavior are both strategic and unknown. Reinforcement Learning (RL) methods are promising for solving such dynamic problems with incomplete knowledge. RL algorithms have shown to outperform rule-based competitor heuristics if the underlying Markov decision process is kept simple and customers are myopic. However, the myopic assumption is becoming increasingly unrealistic since technology like price trackers allows customers to act more strategically. To counteract unknown strategic behavior is difficult as pricing policies and consumers buying patterns influence each other and hence, approaches to iteratively update both sides sequentially are time consuming and convergence is unclear. In this work, we show how to use RL algorithms to optimize prices in the presence of different types of strategic customers that may wait and time their buying decisions. We consider strategic customers that (i) compare current prices against past prices and that (ii) anticipate future price developments. To avoid frequently updating pricing policies and consumer price forecasts, we endogenize the impact of current price decisions on the associated changes in forecast-based consumer behaviors. Besides monopoly markets, we further investigate how the interaction with strategic consumers is affected by additional competing vendors in duopoly markets and present managerial insights for all market setups and customer types.</div></div>","PeriodicalId":38055,"journal":{"name":"Operations Research Perspectives","volume":"14 ","pages":"Article 100337"},"PeriodicalIF":3.7000,"publicationDate":"2025-04-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Operations Research Perspectives","FirstCategoryId":"91","ListUrlMain":"https://www.sciencedirect.com/science/article/pii/S2214716025000132","RegionNum":4,"RegionCategory":"管理学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"OPERATIONS RESEARCH & MANAGEMENT SCIENCE","Score":null,"Total":0}
引用次数: 0

Abstract

Over the last decades, dynamic pricing has become increasingly popular. To solve pricing problems, however, is particularly challenging if the customers’ and competitors’ behavior are both strategic and unknown. Reinforcement Learning (RL) methods are promising for solving such dynamic problems with incomplete knowledge. RL algorithms have shown to outperform rule-based competitor heuristics if the underlying Markov decision process is kept simple and customers are myopic. However, the myopic assumption is becoming increasingly unrealistic since technology like price trackers allows customers to act more strategically. To counteract unknown strategic behavior is difficult as pricing policies and consumers buying patterns influence each other and hence, approaches to iteratively update both sides sequentially are time consuming and convergence is unclear. In this work, we show how to use RL algorithms to optimize prices in the presence of different types of strategic customers that may wait and time their buying decisions. We consider strategic customers that (i) compare current prices against past prices and that (ii) anticipate future price developments. To avoid frequently updating pricing policies and consumer price forecasts, we endogenize the impact of current price decisions on the associated changes in forecast-based consumer behaviors. Besides monopoly markets, we further investigate how the interaction with strategic consumers is affected by additional competing vendors in duopoly markets and present managerial insights for all market setups and customer types.
动态定价与等待和价格预期的客户
在过去的几十年里,动态定价变得越来越流行。然而,如果客户和竞争对手的行为都是战略性且未知的,那么解决定价问题就尤其具有挑战性。强化学习(RL)方法有望解决这类不完全知识的动态问题。如果潜在的马尔可夫决策过程保持简单,并且客户是短视的,RL算法已经显示出优于基于规则的竞争对手启发式算法。然而,这种目光短浅的假设正变得越来越不现实,因为像价格追踪器这样的技术让客户的行为更具战略性。要抵消未知的战略行为是困难的,因为定价政策和消费者购买模式相互影响,因此,迭代更新双方顺序的方法是耗时的,并且不清楚收敛。在这项工作中,我们展示了如何在不同类型的战略客户存在的情况下使用强化学习算法来优化价格,这些客户可能会等待并选择购买决策的时间。我们考虑的战略客户是(i)比较当前价格与过去价格和(ii)预测未来价格发展。为了避免频繁更新定价政策和消费者价格预测,我们内化了当前价格决策对基于预测的消费者行为相关变化的影响。除了垄断市场,我们进一步研究了在双寡头市场中与战略消费者的互动如何受到额外竞争供应商的影响,并提出了针对所有市场设置和客户类型的管理见解。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
Operations Research Perspectives
Operations Research Perspectives Mathematics-Statistics and Probability
CiteScore
6.40
自引率
0.00%
发文量
36
审稿时长
27 days
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信