Selling Joint Ads: A Regret Minimization Perspective

Gagan Aggarwal, Ashwinkumar Badanidiyuru, Paul Dütting, Federico Fusco
{"title":"Selling Joint Ads: A Regret Minimization Perspective","authors":"Gagan Aggarwal, Ashwinkumar Badanidiyuru, Paul Dütting, Federico Fusco","doi":"arxiv-2409.07819","DOIUrl":null,"url":null,"abstract":"Motivated by online retail, we consider the problem of selling one item\n(e.g., an ad slot) to two non-excludable buyers (say, a merchant and a brand).\nThis problem captures, for example, situations where a merchant and a brand\ncooperatively bid in an auction to advertise a product, and both benefit from\nthe ad being shown. A mechanism collects bids from the two and decides whether\nto allocate and which payments the two parties should make. This gives rise to\nintricate incentive compatibility constraints, e.g., on how to split payments\nbetween the two parties. We approach the problem of finding a\nrevenue-maximizing incentive-compatible mechanism from an online learning\nperspective; this poses significant technical challenges. First, the action\nspace (the class of all possible mechanisms) is huge; second, the function that\nmaps mechanisms to revenue is highly irregular, ruling out standard\ndiscretization-based approaches. In the stochastic setting, we design an efficient learning algorithm\nachieving a regret bound of $O(T^{3/4})$. Our approach is based on an adaptive\ndiscretization scheme of the space of mechanisms, as any non-adaptive\ndiscretization fails to achieve sublinear regret. In the adversarial setting,\nwe exploit the non-Lipschitzness of the problem to prove a strong negative\nresult, namely that no learning algorithm can achieve more than half of the\nrevenue of the best fixed mechanism in hindsight. We then consider the\n$\\sigma$-smooth adversary; we construct an efficient learning algorithm that\nachieves a regret bound of $O(T^{2/3})$ and builds on a succinct encoding of\nexponentially many experts. Finally, we prove that no learning algorithm can\nachieve less than $\\Omega(\\sqrt T)$ regret in both the stochastic and the\nsmooth setting, thus narrowing the range where the minimax regret rates for\nthese two problems lie.","PeriodicalId":501316,"journal":{"name":"arXiv - CS - Computer Science and Game Theory","volume":"29 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-12","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Computer Science and Game Theory","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.07819","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Motivated by online retail, we consider the problem of selling one item (e.g., an ad slot) to two non-excludable buyers (say, a merchant and a brand). This problem captures, for example, situations where a merchant and a brand cooperatively bid in an auction to advertise a product, and both benefit from the ad being shown. A mechanism collects bids from the two and decides whether to allocate and which payments the two parties should make. This gives rise to intricate incentive compatibility constraints, e.g., on how to split payments between the two parties. We approach the problem of finding a revenue-maximizing incentive-compatible mechanism from an online learning perspective; this poses significant technical challenges. First, the action space (the class of all possible mechanisms) is huge; second, the function that maps mechanisms to revenue is highly irregular, ruling out standard discretization-based approaches. In the stochastic setting, we design an efficient learning algorithm achieving a regret bound of $O(T^{3/4})$. Our approach is based on an adaptive discretization scheme of the space of mechanisms, as any non-adaptive discretization fails to achieve sublinear regret. In the adversarial setting, we exploit the non-Lipschitzness of the problem to prove a strong negative result, namely that no learning algorithm can achieve more than half of the revenue of the best fixed mechanism in hindsight. We then consider the $\sigma$-smooth adversary; we construct an efficient learning algorithm that achieves a regret bound of $O(T^{2/3})$ and builds on a succinct encoding of exponentially many experts. Finally, we prove that no learning algorithm can achieve less than $\Omega(\sqrt T)$ regret in both the stochastic and the smooth setting, thus narrowing the range where the minimax regret rates for these two problems lie.
销售联合广告:后悔最小化视角
受在线零售业的启发,我们考虑了向两个非排他性买家(例如,一个商家和一个品牌)出售一件商品(例如,一个广告时段)的问题。举例来说,这个问题捕捉了商家和品牌在拍卖中合作出价为产品做广告,并且双方都从广告展示中获益的情况。一种机制会收集双方的出价,并决定是否分配以及双方应支付的款项。这就产生了错综复杂的激励相容性约束,例如,如何在双方之间分配付款。我们从在线学习的角度来解决寻找途径最大化激励兼容机制的问题;这带来了重大的技术挑战。首先,行动空间(所有可能机制的类别)是巨大的;其次,将机制映射到收入的函数极不规则,排除了基于标准具体化的方法。在随机设置中,我们设计了一种高效的学习算法,可实现 $O(T^{3/4})$的遗憾约束。我们的方法基于机制空间的自适应具体化方案,因为任何非自适应具体化方案都无法实现亚线性遗憾。在对抗性环境中,我们利用问题的非利普斯奇兹性证明了一个强有力的否定结果,即任何学习算法都无法在事后获得最佳固定机制一半以上的收益。然后,我们考虑了$\sigma$光滑对手;我们构建了一种高效的学习算法,它能实现$O(T^{2/3})$的遗憾约束,并建立在幂次多专家的简洁编码之上。最后,我们证明了在随机和光滑环境下,没有一种学习算法能达到小于 $\Omega(\sqrt T)$ 的遗憾率,从而缩小了这两个问题的最小遗憾率范围。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信