Fundamental Bounds on Online Strategic Classification

Proceedings of the 24th ACM Conference on Economics and Computation Pub Date : 2023-02-23 DOI:10.1145/3580507.3597818

Saba Ahmadi, Avrim Blum, Kunhe Yang

{"title":"Fundamental Bounds on Online Strategic Classification","authors":"Saba Ahmadi, Avrim Blum, Kunhe Yang","doi":"10.1145/3580507.3597818","DOIUrl":null,"url":null,"abstract":"We study the problem of online binary classification where strategic agents can manipulate their observable features in predefined ways, modeled by a manipulation graph, in order to receive a positive classification. We show this setting differs in fundamental ways from classic (non-strategic) online classification. For instance, whereas in the non-strategic case, a mistake bound of ln |H| is achievable via the halving algorithm when the target function belongs to a known class H, we show that no deterministic algorithm can achieve a mistake bound o(Δ) in the strategic setting, where Δ is the maximum degree of the manipulation graph (even when |H| = O(Δ)). We complement this with a general algorithm achieving mistake bound O(Δ ln |H|). We also extend this to the agnostic setting, and show that this algorithm achieves a Δ multiplicative regret (mistake bound of O(Δ · OPT + Δ · ln |H|)), and that no deterministic algorithm can achieve o(Δ) multiplicative regret. Next, we study two randomized models based on whether the random choices are made before or after agents respond, and show they exhibit fundamental differences. In the first, fractional model, at each round the learner deterministically chooses a probability distribution over classifiers inducing expected values on each vertex (probabilities of being classified as positive), which the strategic agents respond to. We show that any learner in this model has to suffer linear regret. On the other hand, in the second randomized algorithms model, while the adversary who selects the next agent must respond to the learner's probability distribution over classifiers, the agent then responds to the actual hypothesis classifier drawn from this distribution. Surprisingly, we show this model is more advantageous to the learner, and we design randomized algorithms that achieve sublinear regret bounds against both oblivious and adaptive adversaries.","PeriodicalId":210555,"journal":{"name":"Proceedings of the 24th ACM Conference on Economics and Computation","volume":"20 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-02-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 24th ACM Conference on Economics and Computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3580507.3597818","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 3

Abstract

We study the problem of online binary classification where strategic agents can manipulate their observable features in predefined ways, modeled by a manipulation graph, in order to receive a positive classification. We show this setting differs in fundamental ways from classic (non-strategic) online classification. For instance, whereas in the non-strategic case, a mistake bound of ln |H| is achievable via the halving algorithm when the target function belongs to a known class H, we show that no deterministic algorithm can achieve a mistake bound o(Δ) in the strategic setting, where Δ is the maximum degree of the manipulation graph (even when |H| = O(Δ)). We complement this with a general algorithm achieving mistake bound O(Δ ln |H|). We also extend this to the agnostic setting, and show that this algorithm achieves a Δ multiplicative regret (mistake bound of O(Δ · OPT + Δ · ln |H|)), and that no deterministic algorithm can achieve o(Δ) multiplicative regret. Next, we study two randomized models based on whether the random choices are made before or after agents respond, and show they exhibit fundamental differences. In the first, fractional model, at each round the learner deterministically chooses a probability distribution over classifiers inducing expected values on each vertex (probabilities of being classified as positive), which the strategic agents respond to. We show that any learner in this model has to suffer linear regret. On the other hand, in the second randomized algorithms model, while the adversary who selects the next agent must respond to the learner's probability distribution over classifiers, the agent then responds to the actual hypothesis classifier drawn from this distribution. Surprisingly, we show this model is more advantageous to the learner, and we design randomized algorithms that achieve sublinear regret bounds against both oblivious and adaptive adversaries.

查看原文本刊更多论文

在线战略分类的基本界限

我们研究了在线二元分类问题，其中策略代理可以以预定义的方式操纵其可观察特征，通过操作图建模，以获得正分类。我们展示了这种设置与经典的(非战略性的)在线分类在基本方面的不同。例如，在非策略情况下，当目标函数属于已知类H时，通过减半算法可以实现ln |H|的错误界，我们表明，在策略设置下，没有确定性算法可以实现错误界o(Δ)，其中Δ是操作图的最大程度(即使|H| = o(Δ))。我们用一种实现错误界O(Δ ln |H|)的通用算法来补充它。我们还将其扩展到不可知论设置，并表明该算法实现了Δ乘性后悔(错误界为O(Δ·OPT + Δ·ln |H|))，并且没有确定性算法可以实现O(Δ)乘性后悔。接下来，我们研究了两个基于随机选择是在智能体响应之前还是之后做出的随机模型，并表明它们表现出根本性的差异。在第一个分数模型中，在每一轮中，学习者确定地选择一个概率分布，而分类器会在每个顶点上诱导期望值(被分类为正的概率)，策略代理会对此做出响应。我们表明，在这个模型中，任何学习者都必须承受线性遗憾。另一方面，在第二个随机算法模型中，选择下一个智能体的对手必须响应学习者在分类器上的概率分布，而智能体则响应从该分布中得出的实际假设分类器。令人惊讶的是，我们证明了这个模型对学习者更有利，并且我们设计了随机算法来实现针对无意识和自适应对手的次线性后悔界限。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 24th ACM Conference on Economics and Computation

自引率

0.00%

发文量