社会福利的适应性最大化

IF 7.1 1区经济学 Q1 ECONOMICS

Econometrica Pub Date : 2025-06-10 DOI:10.3982/ECTA22351

Nicolò Cesa-Bianchi, Roberto Colomboni, Maximilian Kasy

{"title":"社会福利的适应性最大化","authors":"Nicolò Cesa-Bianchi, Roberto Colomboni, Maximilian Kasy","doi":"10.3982/ECTA22351","DOIUrl":null,"url":null,"abstract":"<div>\n We consider the problem of repeatedly choosing policies to maximize social welfare. Welfare is a weighted sum of private utility and public revenue. Earlier outcomes inform later policies. Utility is not observed, but indirectly inferred. Response functions are learned through experimentation.\n We derive a lower bound on regret, and a matching adversarial upper bound for a variant of the Exp3 algorithm. Cumulative regret grows at a rate of T2/3. This implies that (i) welfare maximization is harder than the multiarmed bandit problem (with a rate of T1/2 for finite policy sets), and (ii) our algorithm achieves the optimal rate. For the stochastic setting, if social welfare is concave, we can achieve a rate of T1/2 (for continuous policy sets), using a dyadic search algorithm.\n We analyze an extension to nonlinear income taxation, and sketch an extension to commodity taxation. We compare our setting to monopoly pricing (which is easier), and price setting for bilateral trade (which is harder).\n </div>","PeriodicalId":50556,"journal":{"name":"Econometrica","volume":"93 3","pages":"1073-1104"},"PeriodicalIF":7.1000,"publicationDate":"2025-06-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"https://onlinelibrary.wiley.com/doi/epdf/10.3982/ECTA22351","citationCount":"0","resultStr":"{\"title\":\"Adaptive Maximization of Social Welfare\",\"authors\":\"Nicolò Cesa-Bianchi, Roberto Colomboni, Maximilian Kasy\",\"doi\":\"10.3982/ECTA22351\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"<div>\\n We consider the problem of repeatedly choosing policies to maximize social welfare. Welfare is a weighted sum of private utility and public revenue. Earlier outcomes inform later policies. Utility is not observed, but indirectly inferred. Response functions are learned through experimentation.\\n We derive a lower bound on regret, and a matching adversarial upper bound for a variant of the Exp3 algorithm. Cumulative regret grows at a rate of T2/3. This implies that (i) welfare maximization is harder than the multiarmed bandit problem (with a rate of T1/2 for finite policy sets), and (ii) our algorithm achieves the optimal rate. For the stochastic setting, if social welfare is concave, we can achieve a rate of T1/2 (for continuous policy sets), using a dyadic search algorithm.\\n We analyze an extension to nonlinear income taxation, and sketch an extension to commodity taxation. We compare our setting to monopoly pricing (which is easier), and price setting for bilateral trade (which is harder).\\n </div>\",\"PeriodicalId\":50556,\"journal\":{\"name\":\"Econometrica\",\"volume\":\"93 3\",\"pages\":\"1073-1104\"},\"PeriodicalIF\":7.1000,\"publicationDate\":\"2025-06-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"https://onlinelibrary.wiley.com/doi/epdf/10.3982/ECTA22351\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Econometrica\",\"FirstCategoryId\":\"96\",\"ListUrlMain\":\"https://onlinelibrary.wiley.com/doi/10.3982/ECTA22351\",\"RegionNum\":1,\"RegionCategory\":\"经济学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"ECONOMICS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Econometrica","FirstCategoryId":"96","ListUrlMain":"https://onlinelibrary.wiley.com/doi/10.3982/ECTA22351","RegionNum":1,"RegionCategory":"经济学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"ECONOMICS","Score":null,"Total":0}

引用次数: 0

摘要

我们考虑反复选择政策以使社会福利最大化的问题。福利是私人效用和公共收入的加权总和。早期的结果会影响后来的政策。效用不是观察到的，而是间接推断出来的。反应函数是通过实验来学习的。我们为Exp3算法的一个变体导出了遗憾的下界和匹配的对抗上界。累积的后悔以T2/3的速度增长。这意味着(i)福利最大化比多武装强盗问题（有限策略集的比率为T1/2）更难，并且（ii）我们的算法实现了最优比率。对于随机设置，如果社会福利是凹的，我们可以使用二进搜索算法实现T1/2的比率（对于连续的政策集）。我们分析了非线性所得税的推广，并概述了商品税的推广。我们将我们的设定与垄断定价（比较容易）和双边贸易定价（比较困难）进行比较。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

Adaptive Maximization of Social Welfare

查看原文本刊更多论文

Adaptive Maximization of Social Welfare

We consider the problem of repeatedly choosing policies to maximize social welfare. Welfare is a weighted sum of private utility and public revenue. Earlier outcomes inform later policies. Utility is not observed, but indirectly inferred. Response functions are learned through experimentation.

We derive a lower bound on regret, and a matching adversarial upper bound for a variant of the Exp3 algorithm. Cumulative regret grows at a rate of T^2/3. This implies that (i) welfare maximization is harder than the multiarmed bandit problem (with a rate of T^1/2 for finite policy sets), and (ii) our algorithm achieves the optimal rate. For the stochastic setting, if social welfare is concave, we can achieve a rate of T^1/2 (for continuous policy sets), using a dyadic search algorithm.

We analyze an extension to nonlinear income taxation, and sketch an extension to commodity taxation. We compare our setting to monopoly pricing (which is easier), and price setting for bilateral trade (which is harder).

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

Econometrica 社会科学-数学跨学科应用

CiteScore

11.00

自引率

3.30%

发文量

审稿时长

6-12 weeks

期刊介绍： Econometrica publishes original articles in all branches of economics - theoretical and empirical, abstract and applied, providing wide-ranging coverage across the subject area. It promotes studies that aim at the unification of the theoretical-quantitative and the empirical-quantitative approach to economic problems and that are penetrated by constructive and rigorous thinking. It explores a unique range of topics each year - from the frontier of theoretical developments in many new and important areas, to research on current and applied economic problems, to methodologically innovative, theoretical and applied studies in econometrics. Econometrica maintains a long tradition that submitted articles are refereed carefully and that detailed and thoughtful referee reports are provided to the author as an aid to scientific research, thus ensuring the high calibre of papers found in Econometrica. An international board of editors, together with the referees it has selected, has succeeded in substantially reducing editorial turnaround time, thereby encouraging submissions of the highest quality. We strongly encourage recent Ph. D. graduates to submit their work to Econometrica. Our policy is to take into account the fact that recent graduates are less experienced in the process of writing and submitting papers.