Mechanism design for data science

Proceedings of the fifteenth ACM conference on Economics and computation Pub Date : 2014-04-23 DOI:10.1145/2600057.2602881

Shuchi Chawla, Jason D. Hartline, Denis Nekipelov

{"title":"Mechanism design for data science","authors":"Shuchi Chawla, Jason D. Hartline, Denis Nekipelov","doi":"10.1145/2600057.2602881","DOIUrl":null,"url":null,"abstract":"The promise of data science is that if data from a system can be recorded and understood then this understanding can potentially be utilized to improve the system. Behavioral and economic data, however, is different from scientific data in that it is subjective to the system. Behavior changes when the system changes, and to predict behavior for any given system change or to optimize over system changes, the behavioral model that generates the data must be inferred from the data. The ease with which this inference can be performed generally also depends on the system. Trivially, a system that ignores behavior does not admit any inference of a behavior generating model that can be used to predict behavior in a system that is responsive to behavior. To realize the promise of data science in economic systems, a theory for the design of such systems must also incorporate the desired inference properties. Consider as an example the revenue-maximizing auctioneer. If the auctioneer has knowledge of the distribution of bidder values then she can run the first-price auction with a reserve price that is tuned to the distribution. Under some mild distributional assumptions, with the appropriate reserve price the first-price auction is revenue optimal [Myerson 1981]. Notice that the historical bid data for the first-price auction with a reserve price will in most cases not have bids for bidders whose values are below the reserve. Therefore, there is no data analysis that the auctioneer can perform that will enable properties of the distribution of bidder values below the reserve price to be inferred. It could be, nonetheless, that over time the population of potential bidders evolves and the optimal reserve price lowers. This change could go completely unnoticed in the auctioneer's data. The two main tools for optimizing revenue in an auction are reserve prices (as above) and ironing. Both of these tools cause pooling behavior (i.e., bidders with distinct values take the same action) and economic inference cannot thereafter differentiate these pooled bidders. In order to maintain the distributional knowledge necessary to be able to run a good auction in the long term, the auctioneer must sacrifice the short-term revenue by running a non-revenue-optimal auction.","PeriodicalId":203155,"journal":{"name":"Proceedings of the fifteenth ACM conference on Economics and computation","volume":"34 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-04-23","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"48","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the fifteenth ACM conference on Economics and computation","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2600057.2602881","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 48

Abstract

The promise of data science is that if data from a system can be recorded and understood then this understanding can potentially be utilized to improve the system. Behavioral and economic data, however, is different from scientific data in that it is subjective to the system. Behavior changes when the system changes, and to predict behavior for any given system change or to optimize over system changes, the behavioral model that generates the data must be inferred from the data. The ease with which this inference can be performed generally also depends on the system. Trivially, a system that ignores behavior does not admit any inference of a behavior generating model that can be used to predict behavior in a system that is responsive to behavior. To realize the promise of data science in economic systems, a theory for the design of such systems must also incorporate the desired inference properties. Consider as an example the revenue-maximizing auctioneer. If the auctioneer has knowledge of the distribution of bidder values then she can run the first-price auction with a reserve price that is tuned to the distribution. Under some mild distributional assumptions, with the appropriate reserve price the first-price auction is revenue optimal [Myerson 1981]. Notice that the historical bid data for the first-price auction with a reserve price will in most cases not have bids for bidders whose values are below the reserve. Therefore, there is no data analysis that the auctioneer can perform that will enable properties of the distribution of bidder values below the reserve price to be inferred. It could be, nonetheless, that over time the population of potential bidders evolves and the optimal reserve price lowers. This change could go completely unnoticed in the auctioneer's data. The two main tools for optimizing revenue in an auction are reserve prices (as above) and ironing. Both of these tools cause pooling behavior (i.e., bidders with distinct values take the same action) and economic inference cannot thereafter differentiate these pooled bidders. In order to maintain the distributional knowledge necessary to be able to run a good auction in the long term, the auctioneer must sacrifice the short-term revenue by running a non-revenue-optimal auction.

查看原文本刊更多论文

数据科学机制设计

数据科学的前景是，如果来自系统的数据可以被记录和理解，那么这种理解就有可能被用来改进系统。然而，行为和经济数据不同于科学数据，因为它对系统来说是主观的。当系统发生变化时，行为也会发生变化，为了预测任何给定系统变化的行为或优化系统变化，必须从数据中推断生成数据的行为模型。执行这种推理的难易程度通常也取决于系统。简单地说，一个忽略行为的系统不承认任何行为生成模型的推论，而这个模型可以用来预测一个对行为有反应的系统的行为。为了在经济系统中实现数据科学的承诺，设计这种系统的理论也必须包含所需的推理属性。以收益最大化的拍卖师为例。如果拍卖师了解竞标者价值的分布，那么她可以进行第一价格拍卖，保留价格根据分布进行调整。在一些温和的分配假设下，在适当的保留价格下，首价拍卖是收益最优的[Myerson 1981]。请注意，带有保留价格的第一价格拍卖的历史出价数据在大多数情况下不会包含价值低于保留价格的竞标者的出价。因此，拍卖师无法进行数据分析，从而推断出低于底价的竞标者价值分布的属性。然而，随着时间的推移，潜在竞购者的数量可能会不断增加，而最优底价可能会降低。这种变化在拍卖商的数据中可能完全没有被注意到。优化拍卖收益的两个主要工具是保留价格(如上所述)和熨烫。这两种工具都会导致集中行为(即，具有不同价值的投标人采取相同的行动)，因此经济推理无法区分这些集中的投标人。为了维持长期良好拍卖所必需的分销知识，拍卖商必须通过进行非收益最优拍卖来牺牲短期收益。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the fifteenth ACM conference on Economics and computation

自引率

0.00%

发文量