Sameer Mehta, Milind Dawande, G. Janakiraman, V. Mookerjee
Proceedings of the 2019 ACM Conference on Economics and Computation. Published 2019-06-17. DOI: 10.2139/ssrn.3333296 (https://doi.org/10.2139/ssrn.3333296)
How to Sell a Dataset? Pricing Policies for Data Monetization
The wide variety of pricing policies used in practice by data-sellers suggests that there are significant challenges in pricing datasets. The selling of a dataset -- arranged in a row-column format, where rows represent records and columns represent attributes of the records -- is more nuanced than that of information goods like telephone minutes and bandwidth, in the sense that, for a buyer, it is not only the amount of data that matters but also the type of the data. We develop a utility framework that is appropriate for data-buyers and the corresponding pricing of the data by the data-seller. A buyer interested in purchasing a dataset has private valuations in two aspects -- her ideal record that she values the most, and the rate at which her valuation for the records in the dataset decays as they differ from her ideal record. The seller allows individual (and heterogeneous) buyers to filter the dataset and select the records that are of interest to them. The multi-dimensional private information of the buyers coupled with the endogenous selection of records makes the seller's problem of optimally pricing the dataset a challenging one. We formulate a tractable model and successfully exploit its special structure to examine it both analytically and numerically. A key result we establish is that, under reasonable assumptions, a price-quantity schedule is an optimal data-selling mechanism. Such a schedule has a nuanced interpretation in the data-selling context in that buyers buy different sets of records but the price for a given number of records does not depend on the identity of the records chosen by the buyer. Even when the assumptions leading to the optimality of a price-quantity schedule do not hold, we show that the optimal price-quantity schedule offers an attractive worst-case performance guarantee relative to an optimal mechanism. 
Further, we numerically solve for the optimal mechanism and show that the actual performance of two simple and well-known price-quantity schedules -- two-part pricing and two-block pricing -- is near-optimal. We also quantify the value to the seller from allowing buyers to filter the dataset.
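To make the idea of a price-quantity schedule concrete, the following is a minimal sketch, not the authors' model. It assumes records lie on a one-dimensional attribute scale, that a buyer's valuation for a record falls linearly with its distance from her ideal record, and that the fee and rate parameters are arbitrary illustrative numbers; the abstract does not specify any of these. The function names (`record_values`, `two_part_price`, `two_block_price`, `best_purchase`) are hypothetical. The sketch shows how two buyers with different ideal records can pay the same price for the same number of records while selecting different records.

```python
from typing import Callable, Sequence, Tuple


def record_values(records: Sequence[float], ideal: float, decay: float) -> list:
    """Hypothetical valuation (an assumption): 1 - decay * |x - ideal|, floored at 0.

    Returns the buyer's per-record values sorted from most to least valued.
    """
    return sorted((max(0.0, 1.0 - decay * abs(x - ideal)) for x in records),
                  reverse=True)


def two_part_price(q: int, fixed_fee: float, unit_price: float) -> float:
    """Two-part pricing: a fixed fee plus a constant per-record price."""
    return 0.0 if q == 0 else fixed_fee + unit_price * q


def two_block_price(q: int, first_rate: float, second_rate: float,
                    breakpoint: int) -> float:
    """Two-block pricing: one rate up to the breakpoint, another rate beyond it."""
    return first_rate * min(q, breakpoint) + second_rate * max(0, q - breakpoint)


def best_purchase(values: Sequence[float],
                  price: Callable[[int], float]) -> Tuple[int, float]:
    """Pick the quantity that maximizes the buyer's surplus.

    The price depends only on the quantity q, so for any q the buyer takes
    her q most-valued records; `values` must be sorted in descending order.
    """
    best_q, best_surplus = 0, 0.0  # buying nothing yields zero surplus
    total = 0.0
    for q, v in enumerate(values, start=1):
        total += v
        surplus = total - price(q)
        if surplus > best_surplus + 1e-12:
            best_q, best_surplus = q, surplus
    return best_q, best_surplus


# Illustrative dataset: 11 records spread evenly over [0, 1].
records = [i / 10 for i in range(11)]


def two_part(q: int) -> float:
    return two_part_price(q, fixed_fee=0.5, unit_price=0.5)


for ideal in (0.5, 0.2):
    values = record_values(records, ideal, decay=2.0)
    q, surplus = best_purchase(values, two_part)
    print(f"ideal={ideal}: buys {q} records, pays {two_part(q):.2f}, "
          f"surplus {surplus:.2f}")
```

Because the schedule maps a quantity to a price, the seller never needs to know which records a buyer filtered for; only the count enters the payment. Swapping `two_part` for a closure over `two_block_price` gives the two-block variant under the same assumptions.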