Miti Mazmudar, Thomas Humphries, Jiaxiang Liu, Matthew Rafuse, Xi He
{"title":"Cache Me If You Can: Accuracy-Aware Inference Engine for Differentially Private Data Exploration","authors":"Miti Mazmudar, Thomas Humphries, Jiaxiang Liu, Matthew Rafuse, Xi He","doi":"10.48550/arXiv.2211.15732","DOIUrl":null,"url":null,"abstract":"\n Differential privacy (DP) allows data analysts to query databases that contain users' sensitive information while providing a quantifiable privacy guarantee to users. Recent interactive DP systems such as APEx provide accuracy guarantees over the query responses, but fail to support a large number of queries with a limited total privacy budget, as they process incoming queries independently from past queries. We present an interactive, accuracy-aware DP query engine,\n CacheDP\n , which utilizes a differentially private cache of past responses, to answer the current workload at a lower privacy budget, while meeting strict accuracy guarantees. We integrate complex DP mechanisms with our structured cache, through novel cache-aware DP cost optimization. Our thorough evaluation illustrates that\n CacheDP\n can accurately answer various workload sequences, while lowering the privacy loss as compared to related work.\n","PeriodicalId":20467,"journal":{"name":"Proc. VLDB Endow.","volume":"38 1","pages":"574-586"},"PeriodicalIF":0.0000,"publicationDate":"2022-11-28","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proc. VLDB Endow.","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.48550/arXiv.2211.15732","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Differential privacy (DP) allows data analysts to query databases that contain users' sensitive information while providing a quantifiable privacy guarantee to users. Recent interactive DP systems such as APEx provide accuracy guarantees over the query responses, but fail to support a large number of queries with a limited total privacy budget, as they process incoming queries independently from past queries. We present an interactive, accuracy-aware DP query engine,
CacheDP
, which utilizes a differentially private cache of past responses, to answer the current workload at a lower privacy budget, while meeting strict accuracy guarantees. We integrate complex DP mechanisms with our structured cache, through novel cache-aware DP cost optimization. Our thorough evaluation illustrates that
CacheDP
can accurately answer various workload sequences, while lowering the privacy loss as compared to related work.