Ali Shahin Shamsabadi, Peter Snyder, Ralph Giles, Aurélien Bellet, Hamed Haddadi
{"title":"星云高效、私密、精确的直方图估算","authors":"Ali Shahin Shamsabadi, Peter Snyder, Ralph Giles, Aurélien Bellet, Hamed Haddadi","doi":"arxiv-2409.09676","DOIUrl":null,"url":null,"abstract":"We present Nebula, a system for differential private histogram estimation of\ndata distributed among clients. Nebula enables clients to locally subsample and\nencode their data such that an untrusted server learns only data values that\nmeet an aggregation threshold to satisfy differential privacy guarantees.\nCompared with other private histogram estimation systems, Nebula uniquely\nachieves all of the following: \\textit{i)} a strict upper bound on privacy\nleakage; \\textit{ii)} client privacy under realistic trust assumptions;\n\\textit{iii)} significantly better utility compared to standard local\ndifferential privacy systems; and \\textit{iv)} avoiding trusted third-parties,\nmulti-party computation, or trusted hardware. We provide both a formal\nevaluation of Nebula's privacy, utility and efficiency guarantees, along with\nan empirical evaluation on three real-world datasets. We demonstrate that\nclients can encode and upload their data efficiently (only 0.0058 seconds\nrunning time and 0.0027 MB data communication) and privately (strong\ndifferential privacy guarantees $\\varepsilon=1$). On the United States Census\ndataset, the Nebula's untrusted aggregation server estimates histograms with\nabove 88\\% better utility than the existing local deployment of differential\nprivacy. Additionally, we describe a variant that allows clients to submit\nmulti-dimensional data, with similar privacy, utility, and performance.\nFinally, we provide an open source implementation of Nebula.","PeriodicalId":501332,"journal":{"name":"arXiv - CS - Cryptography and Security","volume":"19 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Nebula: Efficient, Private and Accurate Histogram Estimation\",\"authors\":\"Ali Shahin Shamsabadi, Peter Snyder, Ralph Giles, Aurélien Bellet, Hamed Haddadi\",\"doi\":\"arxiv-2409.09676\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"We present Nebula, a system for differential private histogram estimation of\\ndata distributed among clients. Nebula enables clients to locally subsample and\\nencode their data such that an untrusted server learns only data values that\\nmeet an aggregation threshold to satisfy differential privacy guarantees.\\nCompared with other private histogram estimation systems, Nebula uniquely\\nachieves all of the following: \\\\textit{i)} a strict upper bound on privacy\\nleakage; \\\\textit{ii)} client privacy under realistic trust assumptions;\\n\\\\textit{iii)} significantly better utility compared to standard local\\ndifferential privacy systems; and \\\\textit{iv)} avoiding trusted third-parties,\\nmulti-party computation, or trusted hardware. We provide both a formal\\nevaluation of Nebula's privacy, utility and efficiency guarantees, along with\\nan empirical evaluation on three real-world datasets. We demonstrate that\\nclients can encode and upload their data efficiently (only 0.0058 seconds\\nrunning time and 0.0027 MB data communication) and privately (strong\\ndifferential privacy guarantees $\\\\varepsilon=1$). On the United States Census\\ndataset, the Nebula's untrusted aggregation server estimates histograms with\\nabove 88\\\\% better utility than the existing local deployment of differential\\nprivacy. Additionally, we describe a variant that allows clients to submit\\nmulti-dimensional data, with similar privacy, utility, and performance.\\nFinally, we provide an open source implementation of Nebula.\",\"PeriodicalId\":501332,\"journal\":{\"name\":\"arXiv - CS - Cryptography and Security\",\"volume\":\"19 1\",\"pages\":\"\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2024-09-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"arXiv - CS - Cryptography and Security\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/arxiv-2409.09676\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Cryptography and Security","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.09676","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Nebula: Efficient, Private and Accurate Histogram Estimation
We present Nebula, a system for differential private histogram estimation of
data distributed among clients. Nebula enables clients to locally subsample and
encode their data such that an untrusted server learns only data values that
meet an aggregation threshold to satisfy differential privacy guarantees.
Compared with other private histogram estimation systems, Nebula uniquely
achieves all of the following: \textit{i)} a strict upper bound on privacy
leakage; \textit{ii)} client privacy under realistic trust assumptions;
\textit{iii)} significantly better utility compared to standard local
differential privacy systems; and \textit{iv)} avoiding trusted third-parties,
multi-party computation, or trusted hardware. We provide both a formal
evaluation of Nebula's privacy, utility and efficiency guarantees, along with
an empirical evaluation on three real-world datasets. We demonstrate that
clients can encode and upload their data efficiently (only 0.0058 seconds
running time and 0.0027 MB data communication) and privately (strong
differential privacy guarantees $\varepsilon=1$). On the United States Census
dataset, the Nebula's untrusted aggregation server estimates histograms with
above 88\% better utility than the existing local deployment of differential
privacy. Additionally, we describe a variant that allows clients to submit
multi-dimensional data, with similar privacy, utility, and performance.
Finally, we provide an open source implementation of Nebula.