{"title":"Fidelity loss in distribution-preserving anonymization and histogram equalization","authors":"L. Varshney, Kush R. Varshney","doi":"10.1109/CISS.2016.7460471","DOIUrl":null,"url":null,"abstract":"In this paper, we show a formal equivalence between histogram equalization and distribution-preserving quantization. We use this equivalence to connect histogram equalization to quantization for preserving anonymity under the k-anonymity metric, while maintaining distributional properties for data analytics applications. Finally, we make connections to mismatched quantization. These relationships allow us to characterize the loss in mean-squared error (MSE) performance of privacy-preserving quantizers that must meet distribution-preservation constraints as compared to MSE-optimal quantizers in the high-rate regime. Thus, we obtain a formal characterization of the cost of anonymity.","PeriodicalId":346776,"journal":{"name":"2016 Annual Conference on Information Science and Systems (CISS)","volume":"35 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-03-16","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Annual Conference on Information Science and Systems (CISS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CISS.2016.7460471","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
In this paper, we show a formal equivalence between histogram equalization and distribution-preserving quantization. We use this equivalence to connect histogram equalization to quantization for preserving anonymity under the k-anonymity metric, while maintaining distributional properties for data analytics applications. Finally, we make connections to mismatched quantization. These relationships allow us to characterize the loss in mean-squared error (MSE) performance of privacy-preserving quantizers that must meet distribution-preservation constraints as compared to MSE-optimal quantizers in the high-rate regime. Thus, we obtain a formal characterization of the cost of anonymity.