Preserving edits when perturbing microdata for statistical disclosure control

N. Shlomo, T. Waal
{"title":"Preserving edits when perturbing microdata for statistical disclosure control","authors":"N. Shlomo, T. Waal","doi":"10.3233/SJU-2005-22207","DOIUrl":null,"url":null,"abstract":"To protect individuals in microdata from the risk of re-identification, a general perturbative method called PRAM (the Post-Randomization Method) is sometimes used for masking records. This method adds “noise” to categorical variables by changing values of categories for a small number of records according to a prescribed probability matrix and a stochastic process based on the outcome of a random multinomial draw. Changing values of categorical variables, however, will cause fully edited and clean records in microdata to start failing edit constraints resulting in data of low utility. In addition, an inconsistent record pinpoints to a potential attacker that the record was perturbed and attempts can be made to unmask the data. Therefore, the perturbation process must take into account micro edit constraints which will ensure that perturbed microdata satisfy all edits. Macro edit constraints which take the form of information loss measures also need to be defined in order to ensure that the overall utility of the data will not be badly compromised given an acceptable level of disclosure risk. This paper will discuss methods for perturbing microdata using PRAM while minimizing micro and macro edit failures. (Updated 10th August 2005)","PeriodicalId":85585,"journal":{"name":"Statistical journal of the United Nations Economic Commission for Europe","volume":"76 1","pages":"173-185"},"PeriodicalIF":0.0000,"publicationDate":"2006-01-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"8","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Statistical journal of the United Nations Economic Commission for Europe","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.3233/SJU-2005-22207","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 8

Abstract

To protect individuals in microdata from the risk of re-identification, a general perturbative method called PRAM (the Post-Randomization Method) is sometimes used for masking records. This method adds “noise” to categorical variables by changing values of categories for a small number of records according to a prescribed probability matrix and a stochastic process based on the outcome of a random multinomial draw. Changing values of categorical variables, however, will cause fully edited and clean records in microdata to start failing edit constraints resulting in data of low utility. In addition, an inconsistent record pinpoints to a potential attacker that the record was perturbed and attempts can be made to unmask the data. Therefore, the perturbation process must take into account micro edit constraints which will ensure that perturbed microdata satisfy all edits. Macro edit constraints which take the form of information loss measures also need to be defined in order to ensure that the overall utility of the data will not be badly compromised given an acceptable level of disclosure risk. This paper will discuss methods for perturbing microdata using PRAM while minimizing micro and macro edit failures. (Updated 10th August 2005)
为统计披露控制而干扰微数据时保留编辑
为了保护微数据中的个体免受重新识别的风险,有时使用一种称为PRAM(后随机化方法)的一般摄动方法来屏蔽记录。该方法通过根据规定的概率矩阵和基于随机多项抽取结果的随机过程改变少数记录的类别值,从而为分类变量添加“噪声”。然而,改变分类变量的值将导致微数据中完全编辑和干净的记录开始失效编辑约束,从而导致数据的低效用。此外,不一致的记录可以向潜在的攻击者指出记录受到了干扰,并且可以尝试揭开数据的面纱。因此,摄动过程必须考虑微编辑约束,这将确保被摄动的微数据满足所有编辑。还需要定义采用信息丢失措施形式的宏编辑约束,以确保在可接受的披露风险水平下,数据的总体效用不会受到严重损害。本文将讨论使用PRAM干扰微数据的方法,同时最大限度地减少微观和宏观编辑失败。(2005年8月10日更新)
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信