Steered Microaggregation: A Unified Primitive for Anonymization of Data Sets and Data Streams

2017 IEEE International Conference on Data Mining Workshops (ICDMW) Pub Date : 2017-11-01 DOI:10.1109/ICDMW.2017.141

J. Domingo-Ferrer, Jordi Soria-Comas

引用次数: 13

Abstract

The data anonymization landscape has become quite complex in the last decades. On the methodology side, the statistical disclosure control methods designed in official statistics have been supplemented by a number of privacy models proposed by computer scientists. On the data side, static data sets now coexist with big data, and particularly data streams. In the quest for a unified and conceptually simple anonymization approach, we present here a primitive called steered microaggregation that can be tailored to enforce various privacy models both on static data sets and also on data streams. This type of microaggregation relies on adding artificial attributes that are properly initialized and weighted in order to steer the microaggregation process into meeting certain desired constraints. Although not limited to these, we demonstrate the potential of steered microaggregation by showing how it can be used to achieve t-closeness in the context of static data sets and to achieve k-anonymity of data streams while controlling tuple reordering.

查看原文本刊更多论文

导向微聚合:数据集和数据流匿名化的统一原语

在过去的几十年里，数据匿名化已经变得相当复杂。在方法论方面，官方统计中设计的统计披露控制方法得到了计算机科学家提出的一些隐私模型的补充。在数据方面，静态数据集现在与大数据共存，尤其是数据流。为了寻求一种统一的、概念上简单的匿名化方法，我们在这里提出了一种称为定向微聚合的原语，可以对其进行定制，以在静态数据集和数据流上实施各种隐私模型。这种类型的微聚合依赖于添加适当初始化和加权的人工属性，以便引导微聚合过程满足某些期望的约束。虽然不限于这些，但我们展示了定向微聚合的潜力，展示了它如何在静态数据集的上下文中实现t-接近，并在控制元组重排序的同时实现数据流的k-匿名。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2017 IEEE International Conference on Data Mining Workshops (ICDMW)

自引率

0.00%

发文量