{"title":"Steered Microaggregation: A Unified Primitive for Anonymization of Data Sets and Data Streams","authors":"J. Domingo-Ferrer, Jordi Soria-Comas","doi":"10.1109/ICDMW.2017.141","DOIUrl":null,"url":null,"abstract":"The data anonymization landscape has become quite complex in the last decades. On the methodology side, the statistical disclosure control methods designed in official statistics have been supplemented by a number of privacy models proposed by computer scientists. On the data side, static data sets now coexist with big data, and particularly data streams. In the quest for a unified and conceptually simple anonymization approach, we present here a primitive called steered microaggregation that can be tailored to enforce various privacy models both on static data sets and also on data streams. This type of microaggregation relies on adding artificial attributes that are properly initialized and weighted in order to steer the microaggregation process into meeting certain desired constraints. Although not limited to these, we demonstrate the potential of steered microaggregation by showing how it can be used to achieve t-closeness in the context of static data sets and to achieve k-anonymity of data streams while controlling tuple reordering.","PeriodicalId":389183,"journal":{"name":"2017 IEEE International Conference on Data Mining Workshops (ICDMW)","volume":"111 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"13","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Data Mining Workshops (ICDMW)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICDMW.2017.141","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 13
Abstract
The data anonymization landscape has become quite complex in the last decades. On the methodology side, the statistical disclosure control methods designed in official statistics have been supplemented by a number of privacy models proposed by computer scientists. On the data side, static data sets now coexist with big data, and particularly data streams. In the quest for a unified and conceptually simple anonymization approach, we present here a primitive called steered microaggregation that can be tailored to enforce various privacy models both on static data sets and also on data streams. This type of microaggregation relies on adding artificial attributes that are properly initialized and weighted in order to steer the microaggregation process into meeting certain desired constraints. Although not limited to these, we demonstrate the potential of steered microaggregation by showing how it can be used to achieve t-closeness in the context of static data sets and to achieve k-anonymity of data streams while controlling tuple reordering.