Muhammad Rizwan;Ammar Hawbani;Xingfu Wang;Adeel Anjum;Pelin Angin;Yigit Sever;Sanchuan Chen;Liang Zhao;Ahmed Al-Dubai
{"title":"私有和有用的1:M微数据的增强和健壮的数据发布方案","authors":"Muhammad Rizwan;Ammar Hawbani;Xingfu Wang;Adeel Anjum;Pelin Angin;Yigit Sever;Sanchuan Chen;Liang Zhao;Ahmed Al-Dubai","doi":"10.1109/TBDATA.2024.3495497","DOIUrl":null,"url":null,"abstract":"A data publishing deal conducted with anonymous microdata can preserve the privacy of people. However, anonymizing data with multiple records of an individual (1:M dataset) is still a challenging problem. After anonymizing the 1:M microdata, the vertical correlation can be exploited to launch privacy attacks. In this paper, a novel privacy preserving model <inline-formula><tex-math>$l_{c}, l_{s}$</tex-math></inline-formula>-ANGEL is proposed. To validate the new model, two privacy attacks are presented, namely, a Vertical correlation attack (<inline-formula><tex-math>$V_{c0}$</tex-math></inline-formula>) and a Vulnerable sensitive attribute attack (<inline-formula><tex-math>$V_{sa}$</tex-math></inline-formula>) on 1:M datasets, which breach the privacy of individuals. Furthermore, the proposed model is examined through High-Level Petri Nets (HLPNs). Our experiments on three real-world datasets;“INFORMS”,“YOUTUBE”, and “IMDb” demonstrate that the proposed model outperforms the state-of-the-art models. Our practices and lessons learned in this work can direct future concrete steps towards Multiple Sensitive Attributes, where we can expand the proposed model to dynamic datasets.","PeriodicalId":13106,"journal":{"name":"IEEE Transactions on Big Data","volume":"11 4","pages":"1932-1944"},"PeriodicalIF":5.7000,"publicationDate":"2024-11-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"An Enhanced and Robust Data Publishing Scheme for Private and Useful 1:M Microdata\",\"authors\":\"Muhammad Rizwan;Ammar Hawbani;Xingfu Wang;Adeel Anjum;Pelin Angin;Yigit Sever;Sanchuan Chen;Liang Zhao;Ahmed Al-Dubai\",\"doi\":\"10.1109/TBDATA.2024.3495497\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"A data publishing deal conducted with anonymous microdata can preserve the privacy of people. However, anonymizing data with multiple records of an individual (1:M dataset) is still a challenging problem. After anonymizing the 1:M microdata, the vertical correlation can be exploited to launch privacy attacks. In this paper, a novel privacy preserving model <inline-formula><tex-math>$l_{c}, l_{s}$</tex-math></inline-formula>-ANGEL is proposed. To validate the new model, two privacy attacks are presented, namely, a Vertical correlation attack (<inline-formula><tex-math>$V_{c0}$</tex-math></inline-formula>) and a Vulnerable sensitive attribute attack (<inline-formula><tex-math>$V_{sa}$</tex-math></inline-formula>) on 1:M datasets, which breach the privacy of individuals. Furthermore, the proposed model is examined through High-Level Petri Nets (HLPNs). Our experiments on three real-world datasets;“INFORMS”,“YOUTUBE”, and “IMDb” demonstrate that the proposed model outperforms the state-of-the-art models. Our practices and lessons learned in this work can direct future concrete steps towards Multiple Sensitive Attributes, where we can expand the proposed model to dynamic datasets.\",\"PeriodicalId\":13106,\"journal\":{\"name\":\"IEEE Transactions on Big Data\",\"volume\":\"11 4\",\"pages\":\"1932-1944\"},\"PeriodicalIF\":5.7000,\"publicationDate\":\"2024-11-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IEEE Transactions on Big Data\",\"FirstCategoryId\":\"94\",\"ListUrlMain\":\"https://ieeexplore.ieee.org/document/10748377/\",\"RegionNum\":3,\"RegionCategory\":\"计算机科学\",\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"Q1\",\"JCRName\":\"COMPUTER SCIENCE, INFORMATION SYSTEMS\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IEEE Transactions on Big Data","FirstCategoryId":"94","ListUrlMain":"https://ieeexplore.ieee.org/document/10748377/","RegionNum":3,"RegionCategory":"计算机科学","ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q1","JCRName":"COMPUTER SCIENCE, INFORMATION SYSTEMS","Score":null,"Total":0}
An Enhanced and Robust Data Publishing Scheme for Private and Useful 1:M Microdata
A data publishing deal conducted with anonymous microdata can preserve the privacy of people. However, anonymizing data with multiple records of an individual (1:M dataset) is still a challenging problem. After anonymizing the 1:M microdata, the vertical correlation can be exploited to launch privacy attacks. In this paper, a novel privacy preserving model $l_{c}, l_{s}$-ANGEL is proposed. To validate the new model, two privacy attacks are presented, namely, a Vertical correlation attack ($V_{c0}$) and a Vulnerable sensitive attribute attack ($V_{sa}$) on 1:M datasets, which breach the privacy of individuals. Furthermore, the proposed model is examined through High-Level Petri Nets (HLPNs). Our experiments on three real-world datasets;“INFORMS”,“YOUTUBE”, and “IMDb” demonstrate that the proposed model outperforms the state-of-the-art models. Our practices and lessons learned in this work can direct future concrete steps towards Multiple Sensitive Attributes, where we can expand the proposed model to dynamic datasets.
期刊介绍:
The IEEE Transactions on Big Data publishes peer-reviewed articles focusing on big data. These articles present innovative research ideas and application results across disciplines, including novel theories, algorithms, and applications. Research areas cover a wide range, such as big data analytics, visualization, curation, management, semantics, infrastructure, standards, performance analysis, intelligence extraction, scientific discovery, security, privacy, and legal issues specific to big data. The journal also prioritizes applications of big data in fields generating massive datasets.