Exploring the Tradeoff Between Privacy and Utility of Complete-count Census Data Using a Multiobjective Optimization Approach
Yue Lin, Ningchuan Xiao
Geographical Analysis, vol. 56, no. 3, pp. 427-450. Published 2024-01-30. DOI: 10.1111/gean.12388
https://onlinelibrary.wiley.com/doi/10.1111/gean.12388
Privacy and utility are two important objectives to consider when releasing census data. These objectives often conflict, because protecting privacy usually requires introducing noise into the data, which reduces its utility. Determining the appropriate level of privacy protection is therefore a significant challenge in data release, and the tradeoff between privacy and utility should be investigated before a final decision is made. In this article, we propose a multiobjective optimization framework that generates multiple optimal solutions with respect to the two objectives of privacy and utility and supports analysis of the tradeoff between them for decision-making. The framework protects privacy by relocating individuals who are susceptible to having their identities revealed. We maximize the number of individuals relocated while maximizing the utility of the data after relocation. The proposed framework is tested using synthetic population data for Franklin County, Ohio. Our experimental results show that the framework can efficiently generate a collection of optimal solutions and can be used to effectively balance privacy and utility.
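To make the idea of "multiple optimal solutions" concrete, the sketch below shows one common way to read it: keep only the candidate solutions that no other candidate beats on both objectives (the Pareto front). This is an illustration only, not the authors' algorithm. The `Solution` tuple, the function names, and the candidate numbers are all hypothetical, and the sketch assumes that each candidate can be summarized by a count of relocated individuals and a single utility score, both to be maximized.

```python
from typing import List, Tuple

# Hypothetical summary of one candidate release plan:
# (number of individuals relocated, utility score). Both are maximized.
Solution = Tuple[int, float]


def dominates(a: Solution, b: Solution) -> bool:
    """True if a is at least as good as b on both objectives and strictly better on one."""
    return a[0] >= b[0] and a[1] >= b[1] and (a[0] > b[0] or a[1] > b[1])


def pareto_front(candidates: List[Solution]) -> List[Solution]:
    """Return the candidates that no other candidate dominates, sorted by relocations."""
    front = [c for c in candidates if not any(dominates(o, c) for o in candidates)]
    return sorted(set(front))


# Made-up numbers for illustration; they are not results from the article.
candidates = [(0, 1.00), (10, 0.95), (10, 0.90), (25, 0.80), (20, 0.70), (40, 0.55)]
front = pareto_front(candidates)
for relocated, utility in front:
    print(f"relocated={relocated:3d}  utility={utility:.2f}")
```

Each point on the resulting front trades some utility for more relocations, so a decision-maker can inspect the whole curve before choosing a level of protection, rather than fixing that level in advance.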
About the journal:
First in its specialty area and one of the most frequently cited publications in geography, Geographical Analysis has, since 1969, presented significant advances in geographical theory, model building, and quantitative methods to geographers and scholars in a wide spectrum of related fields. Traditionally, mathematical and nonmathematical articulations of geographical theory, and statements and discussions of the analytic paradigm are published in the journal. Spatial data analyses and spatial econometrics and statistics are strongly represented.