基于拉普拉斯噪声附加的隐空间轨迹匿名化

2021 Ninth International Symposium on Computing and Networking (CANDAR) Pub Date : 2021-11-01 DOI:10.1109/CANDAR53791.2021.00016

Yuiko Sakuma, Thai P. Tran, Tomomu Iwai, Akihito Nishikawa, Hiroaki Nishi

{"title":"基于拉普拉斯噪声附加的隐空间轨迹匿名化","authors":"Yuiko Sakuma, Thai P. Tran, Tomomu Iwai, Akihito Nishikawa, Hiroaki Nishi","doi":"10.1109/CANDAR53791.2021.00016","DOIUrl":null,"url":null,"abstract":"In recent years, the volume of captured location-based movement data has drastically increased with the prevalence of smartphones. Mobility data are commonly used for smart assistant and personalized advertising applications. However, such data contain considerable sensitive information; thus, they must be anonymized before they can be published or analyzed. In this study, we investigate the problem of anonymization for trajectory publication. Anonymizing trajectories is challenging because they have high dimensionality in both the spatial and temporal domains. Traditional anonymization methods cannot handle high dimensionality without significantly sacrificing data utility. The proposed method addresses this limitation by training a Seq2Seq autoencoder model to reconstruct trajectories from the spatiotemporal input, followed by distributing the Laplace noise to the principal components of the Seq2Seq encoder's hidden-layer output under differential privacy. By distributing the privacy budget in the latent space, the proposed method can output trajectories that satisfy differential privacy while maintaining embedded information. Experimental results from the application of the proposed method to real-life movement trajectory data from Saitama, Japan, demonstrate a reduction in data loss by up to 75.7 % while maintaining significant data utility.","PeriodicalId":263773,"journal":{"name":"2021 Ninth International Symposium on Computing and Networking (CANDAR)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"Trajectory Anonymization through Laplace Noise Addition in Latent Space\",\"authors\":\"Yuiko Sakuma, Thai P. Tran, Tomomu Iwai, Akihito Nishikawa, Hiroaki Nishi\",\"doi\":\"10.1109/CANDAR53791.2021.00016\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In recent years, the volume of captured location-based movement data has drastically increased with the prevalence of smartphones. Mobility data are commonly used for smart assistant and personalized advertising applications. However, such data contain considerable sensitive information; thus, they must be anonymized before they can be published or analyzed. In this study, we investigate the problem of anonymization for trajectory publication. Anonymizing trajectories is challenging because they have high dimensionality in both the spatial and temporal domains. Traditional anonymization methods cannot handle high dimensionality without significantly sacrificing data utility. The proposed method addresses this limitation by training a Seq2Seq autoencoder model to reconstruct trajectories from the spatiotemporal input, followed by distributing the Laplace noise to the principal components of the Seq2Seq encoder's hidden-layer output under differential privacy. By distributing the privacy budget in the latent space, the proposed method can output trajectories that satisfy differential privacy while maintaining embedded information. Experimental results from the application of the proposed method to real-life movement trajectory data from Saitama, Japan, demonstrate a reduction in data loss by up to 75.7 % while maintaining significant data utility.\",\"PeriodicalId\":263773,\"journal\":{\"name\":\"2021 Ninth International Symposium on Computing and Networking (CANDAR)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 Ninth International Symposium on Computing and Networking (CANDAR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CANDAR53791.2021.00016\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 Ninth International Symposium on Computing and Networking (CANDAR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CANDAR53791.2021.00016","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 2

摘要

近年来，随着智能手机的普及，基于位置的移动数据的捕获量急剧增加。移动数据通常用于智能助手和个性化广告应用程序。然而，这些数据包含相当多的敏感信息;因此，在发表或分析之前，它们必须匿名化。在本研究中，我们探讨了轨迹发布的匿名化问题。轨迹匿名化具有挑战性，因为它们在空间和时间域都具有高维性。传统的匿名化方法无法在不显著牺牲数据效用的情况下处理高维数据。该方法通过训练一个Seq2Seq自编码器模型来从时空输入重建轨迹，然后在差分隐私下将拉普拉斯噪声分布到Seq2Seq编码器隐藏层输出的主成分中，从而解决了这一限制。通过在隐空间中分配隐私预算，该方法可以在保持嵌入信息的同时输出满足差分隐私的轨迹。将该方法应用于日本埼玉县的真实运动轨迹数据的实验结果表明，在保持显著数据效用的同时，数据丢失减少了75.7%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Trajectory Anonymization through Laplace Noise Addition in Latent Space

In recent years, the volume of captured location-based movement data has drastically increased with the prevalence of smartphones. Mobility data are commonly used for smart assistant and personalized advertising applications. However, such data contain considerable sensitive information; thus, they must be anonymized before they can be published or analyzed. In this study, we investigate the problem of anonymization for trajectory publication. Anonymizing trajectories is challenging because they have high dimensionality in both the spatial and temporal domains. Traditional anonymization methods cannot handle high dimensionality without significantly sacrificing data utility. The proposed method addresses this limitation by training a Seq2Seq autoencoder model to reconstruct trajectories from the spatiotemporal input, followed by distributing the Laplace noise to the principal components of the Seq2Seq encoder's hidden-layer output under differential privacy. By distributing the privacy budget in the latent space, the proposed method can output trajectories that satisfy differential privacy while maintaining embedded information. Experimental results from the application of the proposed method to real-life movement trajectory data from Saitama, Japan, demonstrate a reduction in data loss by up to 75.7 % while maintaining significant data utility.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

2021 Ninth International Symposium on Computing and Networking (CANDAR)

自引率

0.00%

发文量