To Share or Not to Share: On Location Privacy in IoT Sensor Data

F. Papst, Naomi Stricker, R. Entezari, O. Saukh
{"title":"To Share or Not to Share: On Location Privacy in IoT Sensor Data","authors":"F. Papst, Naomi Stricker, R. Entezari, O. Saukh","doi":"10.1109/iotdi54339.2022.00015","DOIUrl":null,"url":null,"abstract":"Data sharing is crucial for building large datasets which in return are essential for developing and training accurate models in many contexts including smart cities, agriculture, and medical applications. However, shared data may leak private information, such as personal identifiers or location. Past research provides evidence that solely removing these identifiers through pseudonymization is not enough to ensure data privacy protection, since even the pseudonymized data may still contain information about the data provider. In this paper, we show that sensor data may leak a sensor's location even if the latter is not explicitly shared. Sensors are localized by linking sensor data with publicly available environmental data such as local weather. The proposed localization method relies on a machine learning model to predict weather data from sensor observations. Subsequently, the localization algorithm determines the sensor's location from the predicted weather trace using Bayesian filtering. We apply our approach to three real-world datasets where we (1) localize an ozone sensor given its readings, (2) localize a cow from activity parameters recorded with a tracker in the cow's reticulum, (3) localize solar panels based on their solar generation data. The achieved average localization accuracy of 5.68 km, 19.91 km, and 13.68 km on the above tasks, respectively, using data traces with a length of 365 days is remarkable. In addition, we introduce a mechanism, referred to as teleport, to protect location information in sensor data. The mechanism is based on deep models and masks the location by replacing the weather dependency with a different weather signature.","PeriodicalId":314074,"journal":{"name":"2022 IEEE/ACM Seventh International Conference on Internet-of-Things Design and Implementation (IoTDI)","volume":"301 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 IEEE/ACM Seventh International Conference on Internet-of-Things Design and Implementation (IoTDI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/iotdi54339.2022.00015","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

Abstract

Data sharing is crucial for building large datasets which in return are essential for developing and training accurate models in many contexts including smart cities, agriculture, and medical applications. However, shared data may leak private information, such as personal identifiers or location. Past research provides evidence that solely removing these identifiers through pseudonymization is not enough to ensure data privacy protection, since even the pseudonymized data may still contain information about the data provider. In this paper, we show that sensor data may leak a sensor's location even if the latter is not explicitly shared. Sensors are localized by linking sensor data with publicly available environmental data such as local weather. The proposed localization method relies on a machine learning model to predict weather data from sensor observations. Subsequently, the localization algorithm determines the sensor's location from the predicted weather trace using Bayesian filtering. We apply our approach to three real-world datasets where we (1) localize an ozone sensor given its readings, (2) localize a cow from activity parameters recorded with a tracker in the cow's reticulum, (3) localize solar panels based on their solar generation data. The achieved average localization accuracy of 5.68 km, 19.91 km, and 13.68 km on the above tasks, respectively, using data traces with a length of 365 days is remarkable. In addition, we introduce a mechanism, referred to as teleport, to protect location information in sensor data. The mechanism is based on deep models and masks the location by replacing the weather dependency with a different weather signature.
分享还是不分享:关于物联网传感器数据中的位置隐私
数据共享对于构建大型数据集至关重要,而大型数据集对于在智能城市、农业和医疗应用等许多环境中开发和训练准确的模型至关重要。然而,共享数据可能会泄露个人信息,如个人标识符或位置。过去的研究提供的证据表明,仅仅通过假名删除这些标识符不足以确保数据隐私保护,因为即使是假名化的数据也可能包含有关数据提供者的信息。在本文中,我们表明传感器数据可能泄漏传感器的位置,即使后者没有明确共享。通过将传感器数据与公开可用的环境数据(如当地天气)联系起来,传感器可以实现本地化。提出的定位方法依赖于机器学习模型来预测传感器观测的天气数据。随后,定位算法使用贝叶斯滤波从预测的天气轨迹确定传感器的位置。我们将我们的方法应用于三个真实世界的数据集,其中我们(1)根据臭氧传感器的读数对其进行定位,(2)根据奶牛网状结构中跟踪器记录的活动参数对奶牛进行定位,(3)根据太阳能发电数据对太阳能电池板进行定位。在365天的数据轨迹下,上述任务的平均定位精度分别为5.68 km、19.91 km和13.68 km。此外,我们引入了一种机制,称为传送,以保护传感器数据中的位置信息。该机制基于深度模型,并通过用不同的天气特征替换天气依赖来掩盖位置。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信