Classification of Encrypted IoT Traffic despite Padding and Shaping

Proceedings of the 21st Workshop on Privacy in the Electronic Society Pub Date : 2021-10-21 DOI:10.1145/3559613.3563191

Aviv Engelberg, A. Wool

{"title":"Classification of Encrypted IoT Traffic despite Padding and Shaping","authors":"Aviv Engelberg, A. Wool","doi":"10.1145/3559613.3563191","DOIUrl":null,"url":null,"abstract":"It is well-known that when IoT traffic is unencrypted it is possible to identify the active devices based on their TCP/IP headers. And when traffic is encrypted, packet-sizes and timings can still be used to do so. To defend against such fingerprinting, traffic padding and shaping were introduced. In this paper we show that even with these mitigations, the privacy of IoT consumers can still be violated. The main tool we use in our analysis is the full distribution of packet-size---as opposed to commonly used statistics such as mean and variance. We evaluate the performance of a local adversary, such as a snooping neighbor or a criminal, against 8~different padding methods. We show that our classifiers achieve perfect (100% accuracy) classification using the full packet-size distribution for low-overhead methods, whereas prior works that rely on statistical metadata achieved lower rates even when no padding and shaping were used. We also achieve an excellent classification rate even against high-overhead methods. We further show how an external adversary such as a malicious ISP or a government intelligence agency, who only sees the padded and shaped traffic as it goes through a VPN, can accurately identify the subset of active devices with Recall and Precision of at least 96%. Finally, we also propose a new method of padding we call the Dynamic STP (DSTP) that incurs significantly less per-packet overhead compared to other padding methods we tested and guarantees more privacy to IoT consumers.","PeriodicalId":416548,"journal":{"name":"Proceedings of the 21st Workshop on Privacy in the Electronic Society","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-10-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"7","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 21st Workshop on Privacy in the Electronic Society","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3559613.3563191","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 7

Abstract

It is well-known that when IoT traffic is unencrypted it is possible to identify the active devices based on their TCP/IP headers. And when traffic is encrypted, packet-sizes and timings can still be used to do so. To defend against such fingerprinting, traffic padding and shaping were introduced. In this paper we show that even with these mitigations, the privacy of IoT consumers can still be violated. The main tool we use in our analysis is the full distribution of packet-size---as opposed to commonly used statistics such as mean and variance. We evaluate the performance of a local adversary, such as a snooping neighbor or a criminal, against 8~different padding methods. We show that our classifiers achieve perfect (100% accuracy) classification using the full packet-size distribution for low-overhead methods, whereas prior works that rely on statistical metadata achieved lower rates even when no padding and shaping were used. We also achieve an excellent classification rate even against high-overhead methods. We further show how an external adversary such as a malicious ISP or a government intelligence agency, who only sees the padded and shaped traffic as it goes through a VPN, can accurately identify the subset of active devices with Recall and Precision of at least 96%. Finally, we also propose a new method of padding we call the Dynamic STP (DSTP) that incurs significantly less per-packet overhead compared to other padding methods we tested and guarantees more privacy to IoT consumers.

查看原文本刊更多论文

尽管填充和整形，加密物联网流量的分类

众所周知，当物联网流量未加密时，可以根据TCP/IP标头识别活动设备。当流量被加密时，数据包大小和时间仍然可以用来加密。为了防止这种指纹识别，引入了流量填充和整形。在本文中，我们表明，即使有了这些缓解措施，物联网消费者的隐私仍然可能被侵犯。我们在分析中使用的主要工具是数据包大小的完整分布，而不是常用的统计数据，如均值和方差。我们针对8种不同的填充方法评估了本地对手(如窥探邻居或罪犯)的性能。我们表明，我们的分类器使用低开销方法的完整数据包大小分布实现了完美(100%准确率)的分类，而以前依赖于统计元数据的工作即使在没有使用填充和整形的情况下也实现了较低的分类率。即使在高开销的方法下，我们也实现了出色的分类率。我们进一步展示了外部攻击者(如恶意ISP或政府情报机构)如何在通过VPN时只看到填充和变形的流量，从而准确识别活动设备的子集，召回率和精度至少为96%。最后，我们还提出了一种新的填充方法，我们称之为动态STP (DSTP)，与我们测试的其他填充方法相比，它产生的每包开销显着减少，并保证了物联网消费者的更多隐私。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 21st Workshop on Privacy in the Electronic Society

自引率

0.00%

发文量