{"title":"NetDPSyn: Synthesizing Network Traces under Differential Privacy","authors":"Danyu Sun, Joann Qiongna Chen, Chen Gong, Tianhao Wang, Zhou Li","doi":"arxiv-2409.05249","DOIUrl":null,"url":null,"abstract":"As the utilization of network traces for the network measurement research\nbecomes increasingly prevalent, concerns regarding privacy leakage from network\ntraces have garnered the public's attention. To safeguard network traces,\nresearchers have proposed the trace synthesis that retains the essential\nproperties of the raw data. However, previous works also show that synthesis\ntraces with generative models are vulnerable under linkage attacks. This paper introduces NetDPSyn, the first system to synthesize high-fidelity\nnetwork traces under privacy guarantees. NetDPSyn is built with the\nDifferential Privacy (DP) framework as its core, which is significantly\ndifferent from prior works that apply DP when training the generative model.\nThe experiments conducted on three flow and two packet datasets indicate that\nNetDPSyn achieves much better data utility in downstream tasks like anomaly\ndetection. NetDPSyn is also 2.5 times faster than the other methods on average\nin data synthesis.","PeriodicalId":501280,"journal":{"name":"arXiv - CS - Networking and Internet Architecture","volume":null,"pages":null},"PeriodicalIF":0.0000,"publicationDate":"2024-09-08","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Networking and Internet Architecture","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.05249","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
As the utilization of network traces for the network measurement research
becomes increasingly prevalent, concerns regarding privacy leakage from network
traces have garnered the public's attention. To safeguard network traces,
researchers have proposed the trace synthesis that retains the essential
properties of the raw data. However, previous works also show that synthesis
traces with generative models are vulnerable under linkage attacks. This paper introduces NetDPSyn, the first system to synthesize high-fidelity
network traces under privacy guarantees. NetDPSyn is built with the
Differential Privacy (DP) framework as its core, which is significantly
different from prior works that apply DP when training the generative model.
The experiments conducted on three flow and two packet datasets indicate that
NetDPSyn achieves much better data utility in downstream tasks like anomaly
detection. NetDPSyn is also 2.5 times faster than the other methods on average
in data synthesis.