{"title":"HyperHeadTail: a Streaming Algorithm for Estimating the Degree Distribution of Dynamic Multigraphs","authors":"Andrew Stolman, Kevin Matulef","doi":"10.1145/3110025.3119395","DOIUrl":null,"url":null,"abstract":"We introduce HyperHeadTail, a streaming algorithm for estimating the degree distribution of a graph from a stream of edges using very little storage space. Real world graph streams, such as those generated by network traffic or other communication networks, tend to contain repeated elements as well as a temporal nature. Our algorithm handles these situations by extending the HeadTail algorithm of Simpson, Seshadhri, and McGregor [20]. We provide an implementation of HyperHeadTail and demonstrate its utility on both synthetic and real-world data sets. We show that HyperHeadTail offers similar performance to HeadTail, while also providing additional functionality for tracking dynamic graphs that previous algorithms cannot efficiently achieve. We show that with a space usage on the order of 8% of the number of vertices in a graph, we were able to achieve a Relative Hausdorff distance of .27.","PeriodicalId":399660,"journal":{"name":"Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017","volume":"9 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2017","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3110025.3119395","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
We introduce HyperHeadTail, a streaming algorithm for estimating the degree distribution of a graph from a stream of edges using very little storage space. Real world graph streams, such as those generated by network traffic or other communication networks, tend to contain repeated elements as well as a temporal nature. Our algorithm handles these situations by extending the HeadTail algorithm of Simpson, Seshadhri, and McGregor [20]. We provide an implementation of HyperHeadTail and demonstrate its utility on both synthetic and real-world data sets. We show that HyperHeadTail offers similar performance to HeadTail, while also providing additional functionality for tracking dynamic graphs that previous algorithms cannot efficiently achieve. We show that with a space usage on the order of 8% of the number of vertices in a graph, we were able to achieve a Relative Hausdorff distance of .27.