{"title":"Large very dense subgraphs in a stream of edges","authors":"Claire Mathieu, Michel de Rougemont","doi":"10.1017/nws.2021.17","DOIUrl":null,"url":null,"abstract":"<p>We study the detection and the reconstruction of a large very dense subgraph in a social graph with <span>n</span> nodes and <span>m</span> edges given as a stream of edges, when the graph follows a power law degree distribution, in the regime when <span>\n<span>\n<img data-mimesubtype=\"png\" data-type=\"\" src=\"https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220125124825863-0283:S2050124221000175:S2050124221000175_inline1.png\"/>\n<span data-mathjax-type=\"texmath\"><span>\n$m=O(n. \\log n)$\n</span></span>\n</span>\n</span>. A subgraph <span>S</span> is very dense if it has <span>\n<span>\n<img data-mimesubtype=\"png\" data-type=\"\" src=\"https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220125124825863-0283:S2050124221000175:S2050124221000175_inline2.png\"/>\n<span data-mathjax-type=\"texmath\"><span>\n$\\Omega(|S|^2)$\n</span></span>\n</span>\n</span> edges. We uniformly sample the edges with a Reservoir of size <span>\n<span>\n<img data-mimesubtype=\"png\" data-type=\"\" src=\"https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220125124825863-0283:S2050124221000175:S2050124221000175_inline3.png\"/>\n<span data-mathjax-type=\"texmath\"><span>\n$k=O(\\sqrt{n}.\\log n)$\n</span></span>\n</span>\n</span>. Our detection algorithm checks whether the Reservoir has a giant component. We show that if the graph contains a very dense subgraph of size <span>\n<span>\n<img data-mimesubtype=\"png\" data-type=\"\" src=\"https://static.cambridge.org/binary/version/id/urn:cambridge.org:id:binary:20220125124825863-0283:S2050124221000175:S2050124221000175_inline4.png\"/>\n<span data-mathjax-type=\"texmath\"><span>\n$\\Omega(\\sqrt{n})$\n</span></span>\n</span>\n</span>, then the detection algorithm is almost surely correct. On the other hand, a random graph that follows a power law degree distribution almost surely has no large very dense subgraph, and the detection algorithm is almost surely correct. We define a new model of random graphs which follow a power law degree distribution and have large very dense subgraphs. We then show that on this class of random graphs we can reconstruct a good approximation of the very dense subgraph with high probability. We generalize these results to dynamic graphs defined by sliding windows in a stream of edges.</p>","PeriodicalId":51827,"journal":{"name":"Network Science","volume":null,"pages":null},"PeriodicalIF":1.4000,"publicationDate":"2022-01-25","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Network Science","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1017/nws.2021.17","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"Q2","JCRName":"SOCIAL SCIENCES, INTERDISCIPLINARY","Score":null,"Total":0}
引用次数: 0
Abstract
We study the detection and the reconstruction of a large very dense subgraph in a social graph with n nodes and m edges given as a stream of edges, when the graph follows a power law degree distribution, in the regime when
$m=O(n. \log n)$
. A subgraph S is very dense if it has
$\Omega(|S|^2)$
edges. We uniformly sample the edges with a Reservoir of size
$k=O(\sqrt{n}.\log n)$
. Our detection algorithm checks whether the Reservoir has a giant component. We show that if the graph contains a very dense subgraph of size
$\Omega(\sqrt{n})$
, then the detection algorithm is almost surely correct. On the other hand, a random graph that follows a power law degree distribution almost surely has no large very dense subgraph, and the detection algorithm is almost surely correct. We define a new model of random graphs which follow a power law degree distribution and have large very dense subgraphs. We then show that on this class of random graphs we can reconstruct a good approximation of the very dense subgraph with high probability. We generalize these results to dynamic graphs defined by sliding windows in a stream of edges.
期刊介绍:
Network Science is an important journal for an important discipline - one using the network paradigm, focusing on actors and relational linkages, to inform research, methodology, and applications from many fields across the natural, social, engineering and informational sciences. Given growing understanding of the interconnectedness and globalization of the world, network methods are an increasingly recognized way to research aspects of modern society along with the individuals, organizations, and other actors within it. The discipline is ready for a comprehensive journal, open to papers from all relevant areas. Network Science is a defining work, shaping this discipline. The journal welcomes contributions from researchers in all areas working on network theory, methods, and data.