{"title":"An efficient algorithm for sampling of a single large graph","authors":"Vandana Bhatia, Rinkle Rani","doi":"10.1109/IC3.2017.8284290","DOIUrl":null,"url":null,"abstract":"Graph Databases offer a very influential way to provide an instinctual representation for many applications spanning from social networks, web networks to biological networks. In the current era of big data, the size of the graph is increasing exponentially. It is difficult for the conventional machines to analyze a whole graph. To overcome this, the characteristics of the large graphs are estimated via sampling in order to identify trends and patterns in the large graph. The existing sampling techniques such as random node and random walk do not provide consistent efficiency over the graphs. In this paper, an efficient sampling algorithm named Influence sampling (IS) is proposed which sample the graphs by analyzing the degree of the vertices of the graph such that the most influential vertices remain in the graph sample. The experiments are performed over three real life datasets and the performance is compared with the three existing sampling algorithms. It is shown that IS performs well in the terms of accuracy.","PeriodicalId":147099,"journal":{"name":"2017 Tenth International Conference on Contemporary Computing (IC3)","volume":"28 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 Tenth International Conference on Contemporary Computing (IC3)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3.2017.8284290","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3
Abstract
Graph Databases offer a very influential way to provide an instinctual representation for many applications spanning from social networks, web networks to biological networks. In the current era of big data, the size of the graph is increasing exponentially. It is difficult for the conventional machines to analyze a whole graph. To overcome this, the characteristics of the large graphs are estimated via sampling in order to identify trends and patterns in the large graph. The existing sampling techniques such as random node and random walk do not provide consistent efficiency over the graphs. In this paper, an efficient sampling algorithm named Influence sampling (IS) is proposed which sample the graphs by analyzing the degree of the vertices of the graph such that the most influential vertices remain in the graph sample. The experiments are performed over three real life datasets and the performance is compared with the three existing sampling algorithms. It is shown that IS performs well in the terms of accuracy.