{"title":"Estimation of Graph Features Based on Random Walks Using Neighbors' Properties","authors":"Tsuyoshi Hasegawa, Shiori Hironaka, Kazuyuki Shudo","doi":"arxiv-2409.08599","DOIUrl":null,"url":null,"abstract":"Using random walks for sampling has proven advantageous in assessing the\ncharacteristics of large and unknown social networks. Several algorithms based\non random walks have been introduced in recent years. In the practical\napplication of social network sampling, there is a recurrent reliance on an\napplication programming interface (API) for obtaining adjacent nodes. However,\nowing to constraints related to query frequency and associated API expenses, it\nis preferable to minimize API calls during the feature estimation process. In\nthis study, considering the acquisition of neighboring nodes as a cost factor,\nwe introduce a feature estimation algorithm that outperforms existing\nalgorithms in terms of accuracy. Through experiments that simulate sampling on\nknown graphs, we demonstrate the superior accuracy of our proposed algorithm\nwhen compared to existing alternatives.","PeriodicalId":501032,"journal":{"name":"arXiv - CS - Social and Information Networks","volume":"16 1","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-09-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"arXiv - CS - Social and Information Networks","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/arxiv-2409.08599","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Using random walks for sampling has proven advantageous in assessing the
characteristics of large and unknown social networks. Several algorithms based
on random walks have been introduced in recent years. In the practical
application of social network sampling, there is a recurrent reliance on an
application programming interface (API) for obtaining adjacent nodes. However,
owing to constraints related to query frequency and associated API expenses, it
is preferable to minimize API calls during the feature estimation process. In
this study, considering the acquisition of neighboring nodes as a cost factor,
we introduce a feature estimation algorithm that outperforms existing
algorithms in terms of accuracy. Through experiments that simulate sampling on
known graphs, we demonstrate the superior accuracy of our proposed algorithm
when compared to existing alternatives.
事实证明,使用随机游走进行抽样在评估大型未知社交网络的特征方面具有优势。近年来,已经推出了几种基于随机游走的算法。在社交网络采样的实际应用中,人们经常依赖应用编程接口(API)来获取相邻节点。然而,由于查询频率和相关 API 费用的限制,在特征估计过程中最好尽量减少 API 调用。在本研究中,考虑到获取相邻节点的成本因素,我们引入了一种在准确性方面优于现有算法的特征估计算法。通过在已知图上模拟采样的实验,我们证明了与现有算法相比,我们提出的算法具有更高的准确性。