基于选择种子节点和无回溯两种方法与页面排序算法相结合的随机行走抽样改进

Ali Kheradbeygi Moghadam, A. Bastanfard
{"title":"基于选择种子节点和无回溯两种方法与页面排序算法相结合的随机行走抽样改进","authors":"Ali Kheradbeygi Moghadam, A. Bastanfard","doi":"10.1109/ICWR54782.2022.9786241","DOIUrl":null,"url":null,"abstract":"One of the algorithms used for sampling complex networks is the classical random walk algorithm, which has been considered due to its good performance. But speed and energy consumption can also be improved by reducing size of input data. In this study, two random walk algorithms inspired by two methods, choosing seed node, and no-retracing algorithm which obtained by changing the classical random walk algorithm, and combining these three algorithms with google page rank algorithm, are discussed. This is done to preserve important nodes and reduce the size of the input data. This sampling was done from the United States flight network database. Also, important characteristics obtained in sampling, such as sampling efficiency, degree distribution, average degree, and average clustering coefficient have been investigated. The algorithms studied in this research each have their own advantages and disadvantages. For example, the no-retracing shows better performance in terms of time and average clustering coefficient. This efficiency is even greater when we use a combination of no-retracing algorithm with google page ranking algorithm. These algorithms can be used when speed is important in decision making, such as deciding on airlines and public transportation, etc. These algorithms are also more energy efficient than the studied algorithms.","PeriodicalId":355187,"journal":{"name":"2022 8th International Conference on Web Research (ICWR)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improving Random Walk Sampling, Inspired by Two Methods of Choosing Seed Node And No-Retracing With Combination of them with Page Rank Algorithm\",\"authors\":\"Ali Kheradbeygi Moghadam, A. Bastanfard\",\"doi\":\"10.1109/ICWR54782.2022.9786241\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the algorithms used for sampling complex networks is the classical random walk algorithm, which has been considered due to its good performance. But speed and energy consumption can also be improved by reducing size of input data. In this study, two random walk algorithms inspired by two methods, choosing seed node, and no-retracing algorithm which obtained by changing the classical random walk algorithm, and combining these three algorithms with google page rank algorithm, are discussed. This is done to preserve important nodes and reduce the size of the input data. This sampling was done from the United States flight network database. Also, important characteristics obtained in sampling, such as sampling efficiency, degree distribution, average degree, and average clustering coefficient have been investigated. The algorithms studied in this research each have their own advantages and disadvantages. For example, the no-retracing shows better performance in terms of time and average clustering coefficient. This efficiency is even greater when we use a combination of no-retracing algorithm with google page ranking algorithm. These algorithms can be used when speed is important in decision making, such as deciding on airlines and public transportation, etc. These algorithms are also more energy efficient than the studied algorithms.\",\"PeriodicalId\":355187,\"journal\":{\"name\":\"2022 8th International Conference on Web Research (ICWR)\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 8th International Conference on Web Research (ICWR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICWR54782.2022.9786241\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th International Conference on Web Research (ICWR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWR54782.2022.9786241","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1

摘要

用于复杂网络采样的算法之一是经典随机漫步算法,由于其良好的性能而受到人们的重视。但是速度和能耗也可以通过减少输入数据的大小来提高。本研究讨论了两种随机行走算法:选择种子节点和改变经典随机行走算法得到的无回溯算法,并将这三种算法与google页面排名算法相结合。这样做是为了保留重要节点并减少输入数据的大小。这个抽样是从美国飞行网络数据库中进行的。同时,对采样中得到的重要特征,如采样效率、度分布、平均度、平均聚类系数等进行了研究。本文所研究的算法各有优缺点。例如,无回溯在时间和平均聚类系数方面表现出更好的性能。当我们将无回溯算法与谷歌页面排名算法结合使用时,这种效率甚至更高。这些算法可用于速度对决策很重要的情况,例如选择航空公司和公共交通工具等。这些算法也比研究过的算法更节能。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Improving Random Walk Sampling, Inspired by Two Methods of Choosing Seed Node And No-Retracing With Combination of them with Page Rank Algorithm
One of the algorithms used for sampling complex networks is the classical random walk algorithm, which has been considered due to its good performance. But speed and energy consumption can also be improved by reducing size of input data. In this study, two random walk algorithms inspired by two methods, choosing seed node, and no-retracing algorithm which obtained by changing the classical random walk algorithm, and combining these three algorithms with google page rank algorithm, are discussed. This is done to preserve important nodes and reduce the size of the input data. This sampling was done from the United States flight network database. Also, important characteristics obtained in sampling, such as sampling efficiency, degree distribution, average degree, and average clustering coefficient have been investigated. The algorithms studied in this research each have their own advantages and disadvantages. For example, the no-retracing shows better performance in terms of time and average clustering coefficient. This efficiency is even greater when we use a combination of no-retracing algorithm with google page ranking algorithm. These algorithms can be used when speed is important in decision making, such as deciding on airlines and public transportation, etc. These algorithms are also more energy efficient than the studied algorithms.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信