{"title":"基于选择种子节点和无回溯两种方法与页面排序算法相结合的随机行走抽样改进","authors":"Ali Kheradbeygi Moghadam, A. Bastanfard","doi":"10.1109/ICWR54782.2022.9786241","DOIUrl":null,"url":null,"abstract":"One of the algorithms used for sampling complex networks is the classical random walk algorithm, which has been considered due to its good performance. But speed and energy consumption can also be improved by reducing size of input data. In this study, two random walk algorithms inspired by two methods, choosing seed node, and no-retracing algorithm which obtained by changing the classical random walk algorithm, and combining these three algorithms with google page rank algorithm, are discussed. This is done to preserve important nodes and reduce the size of the input data. This sampling was done from the United States flight network database. Also, important characteristics obtained in sampling, such as sampling efficiency, degree distribution, average degree, and average clustering coefficient have been investigated. The algorithms studied in this research each have their own advantages and disadvantages. For example, the no-retracing shows better performance in terms of time and average clustering coefficient. This efficiency is even greater when we use a combination of no-retracing algorithm with google page ranking algorithm. These algorithms can be used when speed is important in decision making, such as deciding on airlines and public transportation, etc. These algorithms are also more energy efficient than the studied algorithms.","PeriodicalId":355187,"journal":{"name":"2022 8th International Conference on Web Research (ICWR)","volume":"42 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-05-11","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improving Random Walk Sampling, Inspired by Two Methods of Choosing Seed Node And No-Retracing With Combination of them with Page Rank Algorithm\",\"authors\":\"Ali Kheradbeygi Moghadam, A. Bastanfard\",\"doi\":\"10.1109/ICWR54782.2022.9786241\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"One of the algorithms used for sampling complex networks is the classical random walk algorithm, which has been considered due to its good performance. But speed and energy consumption can also be improved by reducing size of input data. In this study, two random walk algorithms inspired by two methods, choosing seed node, and no-retracing algorithm which obtained by changing the classical random walk algorithm, and combining these three algorithms with google page rank algorithm, are discussed. This is done to preserve important nodes and reduce the size of the input data. This sampling was done from the United States flight network database. Also, important characteristics obtained in sampling, such as sampling efficiency, degree distribution, average degree, and average clustering coefficient have been investigated. The algorithms studied in this research each have their own advantages and disadvantages. For example, the no-retracing shows better performance in terms of time and average clustering coefficient. This efficiency is even greater when we use a combination of no-retracing algorithm with google page ranking algorithm. These algorithms can be used when speed is important in decision making, such as deciding on airlines and public transportation, etc. These algorithms are also more energy efficient than the studied algorithms.\",\"PeriodicalId\":355187,\"journal\":{\"name\":\"2022 8th International Conference on Web Research (ICWR)\",\"volume\":\"42 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-05-11\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2022 8th International Conference on Web Research (ICWR)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICWR54782.2022.9786241\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2022 8th International Conference on Web Research (ICWR)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICWR54782.2022.9786241","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving Random Walk Sampling, Inspired by Two Methods of Choosing Seed Node And No-Retracing With Combination of them with Page Rank Algorithm
One of the algorithms used for sampling complex networks is the classical random walk algorithm, which has been considered due to its good performance. But speed and energy consumption can also be improved by reducing size of input data. In this study, two random walk algorithms inspired by two methods, choosing seed node, and no-retracing algorithm which obtained by changing the classical random walk algorithm, and combining these three algorithms with google page rank algorithm, are discussed. This is done to preserve important nodes and reduce the size of the input data. This sampling was done from the United States flight network database. Also, important characteristics obtained in sampling, such as sampling efficiency, degree distribution, average degree, and average clustering coefficient have been investigated. The algorithms studied in this research each have their own advantages and disadvantages. For example, the no-retracing shows better performance in terms of time and average clustering coefficient. This efficiency is even greater when we use a combination of no-retracing algorithm with google page ranking algorithm. These algorithms can be used when speed is important in decision making, such as deciding on airlines and public transportation, etc. These algorithms are also more energy efficient than the studied algorithms.