基于Hadoop平台的二部网络单模投影的可扩展方法

Mahsa Asadi, Nasser Ghadiri, M. Nikbakht
{"title":"基于Hadoop平台的二部网络单模投影的可扩展方法","authors":"Mahsa Asadi, Nasser Ghadiri, M. Nikbakht","doi":"10.1109/ICCKE.2018.8566259","DOIUrl":null,"url":null,"abstract":"People look for models and methods to organize, classify, compress and filter the information due to the difficulty in maintenance and using immense sources of information. The bipartite graphs are particularly useful among the variety of presenting methods such as recommender systems. Most of the bipartite networks tend to cluster one side of graph behavior to recognize communications and interactions between members of that side and discover similar members. The one-mode projection technique is widely used for this purpose. However, parts of the primary information of the original bipartite graph is missed under the projection. So we need to exploit a method for determining the weights that yield projected edges in a way that minimizes information loss. While such methods exist, the majority of investigated databases in the field of bipartite network projection are huge, consequently, executing a projection procedure takes lots of times. In this paper, we propose a scalable method based on resource allocation for bipartite network projection. It provides a high performance while preserving precision through transferring the needed operations on a distributed platform like Hadoop. Moreover, as a case study, we evaluate the performance of the presented scalable algorithm in the field of social network which results in short projection operation time in comparison to the undistributed mode. Also, we compared our proposed method with a collaborative filtering method, a well-known algorithm in the recommendation field and as a result, our method had higher overall execution speed. With using the largest dataset of our experiments, the Orkut dataset, the proposed method has higher speed than the scalable CF by 33 %. Then, we evaluate the scalability of the introduced method by a scalability metric namely Speedup, which showed good scalability.","PeriodicalId":283700,"journal":{"name":"2018 8th International Conference on Computer and Knowledge Engineering (ICCKE)","volume":"112 5 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":"{\"title\":\"A Scalable Method for One-Mode Projection of Bipartite Networks Based on Hadoop Platform\",\"authors\":\"Mahsa Asadi, Nasser Ghadiri, M. Nikbakht\",\"doi\":\"10.1109/ICCKE.2018.8566259\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"People look for models and methods to organize, classify, compress and filter the information due to the difficulty in maintenance and using immense sources of information. The bipartite graphs are particularly useful among the variety of presenting methods such as recommender systems. Most of the bipartite networks tend to cluster one side of graph behavior to recognize communications and interactions between members of that side and discover similar members. The one-mode projection technique is widely used for this purpose. However, parts of the primary information of the original bipartite graph is missed under the projection. So we need to exploit a method for determining the weights that yield projected edges in a way that minimizes information loss. While such methods exist, the majority of investigated databases in the field of bipartite network projection are huge, consequently, executing a projection procedure takes lots of times. In this paper, we propose a scalable method based on resource allocation for bipartite network projection. It provides a high performance while preserving precision through transferring the needed operations on a distributed platform like Hadoop. Moreover, as a case study, we evaluate the performance of the presented scalable algorithm in the field of social network which results in short projection operation time in comparison to the undistributed mode. Also, we compared our proposed method with a collaborative filtering method, a well-known algorithm in the recommendation field and as a result, our method had higher overall execution speed. With using the largest dataset of our experiments, the Orkut dataset, the proposed method has higher speed than the scalable CF by 33 %. Then, we evaluate the scalability of the introduced method by a scalability metric namely Speedup, which showed good scalability.\",\"PeriodicalId\":283700,\"journal\":{\"name\":\"2018 8th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"volume\":\"112 5 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"2\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 8th International Conference on Computer and Knowledge Engineering (ICCKE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICCKE.2018.8566259\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 8th International Conference on Computer and Knowledge Engineering (ICCKE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCKE.2018.8566259","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2

摘要

由于庞大的信息源难以维护和使用,人们寻求对信息进行组织、分类、压缩和过滤的模型和方法。二部图在各种表示方法中特别有用,例如推荐系统。大多数二部网络倾向于聚类图行为的一侧,以识别该一侧成员之间的通信和交互,并发现相似的成员。单模投影技术被广泛用于此目的。然而,在投影下会丢失部分原始二部图的原始信息。所以我们需要找到一种方法来确定产生投影边的权重以最小化信息损失。尽管存在这样的方法,但在二部网络投影领域,大多数研究的数据库都是巨大的,因此,执行一个投影过程需要花费大量的时间。本文提出了一种基于资源分配的可扩展二部网络投影方法。它提供了高性能,同时通过在像Hadoop这样的分布式平台上传输所需的操作来保持精度。此外,作为一个案例研究,我们评估了所提出的可扩展算法在社交网络领域的性能,与非分布模式相比,该算法的投影运算时间更短。同时,将本文方法与推荐领域的一种知名算法协同过滤方法进行了比较,结果表明本文方法具有更高的整体执行速度。使用我们实验中最大的数据集Orkut数据集,所提出的方法比可扩展CF提高了33%的速度。然后用可扩展性指标加速度来评价所引入方法的可扩展性,结果表明该方法具有良好的可扩展性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
A Scalable Method for One-Mode Projection of Bipartite Networks Based on Hadoop Platform
People look for models and methods to organize, classify, compress and filter the information due to the difficulty in maintenance and using immense sources of information. The bipartite graphs are particularly useful among the variety of presenting methods such as recommender systems. Most of the bipartite networks tend to cluster one side of graph behavior to recognize communications and interactions between members of that side and discover similar members. The one-mode projection technique is widely used for this purpose. However, parts of the primary information of the original bipartite graph is missed under the projection. So we need to exploit a method for determining the weights that yield projected edges in a way that minimizes information loss. While such methods exist, the majority of investigated databases in the field of bipartite network projection are huge, consequently, executing a projection procedure takes lots of times. In this paper, we propose a scalable method based on resource allocation for bipartite network projection. It provides a high performance while preserving precision through transferring the needed operations on a distributed platform like Hadoop. Moreover, as a case study, we evaluate the performance of the presented scalable algorithm in the field of social network which results in short projection operation time in comparison to the undistributed mode. Also, we compared our proposed method with a collaborative filtering method, a well-known algorithm in the recommendation field and as a result, our method had higher overall execution speed. With using the largest dataset of our experiments, the Orkut dataset, the proposed method has higher speed than the scalable CF by 33 %. Then, we evaluate the scalability of the introduced method by a scalability metric namely Speedup, which showed good scalability.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信