{"title":"基于稀疏大网络数据的网页推荐","authors":"C. Leung, Fan Jiang, Joglas Souza","doi":"10.1109/WI.2018.00-32","DOIUrl":null,"url":null,"abstract":"In many real-life web applications, web surfers would like to get recommendation on which collections of web pages that would be interested to them or that they should follow. In order to discover this information and make recommendation, data analytics—and specially, association rule mining or web data mining—is in demand. Since its introduction, association rule mining has drawn attention of many researchers. Consequently, many association rule mining algorithms have been proposed for finding interesting relationships—in the form of association rules—among frequently occurring patterns. For instance, in IEEE/WIC/ACM WI 2016 and 2017, serial and parallel algorithms were proposed to find interesting web pages. However, like most of the existing association rule mining algorithms, these two algorithms also were not designed for mining big data. Moreover, the search space of web pages can sparse in the sense that web pages are connected to a small subset of all web pages in the search space. In this paper, we present a compact bitwise representation for web pages in the search space. Such a representation can then be used with a bitwise serial or parallel association rule mining system for web mining and recommendation. Evaluation results show the effectiveness of our compression and the practicality of our algorithm—which discovers popular pages on the web, which in turn gives the web surfers recommendation of web pages that might be interested to them—in real-life web applications.","PeriodicalId":405966,"journal":{"name":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","volume":"11 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-12-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"14","resultStr":"{\"title\":\"Web Page Recommendation from Sparse Big Web Data\",\"authors\":\"C. Leung, Fan Jiang, Joglas Souza\",\"doi\":\"10.1109/WI.2018.00-32\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"In many real-life web applications, web surfers would like to get recommendation on which collections of web pages that would be interested to them or that they should follow. In order to discover this information and make recommendation, data analytics—and specially, association rule mining or web data mining—is in demand. Since its introduction, association rule mining has drawn attention of many researchers. Consequently, many association rule mining algorithms have been proposed for finding interesting relationships—in the form of association rules—among frequently occurring patterns. For instance, in IEEE/WIC/ACM WI 2016 and 2017, serial and parallel algorithms were proposed to find interesting web pages. However, like most of the existing association rule mining algorithms, these two algorithms also were not designed for mining big data. Moreover, the search space of web pages can sparse in the sense that web pages are connected to a small subset of all web pages in the search space. In this paper, we present a compact bitwise representation for web pages in the search space. Such a representation can then be used with a bitwise serial or parallel association rule mining system for web mining and recommendation. Evaluation results show the effectiveness of our compression and the practicality of our algorithm—which discovers popular pages on the web, which in turn gives the web surfers recommendation of web pages that might be interested to them—in real-life web applications.\",\"PeriodicalId\":405966,\"journal\":{\"name\":\"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)\",\"volume\":\"11 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-12-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"14\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/WI.2018.00-32\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/WI.2018.00-32","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 14
摘要
在许多现实生活中的web应用程序中,网络冲浪者希望得到关于哪些网页对他们感兴趣或他们应该关注哪些网页的推荐。为了发现这些信息并提出建议,需要进行数据分析,特别是关联规则挖掘或web数据挖掘。关联规则挖掘自提出以来,受到了众多研究者的关注。因此,已经提出了许多关联规则挖掘算法,用于在频繁出现的模式中发现有趣的关系(以关联规则的形式)。例如,在IEEE/WIC/ACM WI 2016和2017中,提出了串行和并行算法来寻找有趣的网页。然而,与大多数现有的关联规则挖掘算法一样,这两种算法也不是为挖掘大数据而设计的。此外,网页的搜索空间可以是稀疏的,即网页与搜索空间中所有网页的一个小子集相连。在本文中,我们提出了网页在搜索空间中的一种紧凑的按位表示。然后,这种表示可以与按位串行或并行关联规则挖掘系统一起用于web挖掘和推荐。评估结果显示了我们的压缩的有效性和我们算法的实用性——在现实的网络应用中发现网络上的热门页面,进而向网络冲浪者推荐他们可能感兴趣的网页。
In many real-life web applications, web surfers would like to get recommendation on which collections of web pages that would be interested to them or that they should follow. In order to discover this information and make recommendation, data analytics—and specially, association rule mining or web data mining—is in demand. Since its introduction, association rule mining has drawn attention of many researchers. Consequently, many association rule mining algorithms have been proposed for finding interesting relationships—in the form of association rules—among frequently occurring patterns. For instance, in IEEE/WIC/ACM WI 2016 and 2017, serial and parallel algorithms were proposed to find interesting web pages. However, like most of the existing association rule mining algorithms, these two algorithms also were not designed for mining big data. Moreover, the search space of web pages can sparse in the sense that web pages are connected to a small subset of all web pages in the search space. In this paper, we present a compact bitwise representation for web pages in the search space. Such a representation can then be used with a bitwise serial or parallel association rule mining system for web mining and recommendation. Evaluation results show the effectiveness of our compression and the practicality of our algorithm—which discovers popular pages on the web, which in turn gives the web surfers recommendation of web pages that might be interested to them—in real-life web applications.