Extending FolkRank with content data

Proceedings of the 4th ACM RecSys workshop on Recommender systems and the social web Pub Date : 2012-09-09 DOI:10.1145/2365934.2365936

Nikolas Landia, S. Anand, A. Hotho, R. Jäschke, Stephan Doerfel, Folke Mitzlaff

{"title":"Extending FolkRank with content data","authors":"Nikolas Landia, S. Anand, A. Hotho, R. Jäschke, Stephan Doerfel, Folke Mitzlaff","doi":"10.1145/2365934.2365936","DOIUrl":null,"url":null,"abstract":"Real-world tagging datasets have a large proportion of new/ untagged documents. Few approaches for recommending tags to a user for a document address this new item problem, concentrating instead on artificially created post-core datasets where it is guaranteed that the user as well as the document of each test post is known to the system and already has some tags assigned to it. In order to recommend tags for new documents, approaches are required which model documents not only based on the tags assigned to them in the past (if any), but also the content. In this paper we present a novel adaptation to the widely recognised FolkRank tag recommendation algorithm by including content data. We adapt the FolkRank graph to use word nodes instead of document nodes, enabling it to recommend tags for new documents based on their textual content. Our adaptations make FolkRank applicable to post-core 1 ie. the full real-world tagging datasets and address the new item problem in tag recommendation. For comparison, we also apply and evaluate the same methodology of including content on a simpler tag recommendation algorithm. This results in a less expensive recommender which suggests a combination of user related and document content related tags. Including content data into FolkRank shows an improvement over plain FolkRank on full tagging datasets. However, we also observe that our simpler content-aware tag recommender outperforms FolkRank with content data. Our results suggest that an optimisation of the weighting method of FolkRank is required to achieve better results.","PeriodicalId":258534,"journal":{"name":"Proceedings of the 4th ACM RecSys workshop on Recommender systems and the social web","volume":"170 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2012-09-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"10","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 4th ACM RecSys workshop on Recommender systems and the social web","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2365934.2365936","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 10

Abstract

Real-world tagging datasets have a large proportion of new/ untagged documents. Few approaches for recommending tags to a user for a document address this new item problem, concentrating instead on artificially created post-core datasets where it is guaranteed that the user as well as the document of each test post is known to the system and already has some tags assigned to it. In order to recommend tags for new documents, approaches are required which model documents not only based on the tags assigned to them in the past (if any), but also the content. In this paper we present a novel adaptation to the widely recognised FolkRank tag recommendation algorithm by including content data. We adapt the FolkRank graph to use word nodes instead of document nodes, enabling it to recommend tags for new documents based on their textual content. Our adaptations make FolkRank applicable to post-core 1 ie. the full real-world tagging datasets and address the new item problem in tag recommendation. For comparison, we also apply and evaluate the same methodology of including content on a simpler tag recommendation algorithm. This results in a less expensive recommender which suggests a combination of user related and document content related tags. Including content data into FolkRank shows an improvement over plain FolkRank on full tagging datasets. However, we also observe that our simpler content-aware tag recommender outperforms FolkRank with content data. Our results suggest that an optimisation of the weighting method of FolkRank is required to achieve better results.

查看原文本刊更多论文

用内容数据扩展FolkRank

真实世界的标记数据集有很大比例的新/未标记文档。为文档向用户推荐标签的方法很少解决这个新项目问题，而是专注于人工创建的post-core数据集，在这些数据集中，系统可以保证每个测试帖子的用户和文档都是已知的，并且已经为其分配了一些标签。为了为新文档推荐标记，需要一些方法，这些方法不仅基于过去分配给文档的标记(如果有的话)，而且还基于内容对文档进行建模。在本文中，我们提出了一种新的适应广泛认可的FolkRank标签推荐算法，包括内容数据。我们调整了FolkRank图，使用词节点而不是文档节点，使其能够根据文本内容为新文档推荐标签。我们的调整使FolkRank适用于后核心1。完整的真实世界标签数据集，并解决标签推荐中的新项目问题。为了进行比较，我们还应用并评估了在更简单的标签推荐算法上包含内容的相同方法。这就产生了一个更便宜的推荐器，它建议用户相关和文档内容相关的标签组合。将内容数据包含到FolkRank中显示了在完整标记数据集上比纯FolkRank的改进。然而，我们也观察到，我们更简单的内容感知标签推荐器在内容数据方面优于FolkRank。我们的结果表明，需要对FolkRank的加权方法进行优化以获得更好的结果。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 4th ACM RecSys workshop on Recommender systems and the social web

自引率

0.00%

发文量