Mining tags using social endorsement networks

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval Pub Date : 2011-07-24 DOI:10.1145/2009916.2009946

Theodoros Lappas, Kunal Punera, Tamás Sarlós

{"title":"Mining tags using social endorsement networks","authors":"Theodoros Lappas, Kunal Punera, Tamás Sarlós","doi":"10.1145/2009916.2009946","DOIUrl":null,"url":null,"abstract":"Entities on social systems, such as users on Twitter, and images on Flickr, are at the core of many interesting applications: they can be ranked in search results, recommended to users, or used in contextual advertising. Such applications assume knowledge of an entity's nature and characteristic attributes. An effective way to encode such knowledge is in the form of tags. An untagged entity is practically inaccessible, since it is hard to retrieve or interact with. To address this, some platforms allow users to manually tag entities. However,while such tags can be informative, they can oftentimes be inadequate, trivial, ambiguous, or even plain false. Numerous automated tagging methods have been proposed to address these issues. However,most of them require pre-existing high-quality tags or descriptive texts for every entity that needs to be tagged. In our work, we propose a method based on social endorsements that is free from such constraints. Virtually every major social networking platform allows users to endorse entities that they find appealing. Examples include \"following\" Twitter users or \"favoriting\" Flickr photos. These endorsements are abundant and directly capture the preferences of users. In this paper, we pose and solve the problem of using the underlying social endorsement network to extract useful tags for entities in a social system. Our work leverages techniques from topic modeling to capture the interests of users and then uses them to extract relevant and descriptive tags for the entities they endorse. We perform an extensive evaluation of our proposed approach on real large-scale datasets from both Twitter and Flickr, and show that it significantly outperforms meaningful and competitive baselines.","PeriodicalId":356580,"journal":{"name":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","volume":"2 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2011-07-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2009916.2009946","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 22

Abstract

Entities on social systems, such as users on Twitter, and images on Flickr, are at the core of many interesting applications: they can be ranked in search results, recommended to users, or used in contextual advertising. Such applications assume knowledge of an entity's nature and characteristic attributes. An effective way to encode such knowledge is in the form of tags. An untagged entity is practically inaccessible, since it is hard to retrieve or interact with. To address this, some platforms allow users to manually tag entities. However,while such tags can be informative, they can oftentimes be inadequate, trivial, ambiguous, or even plain false. Numerous automated tagging methods have been proposed to address these issues. However,most of them require pre-existing high-quality tags or descriptive texts for every entity that needs to be tagged. In our work, we propose a method based on social endorsements that is free from such constraints. Virtually every major social networking platform allows users to endorse entities that they find appealing. Examples include "following" Twitter users or "favoriting" Flickr photos. These endorsements are abundant and directly capture the preferences of users. In this paper, we pose and solve the problem of using the underlying social endorsement network to extract useful tags for entities in a social system. Our work leverages techniques from topic modeling to capture the interests of users and then uses them to extract relevant and descriptive tags for the entities they endorse. We perform an extensive evaluation of our proposed approach on real large-scale datasets from both Twitter and Flickr, and show that it significantly outperforms meaningful and competitive baselines.

查看原文本刊更多论文

使用社会认可网络挖掘标签

社交系统上的实体，如Twitter上的用户和Flickr上的图像，是许多有趣应用程序的核心:它们可以在搜索结果中排名，推荐给用户，或用于上下文广告。这些应用程序假定了解实体的性质和特征属性。对这些知识进行编码的有效方法是以标签的形式。未标记的实体实际上是不可访问的，因为很难检索或与之交互。为了解决这个问题，一些平台允许用户手动标记实体。然而，虽然这样的标签可以提供信息，但它们通常是不充分的、琐碎的、模棱两可的，甚至是完全错误的。已经提出了许多自动标记方法来解决这些问题。然而，它们中的大多数都需要预先存在的高质量标记或需要标记的每个实体的描述性文本。在我们的工作中，我们提出了一种基于社会认可的方法，该方法不受这些限制。实际上，每个主要的社交网络平台都允许用户为他们认为有吸引力的实体背书。例如“关注”Twitter用户或“收藏”Flickr照片。这些背书是丰富的，直接抓住了用户的偏好。在本文中，我们提出并解决了使用底层社会背书网络为社会系统中的实体提取有用标签的问题。我们的工作利用主题建模技术来捕获用户的兴趣，然后使用它们为他们认可的实体提取相关和描述性标签。我们在Twitter和Flickr的真实大规模数据集上对我们提出的方法进行了广泛的评估，并表明它明显优于有意义和有竞争力的基线。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval

自引率

0.00%

发文量