社会化书签系统中为新资源推荐标签

Shweta Yagnik, Priyank Thakkar, K. Kotecha
{"title":"社会化书签系统中为新资源推荐标签","authors":"Shweta Yagnik, Priyank Thakkar, K. Kotecha","doi":"10.5121/IJDKP.2014.4102","DOIUrl":null,"url":null,"abstract":"Social bookmarking system is a web-based resource sharing system that allows users to upload, share and organize their resources i.e. bookmarks and publications. The system has shifted the paradigm of bookmarking from an individual activity limited to desktop to a collective activity on the web. It also facilitates user to annotate his resource with free form tags that leads to large communities of users to collaboratively create accessible repositories of web resources. Tagging process has its own challenges like ambiguity, redundancy or misspelled tags and sometimes user tends to avoid it as he has to describe tag at his own. The resultant tag space is noisy or very sparse and dilutes the purpose of tagging. The effective solution is Tag Recommendation System that automatically suggests appropriate set of tags to user while annotating resource. In this paper, we propose a framework that does not depend on tagging history of the resource or user and thereby capable of suggesting tags to the resources which are being submitted to the system first time. We model tag recommendation task as multi-label text classification problem and use Naive Bayes classifier as the base learner of the multilabel classifier. We experiment with Boolean, bag-of-words and term frequency-inverse document frequency (TFIDF) representation of the resources and fit appropriate distribution to the data based on the representation used. Impact of feature selection on the effectiveness of the tag recommendation is also studied. Effectiveness of the proposed framework is evaluated through precision, recall and f-measure metrics.","PeriodicalId":131153,"journal":{"name":"International Journal of Data Mining & Knowledge Management Process","volume":"51 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-01-31","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"Recommending Tags for New Resources in Social Bookmarking Systems\",\"authors\":\"Shweta Yagnik, Priyank Thakkar, K. Kotecha\",\"doi\":\"10.5121/IJDKP.2014.4102\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Social bookmarking system is a web-based resource sharing system that allows users to upload, share and organize their resources i.e. bookmarks and publications. The system has shifted the paradigm of bookmarking from an individual activity limited to desktop to a collective activity on the web. It also facilitates user to annotate his resource with free form tags that leads to large communities of users to collaboratively create accessible repositories of web resources. Tagging process has its own challenges like ambiguity, redundancy or misspelled tags and sometimes user tends to avoid it as he has to describe tag at his own. The resultant tag space is noisy or very sparse and dilutes the purpose of tagging. The effective solution is Tag Recommendation System that automatically suggests appropriate set of tags to user while annotating resource. In this paper, we propose a framework that does not depend on tagging history of the resource or user and thereby capable of suggesting tags to the resources which are being submitted to the system first time. We model tag recommendation task as multi-label text classification problem and use Naive Bayes classifier as the base learner of the multilabel classifier. We experiment with Boolean, bag-of-words and term frequency-inverse document frequency (TFIDF) representation of the resources and fit appropriate distribution to the data based on the representation used. Impact of feature selection on the effectiveness of the tag recommendation is also studied. Effectiveness of the proposed framework is evaluated through precision, recall and f-measure metrics.\",\"PeriodicalId\":131153,\"journal\":{\"name\":\"International Journal of Data Mining & Knowledge Management Process\",\"volume\":\"51 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-01-31\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Data Mining & Knowledge Management Process\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.5121/IJDKP.2014.4102\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Data Mining & Knowledge Management Process","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.5121/IJDKP.2014.4102","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

社会书签系统是一个基于网络的资源共享系统,允许用户上传、共享和组织他们的资源,即书签和出版物。该系统将书签的模式从局限于桌面的个人活动转变为网络上的集体活动。它还方便用户用自由形式的标签对资源进行注释,从而导致大型用户社区协作创建可访问的web资源存储库。标记过程有其自身的挑战,如歧义,冗余或拼写错误的标签,有时用户倾向于避免它,因为他必须自己描述标签。由此产生的标签空间是嘈杂的或非常稀疏的,并且削弱了标记的目的。有效的解决方案是标签推荐系统,该系统在标注资源的同时自动向用户推荐合适的标签集。在本文中,我们提出了一个框架,它不依赖于资源或用户的标记历史,从而能够向第一次提交给系统的资源建议标记。我们将标签推荐任务建模为多标签文本分类问题,并使用朴素贝叶斯分类器作为多标签分类器的基础学习器。我们对资源的布尔、词袋和词频率逆文档频率(TFIDF)表示进行了实验,并根据所使用的表示对数据进行了适当的分布。研究了特征选择对标签推荐效果的影响。通过精确度、召回率和f测量指标来评估所提出框架的有效性。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Recommending Tags for New Resources in Social Bookmarking Systems
Social bookmarking system is a web-based resource sharing system that allows users to upload, share and organize their resources i.e. bookmarks and publications. The system has shifted the paradigm of bookmarking from an individual activity limited to desktop to a collective activity on the web. It also facilitates user to annotate his resource with free form tags that leads to large communities of users to collaboratively create accessible repositories of web resources. Tagging process has its own challenges like ambiguity, redundancy or misspelled tags and sometimes user tends to avoid it as he has to describe tag at his own. The resultant tag space is noisy or very sparse and dilutes the purpose of tagging. The effective solution is Tag Recommendation System that automatically suggests appropriate set of tags to user while annotating resource. In this paper, we propose a framework that does not depend on tagging history of the resource or user and thereby capable of suggesting tags to the resources which are being submitted to the system first time. We model tag recommendation task as multi-label text classification problem and use Naive Bayes classifier as the base learner of the multilabel classifier. We experiment with Boolean, bag-of-words and term frequency-inverse document frequency (TFIDF) representation of the resources and fit appropriate distribution to the data based on the representation used. Impact of feature selection on the effectiveness of the tag recommendation is also studied. Effectiveness of the proposed framework is evaluated through precision, recall and f-measure metrics.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信