邻域协同过滤参数对重磅偏差性能的影响

Emre Yalcin
{"title":"邻域协同过滤参数对重磅偏差性能的影响","authors":"Emre Yalcin","doi":"10.35377/saucis...1065794","DOIUrl":null,"url":null,"abstract":"Collaborative filtering algorithms are efficient tools for providing recommendations with reasonable accuracy performances to individuals. However, the previous research has realized that these algorithms are undesirably biased towards blockbuster items. i.e., both popular and highly-liked items, in their recommendations, resulting in recommendation lists dominated by such blockbuster items. As one most prominent types of collaborative filtering approaches, neighborhood-based algorithms aim to produce recommendations based on neighborhoods constructed based on similarities between users or items. Therefore, the utilized similarity function and the size of the neighborhoods are critical parameters on their recommendation performances. This study considers three well-known similarity functions, i.e., Pearson, Cosine, and Mean Squared Difference, and varying neighborhood sizes and observes how they affect the algorithms’ blockbuster bias and accuracy performances. The extensive experiments conducted on two benchmark data collections conclude that as the size of neighborhoods decreases, these algorithms generally become more vulnerable to blockbuster bias while their accuracy increases. The experimental works also show that using the Cosine metric is superior to other similarity functions in producing recommendations where blockbuster bias is treated more; however, it leads to having unqualified recommendations in terms of predictive accuracy as they are usually conflicting goals.","PeriodicalId":257636,"journal":{"name":"Sakarya University Journal of Computer and Information Sciences","volume":"21 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-06-15","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Effects of neighborhood-based collaborative filtering parameters on their blockbuster bias performances\",\"authors\":\"Emre Yalcin\",\"doi\":\"10.35377/saucis...1065794\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Collaborative filtering algorithms are efficient tools for providing recommendations with reasonable accuracy performances to individuals. However, the previous research has realized that these algorithms are undesirably biased towards blockbuster items. i.e., both popular and highly-liked items, in their recommendations, resulting in recommendation lists dominated by such blockbuster items. As one most prominent types of collaborative filtering approaches, neighborhood-based algorithms aim to produce recommendations based on neighborhoods constructed based on similarities between users or items. Therefore, the utilized similarity function and the size of the neighborhoods are critical parameters on their recommendation performances. This study considers three well-known similarity functions, i.e., Pearson, Cosine, and Mean Squared Difference, and varying neighborhood sizes and observes how they affect the algorithms’ blockbuster bias and accuracy performances. The extensive experiments conducted on two benchmark data collections conclude that as the size of neighborhoods decreases, these algorithms generally become more vulnerable to blockbuster bias while their accuracy increases. The experimental works also show that using the Cosine metric is superior to other similarity functions in producing recommendations where blockbuster bias is treated more; however, it leads to having unqualified recommendations in terms of predictive accuracy as they are usually conflicting goals.\",\"PeriodicalId\":257636,\"journal\":{\"name\":\"Sakarya University Journal of Computer and Information Sciences\",\"volume\":\"21 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2022-06-15\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Sakarya University Journal of Computer and Information Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.35377/saucis...1065794\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Sakarya University Journal of Computer and Information Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.35377/saucis...1065794","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

协同过滤算法是向个人提供具有合理精度性能的推荐的有效工具。然而,先前的研究已经意识到,这些算法对重磅产品有不可取的偏见。也就是说,在他们的推荐中,受欢迎的和高度喜欢的物品,导致推荐列表被这些重磅商品所主导。作为一种最突出的协同过滤方法,基于邻域的算法旨在根据用户或物品之间的相似性构建邻域,从而产生推荐。因此,使用的相似函数和邻域的大小是影响其推荐性能的关键参数。本研究考虑了三个众所周知的相似函数,即Pearson, cos和Mean Squared Difference,以及不同的邻域大小,并观察了它们如何影响算法的重磅偏差和准确性性能。在两个基准数据集上进行的大量实验得出结论,随着邻域大小的减小,这些算法通常更容易受到重磅炸弹偏见的影响,而它们的准确性却在提高。实验工作还表明,在重磅偏差较多的情况下,使用余弦度量在生成推荐时优于其他相似函数;然而,就预测准确性而言,它会导致不合格的建议,因为它们通常是相互冲突的目标。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Effects of neighborhood-based collaborative filtering parameters on their blockbuster bias performances
Collaborative filtering algorithms are efficient tools for providing recommendations with reasonable accuracy performances to individuals. However, the previous research has realized that these algorithms are undesirably biased towards blockbuster items. i.e., both popular and highly-liked items, in their recommendations, resulting in recommendation lists dominated by such blockbuster items. As one most prominent types of collaborative filtering approaches, neighborhood-based algorithms aim to produce recommendations based on neighborhoods constructed based on similarities between users or items. Therefore, the utilized similarity function and the size of the neighborhoods are critical parameters on their recommendation performances. This study considers three well-known similarity functions, i.e., Pearson, Cosine, and Mean Squared Difference, and varying neighborhood sizes and observes how they affect the algorithms’ blockbuster bias and accuracy performances. The extensive experiments conducted on two benchmark data collections conclude that as the size of neighborhoods decreases, these algorithms generally become more vulnerable to blockbuster bias while their accuracy increases. The experimental works also show that using the Cosine metric is superior to other similarity functions in producing recommendations where blockbuster bias is treated more; however, it leads to having unqualified recommendations in terms of predictive accuracy as they are usually conflicting goals.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信