{"title":"协同过滤中的灰羊用户识别:一种基于分布的技术","authors":"Benjamin Gras, A. Brun, A. Boyer","doi":"10.1145/2930238.2930242","DOIUrl":null,"url":null,"abstract":"The collaborative filtering (CF) approach in recommender systems assumes that users' preferences are consistent among users. Although accurate, this approach fails on some users. We presume that some of these users belong to a small community of users who have unusual preferences, such users are not compliant with the CF underlying assumption. They are grey sheep users. This paper aims at accurately identifying grey sheep users. We introduce a new distribution-based grey sheep users identification technique, that borrows from outlier detection and from information retrieval, while taking into account the specificities of preference data on which CF relies: extreme sparsity, imprecision and users' bias. The experimental evaluation conducted on a state-of-the-art dataset shows that this new distribution-based technique outperforms state-of-the-art grey sheep users identification techniques.","PeriodicalId":339100,"journal":{"name":"Proceedings of the 2016 Conference on User Modeling Adaptation and Personalization","volume":"278 5","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-07-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":"{\"title\":\"Identifying Grey Sheep Users in Collaborative Filtering: A Distribution-Based Technique\",\"authors\":\"Benjamin Gras, A. Brun, A. Boyer\",\"doi\":\"10.1145/2930238.2930242\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The collaborative filtering (CF) approach in recommender systems assumes that users' preferences are consistent among users. Although accurate, this approach fails on some users. We presume that some of these users belong to a small community of users who have unusual preferences, such users are not compliant with the CF underlying assumption. They are grey sheep users. This paper aims at accurately identifying grey sheep users. We introduce a new distribution-based grey sheep users identification technique, that borrows from outlier detection and from information retrieval, while taking into account the specificities of preference data on which CF relies: extreme sparsity, imprecision and users' bias. The experimental evaluation conducted on a state-of-the-art dataset shows that this new distribution-based technique outperforms state-of-the-art grey sheep users identification techniques.\",\"PeriodicalId\":339100,\"journal\":{\"name\":\"Proceedings of the 2016 Conference on User Modeling Adaptation and Personalization\",\"volume\":\"278 5\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-07-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"22\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"Proceedings of the 2016 Conference on User Modeling Adaptation and Personalization\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1145/2930238.2930242\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2016 Conference on User Modeling Adaptation and Personalization","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2930238.2930242","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Identifying Grey Sheep Users in Collaborative Filtering: A Distribution-Based Technique
The collaborative filtering (CF) approach in recommender systems assumes that users' preferences are consistent among users. Although accurate, this approach fails on some users. We presume that some of these users belong to a small community of users who have unusual preferences, such users are not compliant with the CF underlying assumption. They are grey sheep users. This paper aims at accurately identifying grey sheep users. We introduce a new distribution-based grey sheep users identification technique, that borrows from outlier detection and from information retrieval, while taking into account the specificities of preference data on which CF relies: extreme sparsity, imprecision and users' bias. The experimental evaluation conducted on a state-of-the-art dataset shows that this new distribution-based technique outperforms state-of-the-art grey sheep users identification techniques.