{"title":"探索协同过滤推荐的集启发相似度量","authors":"Q. Le, Thi-Xinh Le","doi":"10.1109/KSE53942.2021.9648825","DOIUrl":null,"url":null,"abstract":"The similarity measure is an important component used in collaborative filtering recommender systems (CFRSs) to determine the set of users having the same behavior with regard to the selected items. The measure is typically defined on sets of real-valued or discrete-valued vectors. For discrete-valued vectors, similarity measures are inspired by the comparison of sets and the cardinality of sets. In this paper, we aim to explore set-inspired similarity measures for CFRSs, including Fuzzy sets index, Jaccard index, Sorensen coefficient, and Symmetric difference, with four collaborative filtering methods: (i) user-based, (ii) item-based, (iii) user clustering-based, and (iv) item clustering-based methods. We conduct extensive experiments to evaluate the effect of different measures on the benchmark datasets. An important result is that all four of these measures outperform the Pearson coefficient and Cosine measures in both recommendation effectiveness and computation time. Empirical evidence also shows that the Symmetric difference measure provides better results than all remaining measures.","PeriodicalId":130986,"journal":{"name":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","volume":"1999 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-11-10","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Exploring Set-Inspired Similarity Measures for Collaborative Filtering Recommendation\",\"authors\":\"Q. Le, Thi-Xinh Le\",\"doi\":\"10.1109/KSE53942.2021.9648825\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The similarity measure is an important component used in collaborative filtering recommender systems (CFRSs) to determine the set of users having the same behavior with regard to the selected items. The measure is typically defined on sets of real-valued or discrete-valued vectors. For discrete-valued vectors, similarity measures are inspired by the comparison of sets and the cardinality of sets. In this paper, we aim to explore set-inspired similarity measures for CFRSs, including Fuzzy sets index, Jaccard index, Sorensen coefficient, and Symmetric difference, with four collaborative filtering methods: (i) user-based, (ii) item-based, (iii) user clustering-based, and (iv) item clustering-based methods. We conduct extensive experiments to evaluate the effect of different measures on the benchmark datasets. An important result is that all four of these measures outperform the Pearson coefficient and Cosine measures in both recommendation effectiveness and computation time. Empirical evidence also shows that the Symmetric difference measure provides better results than all remaining measures.\",\"PeriodicalId\":130986,\"journal\":{\"name\":\"2021 13th International Conference on Knowledge and Systems Engineering (KSE)\",\"volume\":\"1999 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-11-10\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 13th International Conference on Knowledge and Systems Engineering (KSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/KSE53942.2021.9648825\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 13th International Conference on Knowledge and Systems Engineering (KSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/KSE53942.2021.9648825","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Exploring Set-Inspired Similarity Measures for Collaborative Filtering Recommendation
The similarity measure is an important component used in collaborative filtering recommender systems (CFRSs) to determine the set of users having the same behavior with regard to the selected items. The measure is typically defined on sets of real-valued or discrete-valued vectors. For discrete-valued vectors, similarity measures are inspired by the comparison of sets and the cardinality of sets. In this paper, we aim to explore set-inspired similarity measures for CFRSs, including Fuzzy sets index, Jaccard index, Sorensen coefficient, and Symmetric difference, with four collaborative filtering methods: (i) user-based, (ii) item-based, (iii) user clustering-based, and (iv) item clustering-based methods. We conduct extensive experiments to evaluate the effect of different measures on the benchmark datasets. An important result is that all four of these measures outperform the Pearson coefficient and Cosine measures in both recommendation effectiveness and computation time. Empirical evidence also shows that the Symmetric difference measure provides better results than all remaining measures.