Rachmadian Trihatmaja, Yudistira Dwi Wardhana Asnar
{"title":"利用离群点标记、聚类和关联规则挖掘提高协同过滤性能","authors":"Rachmadian Trihatmaja, Yudistira Dwi Wardhana Asnar","doi":"10.1109/ICODSE.2018.8705883","DOIUrl":null,"url":null,"abstract":"Collaborative Filtering (CF) is a popular recommendation method because it can provide recommendations personally. Under conditions of data sparsity, CF recommendation systems are known to have low accuracy because available historical information is not enough to properly identify preferences. Based on experiments conducted by researchers, the factor that limits the accuracy of the CF recommendation system is the number of recommendation items that exceed the system requirements. The existence of outliers with too much items also affect the recommendation results. This study attempted to apply model-based CFs to improve the accuracy of CF recommendations under data sparsity conditions. Conduct outlier labeling, clustering, and association rule mining implemented from preprocessing as a combination of data processing methods to generate recommendation items. Experiments conducted on Groceries dataset, a real-world point-of-sale transactions data from grocery outlet. The results of the evaluation indicate that the proposed method can improve the accuracy of basic CF by 26% with the quality improvement of the recommendation result by 24%.","PeriodicalId":362422,"journal":{"name":"2018 5th International Conference on Data and Software Engineering (ICoDSE)","volume":"122 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"Improving the Performance of Collaborative Filtering Using Outlier Labeling, Clustering, and Association Rule Mining\",\"authors\":\"Rachmadian Trihatmaja, Yudistira Dwi Wardhana Asnar\",\"doi\":\"10.1109/ICODSE.2018.8705883\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Collaborative Filtering (CF) is a popular recommendation method because it can provide recommendations personally. Under conditions of data sparsity, CF recommendation systems are known to have low accuracy because available historical information is not enough to properly identify preferences. Based on experiments conducted by researchers, the factor that limits the accuracy of the CF recommendation system is the number of recommendation items that exceed the system requirements. The existence of outliers with too much items also affect the recommendation results. This study attempted to apply model-based CFs to improve the accuracy of CF recommendations under data sparsity conditions. Conduct outlier labeling, clustering, and association rule mining implemented from preprocessing as a combination of data processing methods to generate recommendation items. Experiments conducted on Groceries dataset, a real-world point-of-sale transactions data from grocery outlet. The results of the evaluation indicate that the proposed method can improve the accuracy of basic CF by 26% with the quality improvement of the recommendation result by 24%.\",\"PeriodicalId\":362422,\"journal\":{\"name\":\"2018 5th International Conference on Data and Software Engineering (ICoDSE)\",\"volume\":\"122 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2018-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2018 5th International Conference on Data and Software Engineering (ICoDSE)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICODSE.2018.8705883\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 5th International Conference on Data and Software Engineering (ICoDSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICODSE.2018.8705883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Improving the Performance of Collaborative Filtering Using Outlier Labeling, Clustering, and Association Rule Mining
Collaborative Filtering (CF) is a popular recommendation method because it can provide recommendations personally. Under conditions of data sparsity, CF recommendation systems are known to have low accuracy because available historical information is not enough to properly identify preferences. Based on experiments conducted by researchers, the factor that limits the accuracy of the CF recommendation system is the number of recommendation items that exceed the system requirements. The existence of outliers with too much items also affect the recommendation results. This study attempted to apply model-based CFs to improve the accuracy of CF recommendations under data sparsity conditions. Conduct outlier labeling, clustering, and association rule mining implemented from preprocessing as a combination of data processing methods to generate recommendation items. Experiments conducted on Groceries dataset, a real-world point-of-sale transactions data from grocery outlet. The results of the evaluation indicate that the proposed method can improve the accuracy of basic CF by 26% with the quality improvement of the recommendation result by 24%.