Rachmadian Trihatmaja, Yudistira Dwi Wardhana Asnar
{"title":"Improving the Performance of Collaborative Filtering Using Outlier Labeling, Clustering, and Association Rule Mining","authors":"Rachmadian Trihatmaja, Yudistira Dwi Wardhana Asnar","doi":"10.1109/ICODSE.2018.8705883","DOIUrl":null,"url":null,"abstract":"Collaborative Filtering (CF) is a popular recommendation method because it can provide recommendations personally. Under conditions of data sparsity, CF recommendation systems are known to have low accuracy because available historical information is not enough to properly identify preferences. Based on experiments conducted by researchers, the factor that limits the accuracy of the CF recommendation system is the number of recommendation items that exceed the system requirements. The existence of outliers with too much items also affect the recommendation results. This study attempted to apply model-based CFs to improve the accuracy of CF recommendations under data sparsity conditions. Conduct outlier labeling, clustering, and association rule mining implemented from preprocessing as a combination of data processing methods to generate recommendation items. Experiments conducted on Groceries dataset, a real-world point-of-sale transactions data from grocery outlet. The results of the evaluation indicate that the proposed method can improve the accuracy of basic CF by 26% with the quality improvement of the recommendation result by 24%.","PeriodicalId":362422,"journal":{"name":"2018 5th International Conference on Data and Software Engineering (ICoDSE)","volume":"122 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2018-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2018 5th International Conference on Data and Software Engineering (ICoDSE)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICODSE.2018.8705883","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
Collaborative Filtering (CF) is a popular recommendation method because it can provide recommendations personally. Under conditions of data sparsity, CF recommendation systems are known to have low accuracy because available historical information is not enough to properly identify preferences. Based on experiments conducted by researchers, the factor that limits the accuracy of the CF recommendation system is the number of recommendation items that exceed the system requirements. The existence of outliers with too much items also affect the recommendation results. This study attempted to apply model-based CFs to improve the accuracy of CF recommendations under data sparsity conditions. Conduct outlier labeling, clustering, and association rule mining implemented from preprocessing as a combination of data processing methods to generate recommendation items. Experiments conducted on Groceries dataset, a real-world point-of-sale transactions data from grocery outlet. The results of the evaluation indicate that the proposed method can improve the accuracy of basic CF by 26% with the quality improvement of the recommendation result by 24%.