{"title":"Affect of Data Filter on Performance of Latent Semantic Analysis based Research Paper Recommender System","authors":"Javeria Almas, Usman Qamar","doi":"10.1109/ICCIA49625.2020.00017","DOIUrl":null,"url":null,"abstract":"Latent Semantic Analysis uses Singular Value Decomposition (SVD) to effectively retrieve relevant information from the information corpus. However, LSA has a high computational cost. In order to address this aspect, it is proposed to filter only those words carrying high semantic importance. The aim is to improve the execution time of semantic space construction and dimensionality reduction. We present how the use of data filter can effectively meet the proposed goals in comparison to baseline method of performing recommendations. The proposed system was assessed over a dataset of 80 articles (Titles and Abstracts). The results of the experiments show that the proposed system performed better in terms of elapsed time with an average precision of 85.54% (78.64% for baseline method) and an average recall of 92.96% (89.70% for baseline method).","PeriodicalId":237536,"journal":{"name":"2020 5th International Conference on Computational Intelligence and Applications (ICCIA)","volume":"32 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2020-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2020 5th International Conference on Computational Intelligence and Applications (ICCIA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICCIA49625.2020.00017","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Latent Semantic Analysis uses Singular Value Decomposition (SVD) to effectively retrieve relevant information from the information corpus. However, LSA has a high computational cost. In order to address this aspect, it is proposed to filter only those words carrying high semantic importance. The aim is to improve the execution time of semantic space construction and dimensionality reduction. We present how the use of data filter can effectively meet the proposed goals in comparison to baseline method of performing recommendations. The proposed system was assessed over a dataset of 80 articles (Titles and Abstracts). The results of the experiments show that the proposed system performed better in terms of elapsed time with an average precision of 85.54% (78.64% for baseline method) and an average recall of 92.96% (89.70% for baseline method).