User tweets based genre prediction and movie recommendation using LSI and SVD

Sakshi Bansal, Chetna Gupta, Anuja Arora
{"title":"User tweets based genre prediction and movie recommendation using LSI and SVD","authors":"Sakshi Bansal, Chetna Gupta, Anuja Arora","doi":"10.1109/IC3.2016.7880220","DOIUrl":null,"url":null,"abstract":"The emerging popularity and raise in users' posts on social media gave birth to numerous research challenges. Out of all challenges users' centric context information based recommendation is one prime research area to recommend jobs, events and movies. Here in this research work we focus on movie context aware recommendation and for this purpose, we analyze users' posted movie tweets to understand their intentions for the same. Therefore, the objective of this research work is to predict genre of movies based on user's posted movie tweets and recommending movies to users' according to predicted genre. For this purpose, we pre-processed twitter extracted movie tweets using tokenization, porter stemming, stop word removal and use Word-Net dictionary for synonym matching. Further, we apply Latent Semantic Indexing technique which in turn involves Singular Value Decomposition on this pre-processed data and predicts genre on the basis of IMDb movie genre categorization. The predicted genre conveys the movie interest of the user and to recommend movie on the basis of predicted genre which is measured through euclidean distance. We have extracted IMdb given movie data and further predicted genre using our proposed technique. To validate this we divided our dataset using pareto principle and matched with IMDb given genre data set and achieved approximate 70% accuracy using our approach.","PeriodicalId":294210,"journal":{"name":"2016 Ninth International Conference on Contemporary Computing (IC3)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Ninth International Conference on Contemporary Computing (IC3)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IC3.2016.7880220","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 16

Abstract

The emerging popularity and raise in users' posts on social media gave birth to numerous research challenges. Out of all challenges users' centric context information based recommendation is one prime research area to recommend jobs, events and movies. Here in this research work we focus on movie context aware recommendation and for this purpose, we analyze users' posted movie tweets to understand their intentions for the same. Therefore, the objective of this research work is to predict genre of movies based on user's posted movie tweets and recommending movies to users' according to predicted genre. For this purpose, we pre-processed twitter extracted movie tweets using tokenization, porter stemming, stop word removal and use Word-Net dictionary for synonym matching. Further, we apply Latent Semantic Indexing technique which in turn involves Singular Value Decomposition on this pre-processed data and predicts genre on the basis of IMDb movie genre categorization. The predicted genre conveys the movie interest of the user and to recommend movie on the basis of predicted genre which is measured through euclidean distance. We have extracted IMdb given movie data and further predicted genre using our proposed technique. To validate this we divided our dataset using pareto principle and matched with IMDb given genre data set and achieved approximate 70% accuracy using our approach.
基于用户推文的类型预测和基于LSI和SVD的电影推荐
用户在社交媒体上的帖子越来越受欢迎,引发了许多研究挑战。在所有挑战中,以用户为中心的基于上下文信息的推荐是推荐工作、事件和电影的一个主要研究领域。在这项研究工作中,我们专注于电影上下文感知推荐,为此,我们分析用户发布的电影推文,以了解他们的意图。因此,本研究工作的目的是根据用户发布的电影推文预测电影类型,并根据预测的类型向用户推荐电影。为此,我们使用标记化、波特词干提取、停止词去除和使用word - net词典进行同义词匹配对twitter进行预处理。进一步,我们应用潜在语义索引技术,该技术对预处理数据进行奇异值分解,并在IMDb电影类型分类的基础上预测类型。预测的类型传达了用户对电影的兴趣,并在预测类型的基础上推荐电影,预测类型通过欧几里得距离测量。我们已经提取了IMdb给定的电影数据,并使用我们提出的技术进一步预测了类型。为了验证这一点,我们使用帕累托原则划分数据集,并与给定类型数据集的IMDb进行匹配,使用我们的方法获得了大约70%的准确率。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信