Classification of Tweets for Video Streaming Services’ Content Recommendation on Twitter

Indonesian Journal of Applied Statistics Pub Date : 2021-05-30 DOI:10.13057/IJAS.V4I1.49051

Kiki Ferawati, S. Z. Jannah

引用次数: 1

Abstract

Streaming services were popular platforms often visited by internet users. However, the abundance of content can be confusing for its users, prompting them to look for a recommendation from other people. Some of the users looked for content to enjoy with the help of Twitter. However, there were irrelevant tweets shown in the results, showing sentences not related at all to the content in the streaming services platform. This study addressed the classification of relevant and irrelevant tweets for streaming services’ content recommendation using random forests and the Convolutional Neural Network (CNN). The result showed that the CNN performed better in the test set with higher accuracy of 94% but slower in running time compared to the random forest. There were indeed distinctive characteristics between the two categories of the tweets. Finally, based on the resulting classification, users could identify the right words to use and avoid while searching on Twitter.Keywords: text mining, streaming services, classification, random forest, CNN

查看原文本刊更多论文

面向视频流媒体服务内容推荐的推文分类

流媒体服务是互联网用户经常访问的热门平台。然而，丰富的内容可能会让用户感到困惑，促使他们寻找其他人的推荐。一些用户在Twitter的帮助下寻找可以享受的内容。然而，结果中显示了不相关的推文，显示了与流媒体服务平台上的内容完全不相关的句子。本研究利用随机森林和卷积神经网络(CNN)解决了流媒体服务内容推荐中相关和不相关推文的分类问题。结果表明，与随机森林相比，CNN在测试集中表现更好，准确率高达94%，但运行时间较慢。这两类推文之间确实存在明显的特征。最后，基于分类结果，用户可以在Twitter上搜索时识别出正确的使用和避免使用的单词。关键词:文本挖掘，流媒体服务，分类，随机森林，CNN

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

Indonesian Journal of Applied Statistics

自引率

0.00%

发文量