A Comprehensive Comparative Evaluation of Machine Learning Algorithms on Facebook Comment Dataset

Muhammad Umair, Iffraah Rehman, Shamim Akhtar, Waqar Khan, Haider Abbas, R. Choudhary
{"title":"A Comprehensive Comparative Evaluation of Machine Learning Algorithms on Facebook Comment Dataset","authors":"Muhammad Umair, Iffraah Rehman, Shamim Akhtar, Waqar Khan, Haider Abbas, R. Choudhary","doi":"10.31645/jisrc.46.19.2.8","DOIUrl":null,"url":null,"abstract":"Data mining is an emerging technique with its application in various areas such as health care, education, travel, social media, and banking. The data can be either labeled or unlabeled. When it comes to social media, the various platforms generate an infinite amount of data. This data can be of immense importance as a lot of hidden information can be discovered after data mining. In this paper, machine-learning algorithms such as Decision Tress, SVM and Linear Regression and their variants are applied on Facebook comment dataset, obtained from UCI machine learning repository. The dataset has 40,949 instances and 54 attributes. The goal is to predict the number of comments a Facebook post will get based on various conditions. The results indicate that Fine Gaussian SVM variation of SVM yielded highest predication accuracy. The evaluation was done on different parameters such as average testing accuracy (%), Root Mean Square Error (RMSE), R- Squared, Mean Square Error (MSE), Mean Absolute Error (MAE), prediction speed (Obs/sec) and training time (Machine cycle). It is concluded that SVM is an ideal choice to solve prediction problems associated with social media data.","PeriodicalId":412730,"journal":{"name":"Journal of Independent Studies and Research Computing","volume":"70 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2022-01-27","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Journal of Independent Studies and Research Computing","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.31645/jisrc.46.19.2.8","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

Abstract

Data mining is an emerging technique with its application in various areas such as health care, education, travel, social media, and banking. The data can be either labeled or unlabeled. When it comes to social media, the various platforms generate an infinite amount of data. This data can be of immense importance as a lot of hidden information can be discovered after data mining. In this paper, machine-learning algorithms such as Decision Tress, SVM and Linear Regression and their variants are applied on Facebook comment dataset, obtained from UCI machine learning repository. The dataset has 40,949 instances and 54 attributes. The goal is to predict the number of comments a Facebook post will get based on various conditions. The results indicate that Fine Gaussian SVM variation of SVM yielded highest predication accuracy. The evaluation was done on different parameters such as average testing accuracy (%), Root Mean Square Error (RMSE), R- Squared, Mean Square Error (MSE), Mean Absolute Error (MAE), prediction speed (Obs/sec) and training time (Machine cycle). It is concluded that SVM is an ideal choice to solve prediction problems associated with social media data.
基于Facebook评论数据集的机器学习算法的综合比较评价
数据挖掘是一项新兴技术,在医疗、教育、旅游、社交媒体和银行等各个领域都有应用。数据可以标记,也可以不标记。当涉及到社交媒体时,各种平台产生了无限数量的数据。这些数据可能非常重要,因为在数据挖掘之后可以发现许多隐藏的信息。本文将Decision trees、SVM和Linear Regression等机器学习算法及其变体应用于Facebook评论数据集,该数据集来自UCI机器学习存储库。该数据集有40,949个实例和54个属性。目标是根据各种条件预测Facebook帖子的评论数量。结果表明,支持向量机的细高斯方差预测精度最高。对平均测试精度(%)、均方根误差(RMSE)、R平方、均方误差(MSE)、平均绝对误差(MAE)、预测速度(Obs/sec)和训练时间(机器周期)等参数进行评估。结果表明,支持向量机是解决社交媒体数据预测问题的理想选择。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信