Muhammad Umair, Iffraah Rehman, Shamim Akhtar, Waqar Khan, Haider Abbas, R. Choudhary
Journal of Independent Studies and Research Computing. Journal Article, published 2022-01-27. DOI: 10.31645/jisrc.46.19.2.8 (https://doi.org/10.31645/jisrc.46.19.2.8)
A Comprehensive Comparative Evaluation of Machine Learning Algorithms on Facebook Comment Dataset
Data mining is an emerging technique with applications in areas such as health care, education, travel, social media, and banking. The data can be either labeled or unlabeled. Social media platforms in particular generate a vast amount of data, which can be of immense importance because data mining can uncover a great deal of hidden information in it. In this paper, machine-learning algorithms such as Decision Trees, SVM, and Linear Regression, along with their variants, are applied to the Facebook comment dataset obtained from the UCI Machine Learning Repository. The dataset has 40,949 instances and 54 attributes. The goal is to predict the number of comments a Facebook post will receive based on various conditions. The results indicate that the Fine Gaussian variant of SVM yielded the highest prediction accuracy. The evaluation used the following measures: average testing accuracy (%), Root Mean Square Error (RMSE), R-squared, Mean Square Error (MSE), Mean Absolute Error (MAE), prediction speed (Obs/sec), and training time (Machine cycle). It is concluded that SVM is an ideal choice for solving prediction problems associated with social media data.
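As a rough illustration of the error measures named in the abstract, the sketch below computes MSE, RMSE, MAE, and R-squared for a regression task using their standard definitions. It is not the authors' code: the function name `regression_metrics` and the comment counts in `actual` and `predicted` are made up for illustration and do not come from the paper or the UCI dataset.

```python
import math


def regression_metrics(y_true, y_pred):
    """Return MSE, RMSE, MAE and R-squared for paired true/predicted values."""
    if len(y_true) != len(y_pred) or not y_true:
        raise ValueError("inputs must be non-empty and of equal length")

    n = len(y_true)
    errors = [t - p for t, p in zip(y_true, y_pred)]

    # Mean Square Error and its square root (RMSE)
    mse = sum(e * e for e in errors) / n
    rmse = math.sqrt(mse)

    # Mean Absolute Error
    mae = sum(abs(e) for e in errors) / n

    # R-squared: 1 - (residual sum of squares / total sum of squares)
    mean_true = sum(y_true) / n
    ss_res = sum(e * e for e in errors)
    ss_tot = sum((t - mean_true) ** 2 for t in y_true)
    r2 = 1 - ss_res / ss_tot if ss_tot > 0 else float("nan")

    return {"MSE": mse, "RMSE": rmse, "MAE": mae, "R2": r2}


# Hypothetical comment counts for five posts (illustrative values only)
actual = [0, 2, 5, 11, 30]
predicted = [1, 2, 4, 9, 33]

metrics = regression_metrics(actual, predicted)
for name, value in metrics.items():
    print(f"{name}: {value:.4f}")
```

Lower MSE, RMSE, and MAE indicate smaller prediction errors, while an R-squared closer to 1 indicates that the model explains more of the variance in the comment counts. Prediction speed and training time depend on the tooling and hardware, so they are left out of this sketch.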