预测Facebook评论量的机器学习方法

Alaa Elsakran, Abdullah Al Amin, A. Alzaatreh
{"title":"预测Facebook评论量的机器学习方法","authors":"Alaa Elsakran, Abdullah Al Amin, A. Alzaatreh","doi":"10.1109/ICD47981.2019.9105863","DOIUrl":null,"url":null,"abstract":"There is an enormous amount of data uploaded to Facebook every day. Therefore, there is an essential need to analyze this data for many purposes. In this paper, a deep analysis on Facebook comment volume prediction have been conducted to demonstrate the effect of the number of comments on marketing. The data used consists of 603,813 observations provided by Facebook. It includes fifty-three input attributes that have been used to predict the number of comments a post is going to have in the next H hours. Moreover, several Machine Learning techniques have been applied to analyze this considerable amount of data. For instance, Stepwise Linear Regression, Gradient Boosting, Neural Network, and Decision Tree models. Findings indicate that Neural Network model outperformed the other models used in previous studies with the smallest error; when Stepwise Linear Regression is used as a variable selection for it. However, Gradient Boosting outperformed other models with a Hits@10 score of 6.7. Additionally, one of the most crucial outcomes of this study is determining the most significant variables. Particularly, it was found that the publication day of the post, number of check-ins, shares or likes on the page, and the number of comments in the last 24 or 48 hours were the most important variables. These variables contribute to garner more comments, which results in more viewers and higher profit for the marketer. The study concludes with some limitations and suggestions for future research.","PeriodicalId":277894,"journal":{"name":"2019 International Conference on Digitization (ICD)","volume":"56 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-11-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Machine Learning Approach to Predict Facebook Comment Volume\",\"authors\":\"Alaa Elsakran, Abdullah Al Amin, A. Alzaatreh\",\"doi\":\"10.1109/ICD47981.2019.9105863\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"There is an enormous amount of data uploaded to Facebook every day. Therefore, there is an essential need to analyze this data for many purposes. In this paper, a deep analysis on Facebook comment volume prediction have been conducted to demonstrate the effect of the number of comments on marketing. The data used consists of 603,813 observations provided by Facebook. It includes fifty-three input attributes that have been used to predict the number of comments a post is going to have in the next H hours. Moreover, several Machine Learning techniques have been applied to analyze this considerable amount of data. For instance, Stepwise Linear Regression, Gradient Boosting, Neural Network, and Decision Tree models. Findings indicate that Neural Network model outperformed the other models used in previous studies with the smallest error; when Stepwise Linear Regression is used as a variable selection for it. However, Gradient Boosting outperformed other models with a Hits@10 score of 6.7. Additionally, one of the most crucial outcomes of this study is determining the most significant variables. Particularly, it was found that the publication day of the post, number of check-ins, shares or likes on the page, and the number of comments in the last 24 or 48 hours were the most important variables. These variables contribute to garner more comments, which results in more viewers and higher profit for the marketer. The study concludes with some limitations and suggestions for future research.\",\"PeriodicalId\":277894,\"journal\":{\"name\":\"2019 International Conference on Digitization (ICD)\",\"volume\":\"56 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-11-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2019 International Conference on Digitization (ICD)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/ICD47981.2019.9105863\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2019 International Conference on Digitization (ICD)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICD47981.2019.9105863","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0

摘要

每天都有大量的数据上传到Facebook上。因此,有必要为许多目的分析这些数据。本文对Facebook评论量预测进行了深入分析,以证明评论数量对营销的影响。所使用的数据由Facebook提供的603,813个观察数据组成。它包括53个输入属性,用于预测在接下来的H小时内帖子将会有多少评论。此外,已经应用了几种机器学习技术来分析这大量的数据。例如,逐步线性回归,梯度增强,神经网络和决策树模型。研究结果表明,神经网络模型以最小的误差优于以往研究中使用的其他模型;当使用逐步线性回归作为变量选择时。然而,Gradient Boosting的Hits@10得分为6.7,优于其他模型。此外,本研究最重要的结果之一是确定最重要的变量。特别是,研究发现,帖子的发布日期、签到次数、页面上的分享或点赞次数以及最近24或48小时内的评论次数是最重要的变量。这些变量有助于获得更多的评论,从而为营销人员带来更多的观众和更高的利润。最后提出了研究的局限性和对未来研究的建议。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Machine Learning Approach to Predict Facebook Comment Volume
There is an enormous amount of data uploaded to Facebook every day. Therefore, there is an essential need to analyze this data for many purposes. In this paper, a deep analysis on Facebook comment volume prediction have been conducted to demonstrate the effect of the number of comments on marketing. The data used consists of 603,813 observations provided by Facebook. It includes fifty-three input attributes that have been used to predict the number of comments a post is going to have in the next H hours. Moreover, several Machine Learning techniques have been applied to analyze this considerable amount of data. For instance, Stepwise Linear Regression, Gradient Boosting, Neural Network, and Decision Tree models. Findings indicate that Neural Network model outperformed the other models used in previous studies with the smallest error; when Stepwise Linear Regression is used as a variable selection for it. However, Gradient Boosting outperformed other models with a Hits@10 score of 6.7. Additionally, one of the most crucial outcomes of this study is determining the most significant variables. Particularly, it was found that the publication day of the post, number of check-ins, shares or likes on the page, and the number of comments in the last 24 or 48 hours were the most important variables. These variables contribute to garner more comments, which results in more viewers and higher profit for the marketer. The study concludes with some limitations and suggestions for future research.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信