{"title":"基于多元分析的网络新闻流行度预测","authors":"Caiyun Liu, Wenjie Wang, Yuqing Zhang, Ying Dong, Fannv He, Chensi Wu","doi":"10.1109/CIT.2017.36","DOIUrl":null,"url":null,"abstract":"An increasing number of online news triggers wide academic concern for the prediction of news popularity, which is affected by users' behaviors and not easy to predict. However, existing methods that predict the popularity of online news after publication are not timely enough, and predicting before publication lacks discriminatory features. This paper explores the variables which may affect news popularity and presents a novel methodology to predict the popularity of online news before publication. Through the observation of news, we first find that grammatical construction of titles can affect news popularity, and experiments show that this feature can improve R^2 statistics of the prediction model by 6.62% exactly. Besides, we improve traditional category and author features by using logarithmic conversion to views first and calculating a score of these features instead of stuffing them into learning models directly. Using these features and two other features, we finally predict news popularity in two aspects: whether the news will be popular and how many views the news ultimately attract.","PeriodicalId":378423,"journal":{"name":"2017 IEEE International Conference on Computer and Information Technology (CIT)","volume":"98 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"16","resultStr":"{\"title\":\"Predicting the Popularity of Online News Based on Multivariate Analysis\",\"authors\":\"Caiyun Liu, Wenjie Wang, Yuqing Zhang, Ying Dong, Fannv He, Chensi Wu\",\"doi\":\"10.1109/CIT.2017.36\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"An increasing number of online news triggers wide academic concern for the prediction of news popularity, which is affected by users' behaviors and not easy to predict. However, existing methods that predict the popularity of online news after publication are not timely enough, and predicting before publication lacks discriminatory features. This paper explores the variables which may affect news popularity and presents a novel methodology to predict the popularity of online news before publication. Through the observation of news, we first find that grammatical construction of titles can affect news popularity, and experiments show that this feature can improve R^2 statistics of the prediction model by 6.62% exactly. Besides, we improve traditional category and author features by using logarithmic conversion to views first and calculating a score of these features instead of stuffing them into learning models directly. Using these features and two other features, we finally predict news popularity in two aspects: whether the news will be popular and how many views the news ultimately attract.\",\"PeriodicalId\":378423,\"journal\":{\"name\":\"2017 IEEE International Conference on Computer and Information Technology (CIT)\",\"volume\":\"98 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"16\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE International Conference on Computer and Information Technology (CIT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CIT.2017.36\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE International Conference on Computer and Information Technology (CIT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CIT.2017.36","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Predicting the Popularity of Online News Based on Multivariate Analysis
An increasing number of online news triggers wide academic concern for the prediction of news popularity, which is affected by users' behaviors and not easy to predict. However, existing methods that predict the popularity of online news after publication are not timely enough, and predicting before publication lacks discriminatory features. This paper explores the variables which may affect news popularity and presents a novel methodology to predict the popularity of online news before publication. Through the observation of news, we first find that grammatical construction of titles can affect news popularity, and experiments show that this feature can improve R^2 statistics of the prediction model by 6.62% exactly. Besides, we improve traditional category and author features by using logarithmic conversion to views first and calculating a score of these features instead of stuffing them into learning models directly. Using these features and two other features, we finally predict news popularity in two aspects: whether the news will be popular and how many views the news ultimately attract.