R. Shreyas, D. Akshata, B. S. Mahanand, B. Shagun, C. Abhishek
{"title":"使用随机森林回归预测网络文章的受欢迎程度","authors":"R. Shreyas, D. Akshata, B. S. Mahanand, B. Shagun, C. Abhishek","doi":"10.1109/CCIP.2016.7802890","DOIUrl":null,"url":null,"abstract":"Predictive analysis using machine learning has been gaining popularity in recent times. In this paper, the Random Forest regression model is used to predict popularity of articles from the Online News Popularity data set. The performance of the Random Forest model is investigated and compared with other models. Impact of standardization, regularization, correlation, high bias/high variance and feature selection on the learning models are also studied. Results indicate that, the Random Forest approach predicts popular/unpopular articles with an accuracy of 88.8%.","PeriodicalId":354589,"journal":{"name":"2016 Second International Conference on Cognitive Computing and Information Processing (CCIP)","volume":"4 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-08-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"22","resultStr":"{\"title\":\"Predicting popularity of online articles using Random Forest regression\",\"authors\":\"R. Shreyas, D. Akshata, B. S. Mahanand, B. Shagun, C. Abhishek\",\"doi\":\"10.1109/CCIP.2016.7802890\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Predictive analysis using machine learning has been gaining popularity in recent times. In this paper, the Random Forest regression model is used to predict popularity of articles from the Online News Popularity data set. The performance of the Random Forest model is investigated and compared with other models. Impact of standardization, regularization, correlation, high bias/high variance and feature selection on the learning models are also studied. Results indicate that, the Random Forest approach predicts popular/unpopular articles with an accuracy of 88.8%.\",\"PeriodicalId\":354589,\"journal\":{\"name\":\"2016 Second International Conference on Cognitive Computing and Information Processing (CCIP)\",\"volume\":\"4 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2016-08-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"22\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2016 Second International Conference on Cognitive Computing and Information Processing (CCIP)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCIP.2016.7802890\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2016 Second International Conference on Cognitive Computing and Information Processing (CCIP)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCIP.2016.7802890","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Predicting popularity of online articles using Random Forest regression
Predictive analysis using machine learning has been gaining popularity in recent times. In this paper, the Random Forest regression model is used to predict popularity of articles from the Online News Popularity data set. The performance of the Random Forest model is investigated and compared with other models. Impact of standardization, regularization, correlation, high bias/high variance and feature selection on the learning models are also studied. Results indicate that, the Random Forest approach predicts popular/unpopular articles with an accuracy of 88.8%.