{"title":"Spam and Sentiment Analysis Model for Twitter Data using Statistical Learning","authors":"Anita, D. Gupta, Ashish Kumar","doi":"10.1145/2983402.2983404","DOIUrl":null,"url":null,"abstract":"The past empirical work of twitter spam detection and sentiment analysis is based on random selection of features for the generation of classification models. This paper focus on the selection of model by applying multiple linear regression using stat models for fitting n dimensional hyper plane predictor (i.e. Twitter features) to our response variable (i.e. Spam and sentiments). This paper includes following parts: 1) Spam Detection Classifier 2) Bayesian and Log-Likelihood based sentiment classifier 3) Evaluation of classification system using different machine learning algorithms (i.e. Binomial, CART and Random Forest). Our experimental evaluation demonstrates that the efficiency of Random Forest is higher compared to other algorithms of the proposed classification system.","PeriodicalId":283626,"journal":{"name":"Proceedings of the Third International Symposium on Computer Vision and the Internet","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2016-09-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the Third International Symposium on Computer Vision and the Internet","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/2983402.2983404","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 1
Abstract
The past empirical work of twitter spam detection and sentiment analysis is based on random selection of features for the generation of classification models. This paper focus on the selection of model by applying multiple linear regression using stat models for fitting n dimensional hyper plane predictor (i.e. Twitter features) to our response variable (i.e. Spam and sentiments). This paper includes following parts: 1) Spam Detection Classifier 2) Bayesian and Log-Likelihood based sentiment classifier 3) Evaluation of classification system using different machine learning algorithms (i.e. Binomial, CART and Random Forest). Our experimental evaluation demonstrates that the efficiency of Random Forest is higher compared to other algorithms of the proposed classification system.