{"title":"支持向量机在Twitter产品品牌情感分类中的优化","authors":"Jao Allen Banados, K. Espinosa","doi":"10.1109/IISA.2014.6878768","DOIUrl":null,"url":null,"abstract":"This paper involves giving a better solution in optimizing Support Vector Machine in classifying sentiments towards a product brand. Sentiment analysis rose to solve the problem of classifying sentiments and classifying as to positive or negative feedback towards a certain product brands. Using the Support Vector Machine learning algorithm, this study aims to improve the algorithm's accuracy through choice of kernel and proper tuning of SVM hyper-parameters as core factors in contributing to SVM accuracy, having a huge amount of training sets in order to widen the hyper plane of vectors and strong support vectors. The sentiments are gathered using the Twitter API and are pre-processed to filter unnecessary words. To be able to use the given tool, the pre-processed sentiments are converted to SVM format. By the given default parameters of the SVM tool used, with radial basis function as kernel type. The SVM type used is C-SVC, a multi-class classification. A training set is produced and is used as the training model for test sets and as of the initial results. The model produced an accuracy of 63.54% using SVM with the said default parameters and used 3768 tweets for training set.","PeriodicalId":298835,"journal":{"name":"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Optimizing Support Vector Machine in classifying sentiments on product brands from Twitter\",\"authors\":\"Jao Allen Banados, K. Espinosa\",\"doi\":\"10.1109/IISA.2014.6878768\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper involves giving a better solution in optimizing Support Vector Machine in classifying sentiments towards a product brand. Sentiment analysis rose to solve the problem of classifying sentiments and classifying as to positive or negative feedback towards a certain product brands. Using the Support Vector Machine learning algorithm, this study aims to improve the algorithm's accuracy through choice of kernel and proper tuning of SVM hyper-parameters as core factors in contributing to SVM accuracy, having a huge amount of training sets in order to widen the hyper plane of vectors and strong support vectors. The sentiments are gathered using the Twitter API and are pre-processed to filter unnecessary words. To be able to use the given tool, the pre-processed sentiments are converted to SVM format. By the given default parameters of the SVM tool used, with radial basis function as kernel type. The SVM type used is C-SVC, a multi-class classification. A training set is produced and is used as the training model for test sets and as of the initial results. The model produced an accuracy of 63.54% using SVM with the said default parameters and used 3768 tweets for training set.\",\"PeriodicalId\":298835,\"journal\":{\"name\":\"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IISA.2014.6878768\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA.2014.6878768","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
Optimizing Support Vector Machine in classifying sentiments on product brands from Twitter
This paper involves giving a better solution in optimizing Support Vector Machine in classifying sentiments towards a product brand. Sentiment analysis rose to solve the problem of classifying sentiments and classifying as to positive or negative feedback towards a certain product brands. Using the Support Vector Machine learning algorithm, this study aims to improve the algorithm's accuracy through choice of kernel and proper tuning of SVM hyper-parameters as core factors in contributing to SVM accuracy, having a huge amount of training sets in order to widen the hyper plane of vectors and strong support vectors. The sentiments are gathered using the Twitter API and are pre-processed to filter unnecessary words. To be able to use the given tool, the pre-processed sentiments are converted to SVM format. By the given default parameters of the SVM tool used, with radial basis function as kernel type. The SVM type used is C-SVC, a multi-class classification. A training set is produced and is used as the training model for test sets and as of the initial results. The model produced an accuracy of 63.54% using SVM with the said default parameters and used 3768 tweets for training set.