支持向量机在Twitter产品品牌情感分类中的优化

IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications Pub Date : 2014-07-07 DOI:10.1109/IISA.2014.6878768

Jao Allen Banados, K. Espinosa

{"title":"支持向量机在Twitter产品品牌情感分类中的优化","authors":"Jao Allen Banados, K. Espinosa","doi":"10.1109/IISA.2014.6878768","DOIUrl":null,"url":null,"abstract":"This paper involves giving a better solution in optimizing Support Vector Machine in classifying sentiments towards a product brand. Sentiment analysis rose to solve the problem of classifying sentiments and classifying as to positive or negative feedback towards a certain product brands. Using the Support Vector Machine learning algorithm, this study aims to improve the algorithm's accuracy through choice of kernel and proper tuning of SVM hyper-parameters as core factors in contributing to SVM accuracy, having a huge amount of training sets in order to widen the hyper plane of vectors and strong support vectors. The sentiments are gathered using the Twitter API and are pre-processed to filter unnecessary words. To be able to use the given tool, the pre-processed sentiments are converted to SVM format. By the given default parameters of the SVM tool used, with radial basis function as kernel type. The SVM type used is C-SVC, a multi-class classification. A training set is produced and is used as the training model for test sets and as of the initial results. The model produced an accuracy of 63.54% using SVM with the said default parameters and used 3768 tweets for training set.","PeriodicalId":298835,"journal":{"name":"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Optimizing Support Vector Machine in classifying sentiments on product brands from Twitter\",\"authors\":\"Jao Allen Banados, K. Espinosa\",\"doi\":\"10.1109/IISA.2014.6878768\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper involves giving a better solution in optimizing Support Vector Machine in classifying sentiments towards a product brand. Sentiment analysis rose to solve the problem of classifying sentiments and classifying as to positive or negative feedback towards a certain product brands. Using the Support Vector Machine learning algorithm, this study aims to improve the algorithm's accuracy through choice of kernel and proper tuning of SVM hyper-parameters as core factors in contributing to SVM accuracy, having a huge amount of training sets in order to widen the hyper plane of vectors and strong support vectors. The sentiments are gathered using the Twitter API and are pre-processed to filter unnecessary words. To be able to use the given tool, the pre-processed sentiments are converted to SVM format. By the given default parameters of the SVM tool used, with radial basis function as kernel type. The SVM type used is C-SVC, a multi-class classification. A training set is produced and is used as the training model for test sets and as of the initial results. The model produced an accuracy of 63.54% using SVM with the said default parameters and used 3768 tweets for training set.\",\"PeriodicalId\":298835,\"journal\":{\"name\":\"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IISA.2014.6878768\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA.2014.6878768","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 15

摘要

本文提出了一种更好的方法来优化支持向量机对产品品牌的情感分类。情绪分析的兴起是为了解决对情绪进行分类和对某一产品品牌的正面或负面反馈进行分类的问题。本研究采用支持向量机学习算法，通过核的选择和SVM超参数的适当调整作为影响SVM精度的核心因素来提高算法的精度，通过大量的训练集来扩大向量和强支持向量的超平面。使用Twitter API收集情感，并进行预处理以过滤不必要的单词。为了能够使用给定的工具，将预处理的情感转换为支持向量机格式。由给定的默认参数所使用的SVM工具，以径向基函数为核类型。使用的支持向量机类型是C-SVC，一种多类分类。生成一个训练集，并将其用作测试集和初始结果的训练模型。该模型使用SVM与上述默认参数，使用3768条tweet作为训练集，得到的准确率为63.54%。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Optimizing Support Vector Machine in classifying sentiments on product brands from Twitter

This paper involves giving a better solution in optimizing Support Vector Machine in classifying sentiments towards a product brand. Sentiment analysis rose to solve the problem of classifying sentiments and classifying as to positive or negative feedback towards a certain product brands. Using the Support Vector Machine learning algorithm, this study aims to improve the algorithm's accuracy through choice of kernel and proper tuning of SVM hyper-parameters as core factors in contributing to SVM accuracy, having a huge amount of training sets in order to widen the hyper plane of vectors and strong support vectors. The sentiments are gathered using the Twitter API and are pre-processed to filter unnecessary words. To be able to use the given tool, the pre-processed sentiments are converted to SVM format. By the given default parameters of the SVM tool used, with radial basis function as kernel type. The SVM type used is C-SVC, a multi-class classification. A training set is produced and is used as the training model for test sets and as of the initial results. The model produced an accuracy of 63.54% using SVM with the said default parameters and used 3768 tweets for training set.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications

自引率

0.00%

发文量