支持向量机在Twitter产品品牌情感分类中的优化

Jao Allen Banados, K. Espinosa
{"title":"支持向量机在Twitter产品品牌情感分类中的优化","authors":"Jao Allen Banados, K. Espinosa","doi":"10.1109/IISA.2014.6878768","DOIUrl":null,"url":null,"abstract":"This paper involves giving a better solution in optimizing Support Vector Machine in classifying sentiments towards a product brand. Sentiment analysis rose to solve the problem of classifying sentiments and classifying as to positive or negative feedback towards a certain product brands. Using the Support Vector Machine learning algorithm, this study aims to improve the algorithm's accuracy through choice of kernel and proper tuning of SVM hyper-parameters as core factors in contributing to SVM accuracy, having a huge amount of training sets in order to widen the hyper plane of vectors and strong support vectors. The sentiments are gathered using the Twitter API and are pre-processed to filter unnecessary words. To be able to use the given tool, the pre-processed sentiments are converted to SVM format. By the given default parameters of the SVM tool used, with radial basis function as kernel type. The SVM type used is C-SVC, a multi-class classification. A training set is produced and is used as the training model for test sets and as of the initial results. The model produced an accuracy of 63.54% using SVM with the said default parameters and used 3768 tweets for training set.","PeriodicalId":298835,"journal":{"name":"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications","volume":"24 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2014-07-07","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"15","resultStr":"{\"title\":\"Optimizing Support Vector Machine in classifying sentiments on product brands from Twitter\",\"authors\":\"Jao Allen Banados, K. Espinosa\",\"doi\":\"10.1109/IISA.2014.6878768\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper involves giving a better solution in optimizing Support Vector Machine in classifying sentiments towards a product brand. Sentiment analysis rose to solve the problem of classifying sentiments and classifying as to positive or negative feedback towards a certain product brands. Using the Support Vector Machine learning algorithm, this study aims to improve the algorithm's accuracy through choice of kernel and proper tuning of SVM hyper-parameters as core factors in contributing to SVM accuracy, having a huge amount of training sets in order to widen the hyper plane of vectors and strong support vectors. The sentiments are gathered using the Twitter API and are pre-processed to filter unnecessary words. To be able to use the given tool, the pre-processed sentiments are converted to SVM format. By the given default parameters of the SVM tool used, with radial basis function as kernel type. The SVM type used is C-SVC, a multi-class classification. A training set is produced and is used as the training model for test sets and as of the initial results. The model produced an accuracy of 63.54% using SVM with the said default parameters and used 3768 tweets for training set.\",\"PeriodicalId\":298835,\"journal\":{\"name\":\"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications\",\"volume\":\"24 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2014-07-07\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"15\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/IISA.2014.6878768\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"IISA 2014, The 5th International Conference on Information, Intelligence, Systems and Applications","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/IISA.2014.6878768","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 15

摘要

本文提出了一种更好的方法来优化支持向量机对产品品牌的情感分类。情绪分析的兴起是为了解决对情绪进行分类和对某一产品品牌的正面或负面反馈进行分类的问题。本研究采用支持向量机学习算法,通过核的选择和SVM超参数的适当调整作为影响SVM精度的核心因素来提高算法的精度,通过大量的训练集来扩大向量和强支持向量的超平面。使用Twitter API收集情感,并进行预处理以过滤不必要的单词。为了能够使用给定的工具,将预处理的情感转换为支持向量机格式。由给定的默认参数所使用的SVM工具,以径向基函数为核类型。使用的支持向量机类型是C-SVC,一种多类分类。生成一个训练集,并将其用作测试集和初始结果的训练模型。该模型使用SVM与上述默认参数,使用3768条tweet作为训练集,得到的准确率为63.54%。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Optimizing Support Vector Machine in classifying sentiments on product brands from Twitter
This paper involves giving a better solution in optimizing Support Vector Machine in classifying sentiments towards a product brand. Sentiment analysis rose to solve the problem of classifying sentiments and classifying as to positive or negative feedback towards a certain product brands. Using the Support Vector Machine learning algorithm, this study aims to improve the algorithm's accuracy through choice of kernel and proper tuning of SVM hyper-parameters as core factors in contributing to SVM accuracy, having a huge amount of training sets in order to widen the hyper plane of vectors and strong support vectors. The sentiments are gathered using the Twitter API and are pre-processed to filter unnecessary words. To be able to use the given tool, the pre-processed sentiments are converted to SVM format. By the given default parameters of the SVM tool used, with radial basis function as kernel type. The SVM type used is C-SVC, a multi-class classification. A training set is produced and is used as the training model for test sets and as of the initial results. The model produced an accuracy of 63.54% using SVM with the said default parameters and used 3768 tweets for training set.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信