使用贝叶斯和支持向量机分类器对英语短语和短信进行分类

J. Maier, K. Ferens
{"title":"使用贝叶斯和支持向量机分类器对英语短语和短信进行分类","authors":"J. Maier, K. Ferens","doi":"10.1109/CCECE.2009.5090166","DOIUrl":null,"url":null,"abstract":"This paper performs a comparative analysis of several different types of SMS text classifiers: weight enhanced Multinomial naive Bayes, Poisson naive Bayes, and L2-loss Support Vector Machine. The effects of preprocessing and incorporating additional features on the classifiers were examined. The preliminary experimental results show that the use of preprocessing and incorporating additional features produced no significant gain or loss in classification efficiency. However the feature space used by the classification methods decreased, which could be beneficial for resource limited environments. In addition the solutions to the SMS text classification may be applied to other problems, like the classification of English sentences. Our collection of text messages may not be statistically significant, because of very limited sources for text messages.","PeriodicalId":153464,"journal":{"name":"2009 Canadian Conference on Electrical and Computer Engineering","volume":"13 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2009-05-03","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"3","resultStr":"{\"title\":\"Classification of english phrases and SMS text messages using Bayes and Support Vector Machine classifiers\",\"authors\":\"J. Maier, K. Ferens\",\"doi\":\"10.1109/CCECE.2009.5090166\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"This paper performs a comparative analysis of several different types of SMS text classifiers: weight enhanced Multinomial naive Bayes, Poisson naive Bayes, and L2-loss Support Vector Machine. The effects of preprocessing and incorporating additional features on the classifiers were examined. The preliminary experimental results show that the use of preprocessing and incorporating additional features produced no significant gain or loss in classification efficiency. However the feature space used by the classification methods decreased, which could be beneficial for resource limited environments. In addition the solutions to the SMS text classification may be applied to other problems, like the classification of English sentences. Our collection of text messages may not be statistically significant, because of very limited sources for text messages.\",\"PeriodicalId\":153464,\"journal\":{\"name\":\"2009 Canadian Conference on Electrical and Computer Engineering\",\"volume\":\"13 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2009-05-03\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"3\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2009 Canadian Conference on Electrical and Computer Engineering\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/CCECE.2009.5090166\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2009 Canadian Conference on Electrical and Computer Engineering","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/CCECE.2009.5090166","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 3

摘要

本文对几种不同类型的SMS文本分类器进行了比较分析:权重增强多项式朴素贝叶斯、泊松朴素贝叶斯和L2-loss支持向量机。研究了预处理和附加特征对分类器的影响。初步的实验结果表明,使用预处理和加入附加特征对分类效率没有显著的增益或损失。然而,分类方法使用的特征空间减少了,这对于资源有限的环境是有利的。此外,SMS文本分类的解决方案也可以应用于其他问题,如英语句子的分类。由于短信的来源非常有限,我们收集的短信在统计上可能并不显著。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
Classification of english phrases and SMS text messages using Bayes and Support Vector Machine classifiers
This paper performs a comparative analysis of several different types of SMS text classifiers: weight enhanced Multinomial naive Bayes, Poisson naive Bayes, and L2-loss Support Vector Machine. The effects of preprocessing and incorporating additional features on the classifiers were examined. The preliminary experimental results show that the use of preprocessing and incorporating additional features produced no significant gain or loss in classification efficiency. However the feature space used by the classification methods decreased, which could be beneficial for resource limited environments. In addition the solutions to the SMS text classification may be applied to other problems, like the classification of English sentences. Our collection of text messages may not be statistically significant, because of very limited sources for text messages.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信