Md Deloar Hossan Jasy, Sakib Al Hasan, Md Ibrahim Khalil Sagor, Abdullah M. Noman, J. Ji
{"title":"基于支持向量机、KNN和朴素贝叶斯的情感分类性能评价","authors":"Md Deloar Hossan Jasy, Sakib Al Hasan, Md Ibrahim Khalil Sagor, Abdullah M. Noman, J. Ji","doi":"10.1109/contesa52813.2021.9657115","DOIUrl":null,"url":null,"abstract":"The rising use of the internet and social networks has opened up new avenues for individuals to express themselves. It’s also a platform with a plethora of information where an individual can see other people’s thoughts, which are diverged into numerous sentiment categories and are slowly becoming a primary part of the decision. This study makes a significant contribution to sentiment classification, which is effective in determining data in a big amount of tweets with de-contextualized sentiments which are often positive or negative, or in the middle. To accomplish this, we initially pre-processed the raw data, and then draw out the meaningful words and phrases (characteristic vector), then picked the characteristic vector list, and then applied machine-learning classification methods including Naive Bayes, KNN, and SVM. And at last, we assessed the classifier’s performance using the terms recall, accuracy, and precision, as well as the F1-score. Support Vector Machine has the highest accuracy of 92 percent, followed by KNN and Naive Bayes with 88 and 85 percent accuracy, respectively.","PeriodicalId":323624,"journal":{"name":"2021 International Conference on Computing, Networking, Telecommunications & Engineering Sciences Applications (CoNTESA)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-09","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"1","resultStr":"{\"title\":\"A Performance Evaluation of Sentiment Classification Applying SVM, KNN, and Naive Bayes\",\"authors\":\"Md Deloar Hossan Jasy, Sakib Al Hasan, Md Ibrahim Khalil Sagor, Abdullah M. Noman, J. Ji\",\"doi\":\"10.1109/contesa52813.2021.9657115\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"The rising use of the internet and social networks has opened up new avenues for individuals to express themselves. It’s also a platform with a plethora of information where an individual can see other people’s thoughts, which are diverged into numerous sentiment categories and are slowly becoming a primary part of the decision. This study makes a significant contribution to sentiment classification, which is effective in determining data in a big amount of tweets with de-contextualized sentiments which are often positive or negative, or in the middle. To accomplish this, we initially pre-processed the raw data, and then draw out the meaningful words and phrases (characteristic vector), then picked the characteristic vector list, and then applied machine-learning classification methods including Naive Bayes, KNN, and SVM. And at last, we assessed the classifier’s performance using the terms recall, accuracy, and precision, as well as the F1-score. Support Vector Machine has the highest accuracy of 92 percent, followed by KNN and Naive Bayes with 88 and 85 percent accuracy, respectively.\",\"PeriodicalId\":323624,\"journal\":{\"name\":\"2021 International Conference on Computing, Networking, Telecommunications & Engineering Sciences Applications (CoNTESA)\",\"volume\":\"1 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-12-09\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"1\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 International Conference on Computing, Networking, Telecommunications & Engineering Sciences Applications (CoNTESA)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/contesa52813.2021.9657115\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 International Conference on Computing, Networking, Telecommunications & Engineering Sciences Applications (CoNTESA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/contesa52813.2021.9657115","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
A Performance Evaluation of Sentiment Classification Applying SVM, KNN, and Naive Bayes
The rising use of the internet and social networks has opened up new avenues for individuals to express themselves. It’s also a platform with a plethora of information where an individual can see other people’s thoughts, which are diverged into numerous sentiment categories and are slowly becoming a primary part of the decision. This study makes a significant contribution to sentiment classification, which is effective in determining data in a big amount of tweets with de-contextualized sentiments which are often positive or negative, or in the middle. To accomplish this, we initially pre-processed the raw data, and then draw out the meaningful words and phrases (characteristic vector), then picked the characteristic vector list, and then applied machine-learning classification methods including Naive Bayes, KNN, and SVM. And at last, we assessed the classifier’s performance using the terms recall, accuracy, and precision, as well as the F1-score. Support Vector Machine has the highest accuracy of 92 percent, followed by KNN and Naive Bayes with 88 and 85 percent accuracy, respectively.