Machine Learning for Sentiment Analysis Utilizing Social Media

2023 2nd International Conference on Edge Computing and Applications (ICECAA) Pub Date : 2023-07-19 DOI:10.1109/ICECAA58104.2023.10212135

M. Arumugam, Snegaa S R, C. Jayanthi

{"title":"Machine Learning for Sentiment Analysis Utilizing Social Media","authors":"M. Arumugam, Snegaa S R, C. Jayanthi","doi":"10.1109/ICECAA58104.2023.10212135","DOIUrl":null,"url":null,"abstract":"Sentimental analysis is a crucial step in natural language processing that aids in figuring out whether a text has a positive, negative, or neutral sentiment. In this experiment, we analyzed the sentiments expressed in tweets that included text, emojis, and emoticons. To categorize the tweets into different sentiments, we utilized four different algorithms: Multinomial Naive Bayes (MNB),Random Forest, Support Vector Machine (SVM) and Decision Tree. In order to increase the model's accuracy, we also combined the predictions from the four algorithms using the Voting Classifier, an ensemble learning technique. To preprocess the data, we used various techniques, such as removing stop words, stemming, and converting emojis and emoticons to their corresponding text representations. The performance of each algorithm was then trained on the preprocessed data using various assessment measures, including accuracy, precision, F1-score and recall. The SVM method fared better than the other algorithms, obtaining an accuracy of 96.27%, according to the data. Furthermore, we applied ensemble learning techniques, such as bagging to improve the performance of all the four algorithms. We also used the Voting Classifier to combine the predictions of the bagging models to further improve the accuracy of the model. The results revealed that the accuracy was increased to 97.21% by combining the bagging and voting classifiers. Overall, the project demonstrates the effectiveness of various algorithms and ensemble learning methods in performing sentimental analysis on tweets containing text, emojis, and emoticons.","PeriodicalId":114624,"journal":{"name":"2023 2nd International Conference on Edge Computing and Applications (ICECAA)","volume":"74 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-07-19","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2023 2nd International Conference on Edge Computing and Applications (ICECAA)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICECAA58104.2023.10212135","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

Abstract

Sentimental analysis is a crucial step in natural language processing that aids in figuring out whether a text has a positive, negative, or neutral sentiment. In this experiment, we analyzed the sentiments expressed in tweets that included text, emojis, and emoticons. To categorize the tweets into different sentiments, we utilized four different algorithms: Multinomial Naive Bayes (MNB),Random Forest, Support Vector Machine (SVM) and Decision Tree. In order to increase the model's accuracy, we also combined the predictions from the four algorithms using the Voting Classifier, an ensemble learning technique. To preprocess the data, we used various techniques, such as removing stop words, stemming, and converting emojis and emoticons to their corresponding text representations. The performance of each algorithm was then trained on the preprocessed data using various assessment measures, including accuracy, precision, F1-score and recall. The SVM method fared better than the other algorithms, obtaining an accuracy of 96.27%, according to the data. Furthermore, we applied ensemble learning techniques, such as bagging to improve the performance of all the four algorithms. We also used the Voting Classifier to combine the predictions of the bagging models to further improve the accuracy of the model. The results revealed that the accuracy was increased to 97.21% by combining the bagging and voting classifiers. Overall, the project demonstrates the effectiveness of various algorithms and ensemble learning methods in performing sentimental analysis on tweets containing text, emojis, and emoticons.

查看原文本刊更多论文

利用社交媒体进行情感分析的机器学习

情感分析是自然语言处理的关键一步，它有助于确定文本的情绪是积极的、消极的还是中性的。在这个实验中，我们分析了推文中表达的情绪，包括文本、表情符号和表情符号。为了将推文分类为不同的情绪，我们使用了四种不同的算法:多项朴素贝叶斯(MNB)、随机森林、支持向量机(SVM)和决策树。为了提高模型的准确性，我们还使用投票分类器(一种集成学习技术)将四种算法的预测结合起来。为了预处理数据，我们使用了各种技术，例如删除停止词、词干提取以及将表情符号和表情符号转换为相应的文本表示。然后使用各种评估指标(包括准确性、精密度、f1分数和召回率)对每种算法的性能进行预处理数据训练。数据显示，SVM方法的准确率为96.27%，优于其他算法。此外，我们应用了集成学习技术，如bagging来提高所有四种算法的性能。我们还使用投票分类器将bagging模型的预测结合起来，进一步提高了模型的准确性。结果表明，将套袋分类器与投票分类器相结合，准确率提高到97.21%。总体而言，该项目展示了各种算法和集成学习方法在对包含文本、表情符号和表情符号的推文进行情感分析方面的有效性。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

求助全文

约1分钟内获得全文求助全文

来源期刊

2023 2nd International Conference on Edge Computing and Applications (ICECAA)

自引率

0.00%

发文量