推文的情感分析和表情化

International Conference on Pioneer and Innovative Studies Pub Date : 2023-06-13 DOI:10.59287/icpis.876

Enes Cerrahoğlu, Pınar Cihan

{"title":"推文的情感分析和表情化","authors":"Enes Cerrahoğlu, Pınar Cihan","doi":"10.59287/icpis.876","DOIUrl":null,"url":null,"abstract":"– Social media platforms have become a prevalent means for individuals to share their emotionsand thoughts. With millions of tweets being posted on Twitter every day, these tweets provide us with avast dataset. Conducting sentiment analysis on this dataset can be a valuable method to obtain meaningfulinsights about societal trends. For this purpose, a sentiment analysis model and a web interface thatemojifies emotions were developed using the Python programming language. This model works on tweetsshared on Twitter and utilizes natural language processing techniques to determine the sentiment of thetweets. In this study, 168.274 English tweets were collected using the Twitter API. The collected tweetsunderwent a cleaning process where URLs, hashtags, mentions, and emojis were removed. Then, theTextBlob Python library was employed to label the tweets as positive, negative, or neutral. The labeledtweets were subjected to classification accuracy testing using Gradient Boosting, Logistic Regression,Naive Bayes, Random Forest, and Support Vector Machines machine learning models. The findingsrevealed that logistic regression achieved the highest classification accuracy with 94%. Lastly, a webinterface was developed, which retrieves the last 50 tweets of a queried user's profile and appends a relevantemoji based on the sentiment of each tweet.","PeriodicalId":292916,"journal":{"name":"International Conference on Pioneer and Innovative Studies","volume":"36 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2023-06-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":"{\"title\":\"Sentiment Analysis and Emojification of Tweets\",\"authors\":\"Enes Cerrahoğlu, Pınar Cihan\",\"doi\":\"10.59287/icpis.876\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"– Social media platforms have become a prevalent means for individuals to share their emotionsand thoughts. With millions of tweets being posted on Twitter every day, these tweets provide us with avast dataset. Conducting sentiment analysis on this dataset can be a valuable method to obtain meaningfulinsights about societal trends. For this purpose, a sentiment analysis model and a web interface thatemojifies emotions were developed using the Python programming language. This model works on tweetsshared on Twitter and utilizes natural language processing techniques to determine the sentiment of thetweets. In this study, 168.274 English tweets were collected using the Twitter API. The collected tweetsunderwent a cleaning process where URLs, hashtags, mentions, and emojis were removed. Then, theTextBlob Python library was employed to label the tweets as positive, negative, or neutral. The labeledtweets were subjected to classification accuracy testing using Gradient Boosting, Logistic Regression,Naive Bayes, Random Forest, and Support Vector Machines machine learning models. The findingsrevealed that logistic regression achieved the highest classification accuracy with 94%. Lastly, a webinterface was developed, which retrieves the last 50 tweets of a queried user's profile and appends a relevantemoji based on the sentiment of each tweet.\",\"PeriodicalId\":292916,\"journal\":{\"name\":\"International Conference on Pioneer and Innovative Studies\",\"volume\":\"36 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2023-06-13\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"0\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Conference on Pioneer and Innovative Studies\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.59287/icpis.876\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Conference on Pioneer and Innovative Studies","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.59287/icpis.876","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}

引用次数: 0

摘要

——社交媒体平台已成为个人分享情感和想法的普遍手段。Twitter上每天发布数百万条推文，这些推文为我们提供了一个庞大的数据集。对该数据集进行情感分析可以是获得有关社会趋势的有意义见解的有价值的方法。为此，使用Python编程语言开发了情感分析模型和情感表情化web界面。该模型适用于Twitter上分享的推文，并利用自然语言处理技术来确定推文的情绪。在本研究中，使用Twitter API收集了168.274条英文tweets。收集到的推文经过了清理过程，url、标签、提及和表情符号都被删除了。然后，使用textblob Python库将tweet标记为积极、消极或中性。使用梯度增强、逻辑回归、朴素贝叶斯、随机森林和支持向量机机器学习模型对标记的推文进行分类准确性测试。研究结果显示，逻辑回归达到了最高的分类准确率，达到94%。最后，开发了一个web界面，该界面可以检索被查询用户个人资料的最近50条推文，并根据每条推文的情感添加相关的符号。

本文章由计算机程序翻译，如有差异，请以英文原文为准。

查看原文本刊更多论文

Sentiment Analysis and Emojification of Tweets

– Social media platforms have become a prevalent means for individuals to share their emotionsand thoughts. With millions of tweets being posted on Twitter every day, these tweets provide us with avast dataset. Conducting sentiment analysis on this dataset can be a valuable method to obtain meaningfulinsights about societal trends. For this purpose, a sentiment analysis model and a web interface thatemojifies emotions were developed using the Python programming language. This model works on tweetsshared on Twitter and utilizes natural language processing techniques to determine the sentiment of thetweets. In this study, 168.274 English tweets were collected using the Twitter API. The collected tweetsunderwent a cleaning process where URLs, hashtags, mentions, and emojis were removed. Then, theTextBlob Python library was employed to label the tweets as positive, negative, or neutral. The labeledtweets were subjected to classification accuracy testing using Gradient Boosting, Logistic Regression,Naive Bayes, Random Forest, and Support Vector Machines machine learning models. The findingsrevealed that logistic regression achieved the highest classification accuracy with 94%. Lastly, a webinterface was developed, which retrieves the last 50 tweets of a queried user's profile and appends a relevantemoji based on the sentiment of each tweet.

求助全文

通过发布文献求助，成功后即可免费获取论文全文。去求助

来源期刊

International Conference on Pioneer and Innovative Studies

自引率

0.00%

发文量