基于否定范围识别方法的增强Twitter情感分析模型

Monir Yahya Ali Salmony, Arman Rasool Faridi
{"title":"基于否定范围识别方法的增强Twitter情感分析模型","authors":"Monir Yahya Ali Salmony, Arman Rasool Faridi","doi":"10.1109/INDIACom51348.2021.00155","DOIUrl":null,"url":null,"abstract":"Sentiment Analysis (SA), which is also known as Opinion Mining, is a hot-fastest growing research area, making it challenging to follow all its activities. It intends to study peoples' thoughts, feelings, and attitudes about topics, events, issues, entities, individuals, and attributes in social media (e.g., social networking sites, forums, blogs, etc.) expressed by either text comments or tweets. Twitter is one of the world's largest online microblogging platforms that allows its users to freely post texts called tweets. It offers a wealth of information, therefore utilizing SA to analyze this information into positive or negative will assist organizations' and customers' decision-making that will have a significant impact on daily life. SA draws the attention of scientific research in the Natural Language Processing community due to the text structure challenges that may contain negation. Negation is a widespread linguistic structure that changes the text meaning to the opposite and affects text polarity. Therefore, it needs to be considered in sentiment analysis systems. In this paper, supervised machine learning models have been used as a baseline to categorize the sentiment of a Twitter dataset using (Bag of Words) and (Term Frequency Inverse Document Frequency) feature representation methods. Then we applied negation scope identification methods to find negated tokens and investigate how embedding these tokens can raise SA classifiers' accuracy. The results of the sentiment classification task show an improvement once considering these tokens.","PeriodicalId":415594,"journal":{"name":"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)","volume":"17 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-03-17","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":"{\"title\":\"An Enhanced Twitter Sentiment Analysis Model using Negation Scope Identification Methods\",\"authors\":\"Monir Yahya Ali Salmony, Arman Rasool Faridi\",\"doi\":\"10.1109/INDIACom51348.2021.00155\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Sentiment Analysis (SA), which is also known as Opinion Mining, is a hot-fastest growing research area, making it challenging to follow all its activities. It intends to study peoples' thoughts, feelings, and attitudes about topics, events, issues, entities, individuals, and attributes in social media (e.g., social networking sites, forums, blogs, etc.) expressed by either text comments or tweets. Twitter is one of the world's largest online microblogging platforms that allows its users to freely post texts called tweets. It offers a wealth of information, therefore utilizing SA to analyze this information into positive or negative will assist organizations' and customers' decision-making that will have a significant impact on daily life. SA draws the attention of scientific research in the Natural Language Processing community due to the text structure challenges that may contain negation. Negation is a widespread linguistic structure that changes the text meaning to the opposite and affects text polarity. Therefore, it needs to be considered in sentiment analysis systems. In this paper, supervised machine learning models have been used as a baseline to categorize the sentiment of a Twitter dataset using (Bag of Words) and (Term Frequency Inverse Document Frequency) feature representation methods. Then we applied negation scope identification methods to find negated tokens and investigate how embedding these tokens can raise SA classifiers' accuracy. The results of the sentiment classification task show an improvement once considering these tokens.\",\"PeriodicalId\":415594,\"journal\":{\"name\":\"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)\",\"volume\":\"17 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2021-03-17\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"4\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/INDIACom51348.2021.00155\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 8th International Conference on Computing for Sustainable Global Development (INDIACom)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/INDIACom51348.2021.00155","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4

摘要

情感分析(SA),也被称为意见挖掘,是一个发展最快的热门研究领域,这使得跟踪其所有活动具有挑战性。它旨在研究人们对社交媒体(如社交网站、论坛、博客等)中的话题、事件、问题、实体、个人和属性的想法、感受和态度,这些想法、感受和态度可以通过文本评论或tweet来表达。Twitter是世界上最大的在线微博平台之一,它允许用户自由发布被称为tweet的文本。它提供了丰富的信息,因此利用情景分析将这些信息分析为积极或消极将有助于组织和客户的决策,这将对日常生活产生重大影响。由于可能包含否定的文本结构挑战,SA引起了自然语言处理界的科学研究的关注。否定是一种广泛存在的语言结构,它使语篇意义向相反的方向转变,影响语篇极性。因此,在情感分析系统中需要考虑它。在本文中,使用监督机器学习模型作为基线,使用(Bag of Words)和(Term Frequency Inverse Document Frequency)特征表示方法对Twitter数据集的情感进行分类。然后,我们应用否定范围识别方法来寻找否定令牌,并研究如何嵌入这些令牌来提高SA分类器的准确率。考虑到这些标记,情感分类任务的结果显示出改进。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
An Enhanced Twitter Sentiment Analysis Model using Negation Scope Identification Methods
Sentiment Analysis (SA), which is also known as Opinion Mining, is a hot-fastest growing research area, making it challenging to follow all its activities. It intends to study peoples' thoughts, feelings, and attitudes about topics, events, issues, entities, individuals, and attributes in social media (e.g., social networking sites, forums, blogs, etc.) expressed by either text comments or tweets. Twitter is one of the world's largest online microblogging platforms that allows its users to freely post texts called tweets. It offers a wealth of information, therefore utilizing SA to analyze this information into positive or negative will assist organizations' and customers' decision-making that will have a significant impact on daily life. SA draws the attention of scientific research in the Natural Language Processing community due to the text structure challenges that may contain negation. Negation is a widespread linguistic structure that changes the text meaning to the opposite and affects text polarity. Therefore, it needs to be considered in sentiment analysis systems. In this paper, supervised machine learning models have been used as a baseline to categorize the sentiment of a Twitter dataset using (Bag of Words) and (Term Frequency Inverse Document Frequency) feature representation methods. Then we applied negation scope identification methods to find negated tokens and investigate how embedding these tokens can raise SA classifiers' accuracy. The results of the sentiment classification task show an improvement once considering these tokens.
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信