{"title":"Sentiment Analysis Data Annotation Across Arabic Dialects","authors":"Samar Assem","doi":"10.21608/sjam.2024.284222.2297","DOIUrl":null,"url":null,"abstract":"Nowadays, with a vast amount of data being generated in the internet and social media platforms, there is much focus on sentiment analysis as it helps to extract data ultimately to analyze people’s opinions. However, much of the research on this topic has been regarding the English language with less attention being paid to other common languages such as Arabic. That is why there is a noticeable gap in research especially when considering the multitude of dialects in the Arabic language. This multitude of dialects is considered to be a double-edged weapon as it offers a rich amount of data for researchers from one hand and it stands as a challenge in front of the annotators and the researchers from another. Accordingly, this article aims to carry out a critical review of the recent studies conducted regarding sentiment analysis and data annotation for Arabic dialects that have been published in the last 10 years. This review offers a taxonomy of data preprocessing and annotation methods. Moreover, it displays the challenges, motivations and recommendations in addition to an in-depth analysis of the current trends in the field of Arabic dialects and sentiment analysis. Accordingly, this literature review postulates new research gaps and future directions to drive more scholars into contributing to Arabic SA research. Finally, the contribution of the current study aims at making more successful multilingual sentiment analysis implementations in addition to providing some understanding of ASA in a plenty of settings.","PeriodicalId":228048,"journal":{"name":"مجلة بحوث کلية الآداب . جامعة المنوفية","volume":"15 15","pages":""},"PeriodicalIF":0.0000,"publicationDate":"2024-07-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"0","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"مجلة بحوث کلية الآداب . جامعة المنوفية","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21608/sjam.2024.284222.2297","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 0
Abstract
Nowadays, with a vast amount of data being generated in the internet and social media platforms, there is much focus on sentiment analysis as it helps to extract data ultimately to analyze people’s opinions. However, much of the research on this topic has been regarding the English language with less attention being paid to other common languages such as Arabic. That is why there is a noticeable gap in research especially when considering the multitude of dialects in the Arabic language. This multitude of dialects is considered to be a double-edged weapon as it offers a rich amount of data for researchers from one hand and it stands as a challenge in front of the annotators and the researchers from another. Accordingly, this article aims to carry out a critical review of the recent studies conducted regarding sentiment analysis and data annotation for Arabic dialects that have been published in the last 10 years. This review offers a taxonomy of data preprocessing and annotation methods. Moreover, it displays the challenges, motivations and recommendations in addition to an in-depth analysis of the current trends in the field of Arabic dialects and sentiment analysis. Accordingly, this literature review postulates new research gaps and future directions to drive more scholars into contributing to Arabic SA research. Finally, the contribution of the current study aims at making more successful multilingual sentiment analysis implementations in addition to providing some understanding of ASA in a plenty of settings.
如今,随着互联网和社交媒体平台产生大量数据,情感分析备受关注,因为它有助于提取数据,最终分析人们的观点。然而,有关这一主题的研究大多涉及英语,对阿拉伯语等其他常用语言的关注较少。这就是为什么在研究方面存在明显差距的原因,尤其是在考虑到阿拉伯语的多种方言时。方言的多样性被认为是一把双刃剑,一方面为研究人员提供了丰富的数据,另一方面也对注释者和研究人员提出了挑战。因此,本文旨在对过去 10 年中发表的有关阿拉伯语方言情感分析和数据注释的最新研究进行批判性回顾。本综述对数据预处理和注释方法进行了分类。此外,除了对阿拉伯语方言和情感分析领域的当前趋势进行深入分析外,它还展示了挑战、动机和建议。因此,本文献综述提出了新的研究缺口和未来方向,以推动更多学者为阿拉伯语 SA 研究做出贡献。最后,本研究的贡献在于,除了让人们对大量环境中的阿拉伯语情感分析有一定的了解外,还能更成功地实施多语言情感分析。