Workshop on Arabic Natural Language Processing: Latest Publications

SAIDS: A Novel Approach for Sentiment Analysis Informed of Dialect and Sarcasm

Workshop on Arabic Natural Language Processing Pub Date: 2023-01-06 DOI: 10.48550/arXiv.2301.02521

Abdelrahman Kaseb, Mona Farouk

Abstract: Sentiment analysis has become an essential part of every social network, as it enables decision-makers to learn more about users’ opinions in almost all aspects of life. Despite its importance, it faces several problems, and the sentiment of sarcastic text is one of its main challenges. This paper tackles that challenge by introducing a novel system (SAIDS) that predicts the sentiment, sarcasm, and dialect of Arabic tweets. SAIDS uses its predictions of sarcasm and dialect as known information when predicting sentiment. It uses MARBERT as a language model to generate a sentence embedding, passes the embedding to the sarcasm and dialect models, and then concatenates the outputs of the three models and passes them to the sentiment analysis model. Multiple system design setups were tried and are reported. SAIDS was applied to the ArSarcasm-v2 dataset, where it outperforms the state-of-the-art model on the sentiment analysis task. By training all tasks together, SAIDS achieves 75.98 FPN, 59.09 F1-score, and 71.13 F1-score for sentiment analysis, sarcasm detection, and dialect identification, respectively. The system design can be used to improve the performance of any task that depends on other tasks.
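The data flow described in the SAIDS abstract above can be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the class name `SaidsSketch`, the helper functions, the tiny embedding size, the random weights, and the class counts (2 sarcasm, 5 dialect, 3 sentiment) are all assumptions made for the example, and a plain list of floats stands in for the MARBERT sentence embedding.

```python
import random


def init_layer(n_in, n_out, rng):
    """Create a dense layer as (weights, bias) with small random weights."""
    weights = [[rng.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]
    return weights, [0.0] * n_out


def linear(weights, bias, x):
    """Apply a dense layer: one output value per weight row."""
    return [sum(w * v for w, v in zip(row, x)) + b for row, b in zip(weights, bias)]


class SaidsSketch:
    # All sizes below are illustrative assumptions, not values from the paper.
    EMBED_DIM = 8      # stand-in for the MARBERT sentence embedding size
    N_SARCASM = 2      # assumed: sarcastic / not sarcastic
    N_DIALECT = 5      # assumed number of dialect classes
    N_SENTIMENT = 3    # assumed: positive / negative / neutral

    def __init__(self, seed=0):
        rng = random.Random(seed)
        self.sarcasm_head = init_layer(self.EMBED_DIM, self.N_SARCASM, rng)
        self.dialect_head = init_layer(self.EMBED_DIM, self.N_DIALECT, rng)
        # The sentiment head sees the embedding plus both auxiliary outputs.
        joint_dim = self.EMBED_DIM + self.N_SARCASM + self.N_DIALECT
        self.sentiment_head = init_layer(joint_dim, self.N_SENTIMENT, rng)

    def forward(self, embedding):
        if len(embedding) != self.EMBED_DIM:
            raise ValueError("embedding has the wrong length")
        # Step 1: the sentence embedding feeds the sarcasm and dialect models.
        sarcasm = linear(*self.sarcasm_head, embedding)
        dialect = linear(*self.dialect_head, embedding)
        # Step 2: concatenate the three outputs and predict sentiment from them.
        joint = list(embedding) + sarcasm + dialect
        sentiment = linear(*self.sentiment_head, joint)
        return {"sarcasm": sarcasm, "dialect": dialect, "sentiment": sentiment}


if __name__ == "__main__":
    model = SaidsSketch(seed=0)
    scores = model.forward([0.5] * SaidsSketch.EMBED_DIM)
    print({task: len(values) for task, values in scores.items()})
```

Because the sentiment head consumes the sarcasm and dialect outputs, training all three heads together lets the sentiment loss also shape the auxiliary predictions, which is the dependency the abstract refers to.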

Citations: 1

End-to-End Speech Translation of Arabic to English Broadcast News

Workshop on Arabic Natural Language Processing Pub Date: 2022-12-11 DOI: 10.48550/arXiv.2212.05479

Fethi Bougares, Salim Jouili

Citations: 0

IITD at WANLP 2022 Shared Task: Multilingual Multi-Granularity Network for Propaganda Detection

Workshop on Arabic Natural Language Processing Pub Date: 2022-10-31 DOI: 10.48550/arXiv.2210.17190

Shubham Mittal, Preslav Nakov

Citations: 3

Maknuune: A Large Open Palestinian Arabic Lexicon

Workshop on Arabic Natural Language Processing Pub Date: 2022-10-24 DOI: 10.48550/arXiv.2210.12985

Shahd Dibas, Christian Khairallah, Nizar Habash, Omar Fayez Sadi, Tariq Sairafy, Karmel Sarabta, Abrar Ardah

Citations: 0

The Shared Task on Gender Rewriting

Workshop on Arabic Natural Language Processing Pub Date: 2022-10-22 DOI: 10.48550/arXiv.2210.12410

Bashar Alhafni, Nizar Habash, Houda Bouamor, Ossama Obeid, Sultan Alrowili, D. Alzeer, Khawla AlShanqiti, Ahmed Elbakry, Muhammad N. ElNokrashy, Mohamed Gabr, Abderrahmane Issam, Abdel-Naser Qaddoumi, K. Vijay-Shanker, Mahmoud Zyate

Citations: 1

Joint Coreference Resolution for Zeros and non-Zeros in Arabic

Workshop on Arabic Natural Language Processing Pub Date: 2022-10-21 DOI: 10.48550/arXiv.2210.12169

Abdulrahman Aloraini, Sameer Pradhan, Massimo Poesio

Citations: 1

MANorm: A Normalization Dictionary for Moroccan Arabic Dialect Written in Latin Script

Workshop on Arabic Natural Language Processing Pub Date: 2022-06-18 DOI: 10.48550/arXiv.2206.09167

Randa Zarnoufi, H. Jaafar, Walid Bachri, Mounia Abik

Citations: 2

Emoji Sentiment Roles for Sentiment Analysis: A Case Study in Arabic Texts

Workshop on Arabic Natural Language Processing Pub Date : 1900-01-01 DOI: 10.18653/v1/2022.wanlp-1.32

Shatha Ali A. Hakami, R. Hendley, Phillip Smith

Citations: 3

Weakly and Semi-Supervised Learning for Arabic Text Classification using Monodialectal Language Models

Workshop on Arabic Natural Language Processing Pub Date : 1900-01-01 DOI: 10.18653/v1/2022.wanlp-1.24

Reem AlYami, Rabah A. Al-Zaidy

Abstract: The lack of resources such as annotated datasets and tools for low-resource languages is a significant obstacle to the advancement of Natural Language Processing (NLP) applications targeting users who speak these languages. Although learning techniques such as semi-supervised and weakly supervised learning are effective for text classification when annotated data is limited, they are still not widely investigated in many languages because data overall, both labeled and unlabeled, is scarce. In this study, we apply both weakly and semi-supervised learning approaches to text classification in low-resource languages and address the underlying limitations that can hinder the effectiveness of these techniques. To that end, we propose a suite of language-agnostic techniques for large-scale data collection, automatic data annotation, and language model training in scenarios where resources are scarce. Specifically, we propose a novel data collection pipeline for under-represented languages, or dialects, that is language- and task-agnostic and produces data of sufficient size for training a language model capable of achieving competitive results on common NLP tasks, as our experiments show. The models will be shared with the research community.

Citations: 0

iCompass Working Notes for the Nuanced Arabic Dialect Identification Shared Task

Workshop on Arabic Natural Language Processing Pub Date : 1900-01-01 DOI: 10.18653/v1/2022.wanlp-1.41

Abir Messaoudi, Chayma Fourati, H. Haddad, Moez BenHajhmida

Citations: 4