Topic and sentiment model applied to the colloquial Arabic: a case study of Maghrebi Arabic

Taoufiq Zarra, R. Chiheb, Rajae Moumen, R. Faizi, A. E. Afia
{"title":"Topic and sentiment model applied to the colloquial Arabic: a case study of Maghrebi Arabic","authors":"Taoufiq Zarra, R. Chiheb, Rajae Moumen, R. Faizi, A. E. Afia","doi":"10.1145/3128128.3128155","DOIUrl":null,"url":null,"abstract":"Recently, the multiplication of communication and sharing platforms such as social networks, personal blogs, forums, etc., has facilitated the expression of views and opinions about products, personalities, and public policy. However, gathering these points of view is a complex task that requires resolution of many problems in different disciplines, especially issues related to our language. Among the research areas, topic modeling and sentiment analysis stimulates interest and curiosity of the scientific community. Lately, the current economic, geo-political and geostrategic trends have made researchers specifically more interested in Arabic language, except that the majority of these studies focus on the classical Arabic; nevertheless it is a language of the elites which is different from what is mainly used on the Web. Our paper focuses on Maghrebi colloquial Arabic since the little research that exists in this area is limited to East colloquial Arabic. On a corpus extracted from different Facebook pages we implemented a supervised approach to extract the sentiments, and an unsupervised approach to extract topic, then we proposed a new, semi-supervised, approach in the Arabic language that combines the topic and the sentiment in a single model, in order to join each topic to a specific sentiment.","PeriodicalId":362403,"journal":{"name":"Proceedings of the 2017 International Conference on Smart Digital Environment","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-07-21","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"12","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"Proceedings of the 2017 International Conference on Smart Digital Environment","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1145/3128128.3128155","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 12

Abstract

Recently, the multiplication of communication and sharing platforms such as social networks, personal blogs, forums, etc., has facilitated the expression of views and opinions about products, personalities, and public policy. However, gathering these points of view is a complex task that requires resolution of many problems in different disciplines, especially issues related to our language. Among the research areas, topic modeling and sentiment analysis stimulates interest and curiosity of the scientific community. Lately, the current economic, geo-political and geostrategic trends have made researchers specifically more interested in Arabic language, except that the majority of these studies focus on the classical Arabic; nevertheless it is a language of the elites which is different from what is mainly used on the Web. Our paper focuses on Maghrebi colloquial Arabic since the little research that exists in this area is limited to East colloquial Arabic. On a corpus extracted from different Facebook pages we implemented a supervised approach to extract the sentiments, and an unsupervised approach to extract topic, then we proposed a new, semi-supervised, approach in the Arabic language that combines the topic and the sentiment in a single model, in order to join each topic to a specific sentiment.
话题和情感模型在阿拉伯语口语中的应用——以马格里布阿拉伯语为例
最近,社交网络、个人博客、论坛等交流和分享平台的激增,为人们表达对产品、个性和公共政策的看法和意见提供了便利。然而,收集这些观点是一项复杂的任务,需要解决不同学科的许多问题,特别是与我们的语言有关的问题。在研究领域中,主题建模和情感分析激发了科学界的兴趣和好奇心。最近,当前的经济、地缘政治和地缘战略趋势使研究人员对阿拉伯语特别感兴趣,但这些研究大多集中在古典阿拉伯语上;然而,它是一种精英的语言,与主要在网络上使用的语言不同。由于对马格里布阿拉伯语口语的研究很少,因此本文的重点是马格里布阿拉伯语口语。在从不同Facebook页面提取的语料库上,我们实施了一种监督方法来提取情感,一种非监督方法来提取主题,然后我们提出了一种新的半监督方法,用阿拉伯语将主题和情感结合在一个模型中,以便将每个主题加入到特定的情感中。
本文章由计算机程序翻译,如有差异,请以英文原文为准。
求助全文
约1分钟内获得全文 求助全文
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:481959085
Book学术官方微信