用于人格特征预测的埃及阿拉伯语方言预处理

M. Salim, S. Saad, M. Aref
{"title":"用于人格特征预测的埃及阿拉伯语方言预处理","authors":"M. Salim, S. Saad, M. Aref","doi":"10.21608/ijicis.2019.62603","DOIUrl":null,"url":null,"abstract":"Each individual has his own distinct character, making his own decisions which is based on his personality. Researchers in computer science field have tried to reach a model for extracting personality traits relying on user’s profiles on social network sites as an input. Content created by users such as text posts, photos and shared activities in social network sites are considered as a huge source of data. Regarding user-created text, it has been proved that text pre-processing has a great impact if was applied to text before using it in research. In this paper, the effect of pre-processing (stemming and stop word removal) and adding numerical features is tested on the performance of Arabic personality prediction using AraPersonality dataset, which yielded 3.0% and 6.7% overall improvement to baseline experiments in binary representation and multiclass representation respectively","PeriodicalId":244591,"journal":{"name":"International Journal of Intelligent Computing and Information Sciences","volume":"6 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2019-06-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"5","resultStr":"{\"title\":\"PREPROCESSING THE EGYPTIAN ARABIC DIALECT FOR PERSONALITY TRAITS PREDICTION\",\"authors\":\"M. Salim, S. Saad, M. Aref\",\"doi\":\"10.21608/ijicis.2019.62603\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"Each individual has his own distinct character, making his own decisions which is based on his personality. Researchers in computer science field have tried to reach a model for extracting personality traits relying on user’s profiles on social network sites as an input. Content created by users such as text posts, photos and shared activities in social network sites are considered as a huge source of data. Regarding user-created text, it has been proved that text pre-processing has a great impact if was applied to text before using it in research. In this paper, the effect of pre-processing (stemming and stop word removal) and adding numerical features is tested on the performance of Arabic personality prediction using AraPersonality dataset, which yielded 3.0% and 6.7% overall improvement to baseline experiments in binary representation and multiclass representation respectively\",\"PeriodicalId\":244591,\"journal\":{\"name\":\"International Journal of Intelligent Computing and Information Sciences\",\"volume\":\"6 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2019-06-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"5\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"International Journal of Intelligent Computing and Information Sciences\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.21608/ijicis.2019.62603\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"International Journal of Intelligent Computing and Information Sciences","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.21608/ijicis.2019.62603","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 5

摘要

每个人都有自己独特的性格,做出自己的决定,这是基于他的个性。计算机科学领域的研究人员试图建立一种基于用户在社交网站上的个人资料作为输入的人格特征提取模型。用户在社交网站上创建的文本帖子、照片和共享活动等内容被认为是一个巨大的数据来源。对于用户创建的文本,在研究文本之前,已经证明文本预处理对文本有很大的影响。本文利用arpersonpersonality数据集测试了预处理(词干提取和停止词去除)和添加数字特征对阿拉伯语人格预测性能的影响,在二进制表示和多类表示方面分别比基线实验提高了3.0%和6.7%
本文章由计算机程序翻译,如有差异,请以英文原文为准。
PREPROCESSING THE EGYPTIAN ARABIC DIALECT FOR PERSONALITY TRAITS PREDICTION
Each individual has his own distinct character, making his own decisions which is based on his personality. Researchers in computer science field have tried to reach a model for extracting personality traits relying on user’s profiles on social network sites as an input. Content created by users such as text posts, photos and shared activities in social network sites are considered as a huge source of data. Regarding user-created text, it has been proved that text pre-processing has a great impact if was applied to text before using it in research. In this paper, the effect of pre-processing (stemming and stop word removal) and adding numerical features is tested on the performance of Arabic personality prediction using AraPersonality dataset, which yielded 3.0% and 6.7% overall improvement to baseline experiments in binary representation and multiclass representation respectively
求助全文
通过发布文献求助,成功后即可免费获取论文全文。 去求助
来源期刊
自引率
0.00%
发文量
0
×
引用
GB/T 7714-2015
复制
MLA
复制
APA
复制
导出至
BibTeX EndNote RefMan NoteFirst NoteExpress
×
提示
您的信息不完整,为了账户安全,请先补充。
现在去补充
×
提示
您因"违规操作"
具体请查看互助需知
我知道了
×
提示
确定
请完成安全验证×
copy
已复制链接
快去分享给好友吧!
我知道了
右上角分享
点击右上角分享
0
联系我们:info@booksci.cn Book学术提供免费学术资源搜索服务,方便国内外学者检索中英文文献。致力于提供最便捷和优质的服务体验。 Copyright © 2023 布克学术 All rights reserved.
京ICP备2023020795号-1
ghs 京公网安备 11010802042870号
Book学术文献互助
Book学术文献互助群
群 号:604180095
Book学术官方微信