Abdel-Karim Al-Tamimi, A. Shatnawi, Esraa Bani-Issa
{"title":"YouTube评论的阿拉伯语情感分析","authors":"Abdel-Karim Al-Tamimi, A. Shatnawi, Esraa Bani-Issa","doi":"10.1109/AEECT.2017.8257766","DOIUrl":null,"url":null,"abstract":"With the current level of ubiquity of social media websites, obtaining the users preferences automatically became a crucial task to assess their tendencies and behaviors online. Arabic language as one of the most spoken languages in the world and the fastest growing language on the Internet motivates us to provide reliable automated tools that can perform sentiment analysis to reveal users opinions. In this paper, we present our work of Arabic comments classification based on our collected and manually annotated YouTube Arabic comments. We share our classification results utilizing the most commonly used supervised classifiers: SVM-RBF, KNN, and Bernoulli NB classifiers. Experiments were performed using both raw and language-normalized datasets. We show that SVM-RBF outperformed other classification methods with an f-measure of 88.8% using a normalized dataset with two polarities.","PeriodicalId":286127,"journal":{"name":"2017 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT)","volume":"45 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2017-10-01","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"19","resultStr":"{\"title\":\"Arabic sentiment analysis of YouTube comments\",\"authors\":\"Abdel-Karim Al-Tamimi, A. Shatnawi, Esraa Bani-Issa\",\"doi\":\"10.1109/AEECT.2017.8257766\",\"DOIUrl\":null,\"url\":null,\"abstract\":\"With the current level of ubiquity of social media websites, obtaining the users preferences automatically became a crucial task to assess their tendencies and behaviors online. Arabic language as one of the most spoken languages in the world and the fastest growing language on the Internet motivates us to provide reliable automated tools that can perform sentiment analysis to reveal users opinions. In this paper, we present our work of Arabic comments classification based on our collected and manually annotated YouTube Arabic comments. We share our classification results utilizing the most commonly used supervised classifiers: SVM-RBF, KNN, and Bernoulli NB classifiers. Experiments were performed using both raw and language-normalized datasets. We show that SVM-RBF outperformed other classification methods with an f-measure of 88.8% using a normalized dataset with two polarities.\",\"PeriodicalId\":286127,\"journal\":{\"name\":\"2017 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT)\",\"volume\":\"45 1\",\"pages\":\"0\"},\"PeriodicalIF\":0.0000,\"publicationDate\":\"2017-10-01\",\"publicationTypes\":\"Journal Article\",\"fieldsOfStudy\":null,\"isOpenAccess\":false,\"openAccessPdf\":\"\",\"citationCount\":\"19\",\"resultStr\":null,\"platform\":\"Semanticscholar\",\"paperid\":null,\"PeriodicalName\":\"2017 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT)\",\"FirstCategoryId\":\"1085\",\"ListUrlMain\":\"https://doi.org/10.1109/AEECT.2017.8257766\",\"RegionNum\":0,\"RegionCategory\":null,\"ArticlePicture\":[],\"TitleCN\":null,\"AbstractTextCN\":null,\"PMCID\":null,\"EPubDate\":\"\",\"PubModel\":\"\",\"JCR\":\"\",\"JCRName\":\"\",\"Score\":null,\"Total\":0}","platform":"Semanticscholar","paperid":null,"PeriodicalName":"2017 IEEE Jordan Conference on Applied Electrical Engineering and Computing Technologies (AEECT)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/AEECT.2017.8257766","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
With the current level of ubiquity of social media websites, obtaining the users preferences automatically became a crucial task to assess their tendencies and behaviors online. Arabic language as one of the most spoken languages in the world and the fastest growing language on the Internet motivates us to provide reliable automated tools that can perform sentiment analysis to reveal users opinions. In this paper, we present our work of Arabic comments classification based on our collected and manually annotated YouTube Arabic comments. We share our classification results utilizing the most commonly used supervised classifiers: SVM-RBF, KNN, and Bernoulli NB classifiers. Experiments were performed using both raw and language-normalized datasets. We show that SVM-RBF outperformed other classification methods with an f-measure of 88.8% using a normalized dataset with two polarities.