Ahmed El-Sayed, Shaimaa Y. Lazem, Mohamed M. Abougabal
{"title":"An Arabic Egyptian Dialect COVID-19 Twitter Dataset (ArECTD)","authors":"Ahmed El-Sayed, Shaimaa Y. Lazem, Mohamed M. Abougabal","doi":"10.1109/JAC-ECC54461.2021.9691451","DOIUrl":null,"url":null,"abstract":"Citizens are increasingly expressing their ideas and feelings on social media platforms such as Twitter. During the coronavirus crisis, numerous emotions are exposed, including sadness, anger, fear, sympathy, surprise, etc. The Arabic Egyptian Dialect COVID-19 Twitter Dataset (ArECTD), comprised of 78K tweets, was collected in the period from the 1st of January 2020 till the 30th of May 2021 focusing on the Egyptian dialect. It was annotated using a combination of manual and a semi-supervised self-learning technique. The tweets of ArECTD were categorized into 10 emotions (sarcasm, sadness, anger, fear, sympathy, joy, hope, surprise, love, and none). Emotion analysis of this dataset could help decision makers understand and respond to the public reactions during the pandemic.","PeriodicalId":354908,"journal":{"name":"2021 9th International Japan-Africa Conference on Electronics, Communications, and Computations (JAC-ECC)","volume":"1 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-12-13","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"2","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 9th International Japan-Africa Conference on Electronics, Communications, and Computations (JAC-ECC)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/JAC-ECC54461.2021.9691451","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 2
Abstract
Citizens are increasingly expressing their ideas and feelings on social media platforms such as Twitter. During the coronavirus crisis, numerous emotions are exposed, including sadness, anger, fear, sympathy, surprise, etc. The Arabic Egyptian Dialect COVID-19 Twitter Dataset (ArECTD), comprised of 78K tweets, was collected in the period from the 1st of January 2020 till the 30th of May 2021 focusing on the Egyptian dialect. It was annotated using a combination of manual and a semi-supervised self-learning technique. The tweets of ArECTD were categorized into 10 emotions (sarcasm, sadness, anger, fear, sympathy, joy, hope, surprise, love, and none). Emotion analysis of this dataset could help decision makers understand and respond to the public reactions during the pandemic.