{"title":"Multi-Topic Labelling Classification Based on LSTM","authors":"Duha AlBatayha","doi":"10.1109/ICICS52457.2021.9464531","DOIUrl":null,"url":null,"abstract":"The necessity for automatic classification of some resources has become extremely important given the fast-increasing number of electronic resources. People's opinions have been extracted from social media sites using Artificial Intelligence (AI). Despite this, the majority of current research focuses on extrapolating features from texts. Multi-label textual data classification is a significant problem in terms of the increasing amount of data available and the growing difficulties of assigning each text piece with one label. Examples include news and email articles. This work focuses on multi-label classification of Arabic texts. After dataset collection; several architectures were tested for this task. Bidirectional Long Short-Term Memory networks (BiLSTM) showed the superior results with F-score equal 86.6 in development set, and F1-score equal 82.24 in leaderboard Mowjaz competition.","PeriodicalId":421803,"journal":{"name":"2021 12th International Conference on Information and Communication Systems (ICICS)","volume":"40 1","pages":"0"},"PeriodicalIF":0.0000,"publicationDate":"2021-05-24","publicationTypes":"Journal Article","fieldsOfStudy":null,"isOpenAccess":false,"openAccessPdf":"","citationCount":"4","resultStr":null,"platform":"Semanticscholar","paperid":null,"PeriodicalName":"2021 12th International Conference on Information and Communication Systems (ICICS)","FirstCategoryId":"1085","ListUrlMain":"https://doi.org/10.1109/ICICS52457.2021.9464531","RegionNum":0,"RegionCategory":null,"ArticlePicture":[],"TitleCN":null,"AbstractTextCN":null,"PMCID":null,"EPubDate":"","PubModel":"","JCR":"","JCRName":"","Score":null,"Total":0}
引用次数: 4
Abstract
The necessity for automatic classification of some resources has become extremely important given the fast-increasing number of electronic resources. People's opinions have been extracted from social media sites using Artificial Intelligence (AI). Despite this, the majority of current research focuses on extrapolating features from texts. Multi-label textual data classification is a significant problem in terms of the increasing amount of data available and the growing difficulties of assigning each text piece with one label. Examples include news and email articles. This work focuses on multi-label classification of Arabic texts. After dataset collection; several architectures were tested for this task. Bidirectional Long Short-Term Memory networks (BiLSTM) showed the superior results with F-score equal 86.6 in development set, and F1-score equal 82.24 in leaderboard Mowjaz competition.